今天我們?yōu)榇蠹彝扑]一個(gè)新的arxiv家族的成員——jobrvix。這是一個(gè)專門為學(xué)術(shù)界找工作設(shè)計(jì)的arxiv: https://jobrxiv.org/
按照網(wǎng)站介紹,jobrxiv是在疫情之下應(yīng)運(yùn)而生的網(wǎng)站,初衷在于讓招聘市場(chǎng)更公平:每個(gè)要找工作的人可以接觸到每則招聘廣告,而所以實(shí)驗(yàn)室可以找到合適的候選人(To make recruitment fairer: with jobRxiv, every candidate has access to every job, and all labs can find the best candidates, whatever their budget)。
之所以冠以rxiv的名字,是因?yàn)橥渌A(yù)印本(preprint)網(wǎng)站一樣,jobrxiv也是本著開(kāi)放和免費(fèi)的宗旨。盡管以上原則被以預(yù)印本的形式廣泛地應(yīng)用于學(xué)術(shù)界出版中,在學(xué)術(shù)招聘方面這一方式的網(wǎng)站大概還不多見(jiàn)。所謂免費(fèi)就是:作為雇主,你可以免費(fèi)投廣告,作為雇員,可以瀏覽任何投放的廣告貨比三家。當(dāng)然,若是雇主想一次性投放多則廣告(打包),那么你需要支付一定數(shù)目的費(fèi)用,用以支持網(wǎng)站的維持。需要留意,盡管名字都帶arxiv,jobrxiv與biorxiv,medrxiv,以及preprint鼻祖arxiv都無(wú)任何實(shí)質(zhì)上的聯(lián)系。
網(wǎng)站主頁(yè)十分簡(jiǎn)明,就是一則則的招聘廣告。右側(cè)是檢索欄,幫助用戶迅速縮小范圍。招聘的類目上看,還是非常豐富的,從博士生到博士后,從大學(xué)教授到雜志編輯,還有項(xiàng)目主管、系主任等等等等。
小編試著輸入China,可惜只找到了四則廣告,而若是輸入Canada,彈出了33條招聘廣告,由此可見(jiàn)jobrxiv在國(guó)內(nèi)的影響力還很不足。在這里小編呼吁各位實(shí)驗(yàn)室老板,好好利用這個(gè)免費(fèi)的平臺(tái)。
若想關(guān)注Jobrxiv的招聘信息,你既可以通過(guò)直接登錄其網(wǎng)站,也可以通過(guò)社交媒體獲取最新信息。
既然說(shuō)到了招聘,小編順便為大家推薦一個(gè)在社交媒體上看到的最新招聘信息:來(lái)自法國(guó)巴黎薩克雷大學(xué)(Université Paris-Saclay)的40 個(gè) fully funded phd positions,涵蓋植物科學(xué)、食品科學(xué)、基因組學(xué)等多個(gè)領(lǐng)域。全球的申請(qǐng)季快結(jié)束了,疫情之下,想出國(guó)深造的朋友要抓緊時(shí)間了(鏈接見(jiàn)文末)。
說(shuō)完了jobrxiv,下面為大家?guī)?lái)biorxiv的上月好文速覽。
【修飾】CRISPR基因編輯先驅(qū)、諾獎(jiǎng)得主Doudna新文:組蛋白修飾對(duì)邊際效率的提升(看行文格式是要奔pnas去了?)
Decorating chromatin for enhanced genome editing using CRISPR-Cas9
CRISPR-associated (Cas) enzymes have revolutionized biology by enabling RNA-guided genome editing. Homology-directed repair (HDR) in the presence of donor templates is currently the most versatile way to introduce precise edits following CRISPR-Cas-induced double-stranded DNA cuts, but HDR efficiency is generally low relative to end-joining pathways that lead to insertions and deletions (indels). We tested the hypothesis that HDR could be increased using a Cas9 construct fused to PRDM9, a chromatin remodeling factor that deposits histone methylations H3K4me3 and H3K36me3 shown to mediate homologous recombination in human cells. Our results show that the fusion protein contacts chromatin specifically at the Cas9 cut site in DNA to double the observed HDR efficiency and increase the HDR:indel ratio by 3-fold compared to that induced by Cas9 alone. HDR enhancement occurred in multiple cell lines with no increase in off-target genome editing. These findings underscore the importance of chromatin structure for the choice of DNA repair pathway during CRISPR-Cas genome editing and provide a new strategy to increase the efficiency of HDR. Significance Statement CRISPR-Cas-mediated homology-directed repair (HDR) enables precision genome editing for diverse research and clinical applications, but HDR efficiency is often low due to competing end-joining pathways. Here, we describe a simple strategy to influence DNA repair pathway choice and improve HDR efficiency by engineering CRISPR-Cas9-methyltransferase fusion proteins. This strategy highlights the impact of histone modifications on DNA repair following CRISPR-Cas-induced double-stranded breaks and adds to the CRISPR genome editing toolbox.
【結(jié)構(gòu)】芝加哥大學(xué):深度學(xué)習(xí)助力蛋白質(zhì)側(cè)鏈包裝預(yù)測(cè)
An end-to-end deep learning method for rotamer-free protein side-chain packing
Protein side-chain packing (PSCP), the task of determining amino acid side-chain conformations, has important applications to protein structure prediction, refinement, and design. Many methods have been proposed to resolve this problem, but their accuracy is still unsatisfactory. To address this, we present AttnPacker, an end-to-end, SE(3)-equivariant deep graph transformer architecture for the direct prediction of side-chain coordinates. Unlike existing methods, AttnPacker directly incorporates backbone geometry to simultaneously compute all amino acid side-chain atom coordinates without delegating to a rotamer library, or performing expensive conformational search or sampling steps. Tested on the CASP13 and CASP14 native and non-native protein backbones, AttnPacker predicts side-chain conformations with RMSD significantly lower than the best side-chain packing methods (SCWRL4, FASPR, Rosetta Packer, and DLPacker), and achieves even greater improvements on surface residues. In addition to RMSD, our method also achieves top performance in side-chain dihedral prediction across both data sets.
【提速】密歇根大學(xué)Narayanasamy實(shí)驗(yàn)室:第三代測(cè)序比對(duì)流行工具minimap2再提速!
Accelerating Minimap2 for accurate long read alignment on GPUs
We extract better intra-read parallelism from chaining without loosing mapping accuracy by forward transforming Minimap2’s chaining algorithm. Further, we utilize the high memory available on modern cloud instances for better performance on the GPU by converting a sparse vector which defines the chaining workload to a dense one in order to optimize for better arithmetic intensity (more operations per byte of data fetched from high-latency global memory) on the GPU. We also optimize for better workload balancing, data locality and minimal branch divergence on the GPU. We show mm2-ax on an NVIDIA A100 GPU improves the chaining step with 12.6 - 5X speedup and 9.44 - 3.77X speedup : costup over the fastest version of Minimap2, mm2-fast, benchmarked on a single Google Cloud Platform instance of 30 SIMD cores.
【可視】廣州數(shù)智生物:組學(xué)分析的可視化在線工具GraphBio
GraphBio: a shiny web app to easily perform popular visualization analysis for omics data
Here, we present GraphBio, a shiny web app to easily perform visualization analysis for omics data. GraphBio provides 15 popular visualization analysis methods, including heatmap, volcano plots, MA plots, network plots, dot plots, chord plots, pie plots, four quadrant diagrams, venn diagrams, cumulative distribution curves, PCA, survival analysis, ROC analysis, correlation analysis and text cluster analysis. It enables experimental biologists without programming skills to easily perform popular visualization analysis and get publication-ready figures.
【綜述】深度學(xué)習(xí)在空間轉(zhuǎn)錄組學(xué)里的應(yīng)用
Deep Learning in Spatial Transcriptomics: Learning From the Next Next-Generation Sequencing
Spatial transcriptomics (ST) technologies are rapidly becoming the extension of single-cell RNA sequencing (scRNAseq), holding the potential of profiling gene expression at a single-cell resolution while maintaining cellular compositions within a tissue. Having both expression profiles and tissue organization enables researchers to better understand cellular interactions and heterogeneity, providing insight into complex biological processes that would not be possible with traditional sequencing technologies. The data generated by ST technologies are inherently noisy, high-dimensional, sparse, and multi-modal (including histological images, count matrices, etc.), thus requiring specialized computational tools for accurate and robust analysis. However, many ST studies currently utilize traditional scRNAseq tools, which are inadequate for analyzing complex ST datasets. On the other hand, many of the existing ST-specific methods are built upon traditional statistical or machine learning frameworks, which have shown to be sub-optimal in many applications due to the scale, multi-modality, and limitations of spatially-resolved data (such as spatial resolution, sensitivity and gene coverage). Given these intricacies, researchers have developed deep learning (DL)-based models to alleviate ST-specific challenges. These methods include new state-of-the-art models in alignment, spatial reconstruction, and spatial clustering among others. However, deep-learning models for ST analysis are nascent and remain largely underexplored. In this review, we provide an overview of existing state-of-the-art tools for analyzing spatially-resolved transcriptomics, while delving deeper into the DL-based approaches. We discuss the new frontiers and the open questions in this field and highlight the domains in which we anticipate transformational DL applications.
【移動(dòng)】巴黎薩克雷大學(xué)(Université Paris-Saclay):植物病原真菌灰霉菌基因組中的移動(dòng)元件
Botrytis cinerea strains infecting grapevine and tomato display contrasted repertoires of accessory chromosomes, transposons and small RNAs
The fungus Botrytis cinerea is a polyphagous pathogen that encompasses multiple host-specialized lineages. While several secreted proteins, secondary metabolites and retrotransposons-derived small RNAs have been characterized as virulence factors, their role in host specialization remain unknown. The aim of this study was to identify the genomic correlates of host-specialization in populations of B. cinerea associated with grapevine and tomato. Using PacBio sequencing, we produced complete assemblies of the genomes of strains Sl3 and Vv3 that represent the French populations T and G1 of B. cinerea, specialized on tomato and grapevine, respectively. Both assemblies revealed 16 core chromosomes that were highly syntenic with chromosomes of the reference strain B05.10. The main sources of variation in gene content were the subtelomeric regions and the accessory chromosomes, especially the chromosome BCIN19 of Vv3 that was absent in Sl3 and B05.10. The repertoires and density of transposable elements were clearly different between the genomes of Sl3 and Vv3 with a larger number of subfamilies (26) and a greater genome coverage in Vv3 (7.7%) than in Sl3 (14 subfamilies, 4.5% coverage). An Helitron-like element was found in almost all subtelomeric regions of the Vv3 genome, in particular in the flanking regions of a highly duplicated gene encoding a Telomere-Linked Helicase, while both features were absent from the Sl3 and B05.10 genomes. Different retrotransposons in the Sl3 and the Vv3 strains resulted in the synthesis of distinct sets of small RNAs. Finally, extending the study to additional strains indicated that the accessory chromosome BCIN19 and the small RNAs producing retrotransposons Copia_4 and Gypsy_7 are common features of the G1 population that are scarcely if ever found in strains isolated from other populations. This research reveals that accessory chromosomes, repertoires of transposons and their derived small RNAs differ between populations of B. cinerea specialized on different hosts. The genomic data characterized in our study pave the way for further studies aiming at investigating the molecular mechanisms underpinning host specialization in a polyphagous pathogen.
【無(wú)足】蚓螈(一種爬行動(dòng)物)基因組揭示無(wú)足動(dòng)物趨同演化的機(jī)制
Caecilian genomes reveal molecular basis of adaptation and convergent evolution of limblessness in snakes and caecilians
We present genome sequences for Geotrypetes seraphini (3.8Gb) and Microcaecilia unicolor (4.7Gb) caecilians, a limbless, mostly soil-dwelling amphibian clade with reduced eyes, and unique putatively chemosensory tentacles. We identify signatures of positive selection unique to caecilians in 1,150 orthogroups, with enrichment of functions for olfaction and detection of chemical signals. All our caecilian genomes are missing the ZRS enhancer of Sonic Hedgehog, shown by in vivo deletions to be required for limb development in mice and also absent in snakes, thus revealing a shared molecular target implicated in the independent evolution of limblessness in snakes and caecilians.
【輻射】美國(guó)杜克大學(xué):多組學(xué)研究為海膽發(fā)育的分子機(jī)理提供線索
Recent reconfiguration of an ancient developmental gene regulatory network in Heliocidaris sea urchins
Changes in developmental gene regulatory networks (dGRNs) underlie much of the diversity of life1, but the evolutionary mechanisms that operate on interactions with these networks remain poorly understood. Closely related species with extreme phenotypic divergence provide a valuable window into the genetic and molecular basis for changes in dGRNs and their relationship to adaptive changes in organismal traits. Here we analyze genomes, epigenomes, and transcriptomes during early development in two sea urchin species in the genus Heliocidaris that exhibit highly divergent life histories and in an outgroup species. Signatures of positive selection and changes in chromatin status within putative gene regulatory elements are both enriched on the branch leading to the derived life history, and particularly so near core dGRN genes; in contrast, positive selection within protein-coding regions have at most a modest enrichment in branch and function. Single-cell transcriptomes reveal a dramatic delay in cell fate specification in the derived state, which also has far fewer open chromatin regions, especially near dGRN genes with conserved roles in cell fate specification. Experimentally perturbing the function of three key transcription factors reveals profound evolutionary changes in the earliest events that pattern the embryo, disrupting regulatory interactions previously conserved for ~225 million years. Together, these results demonstrate that natural selection can rapidly reshape developmental gene expression on a broad scale when selective regimes abruptly change and that even highly conserved dGRNs and patterning mechanisms in the early embryo remain evolvable under appropriate ecological circumstances.
【tRNA】一個(gè)全新植物tRNA數(shù)據(jù)庫(kù)
PtRNAdb: A web resource of Plant tRNA genes from a wide range of plant species
tRNA, as well as their derived products such as short interspersed nuclear elements (SINEs), pseudogenes and transfer-RNA, derived fragments (tRFs) has now been shown to be vital for cellular life, functioning and adaptation during different stress conditions in all diverse life forms. In this study, we have developed PtRNAdb (www.nipgr.ac.in/PtRNAdb), a plant exclusive tRNA database containing 113849 tRNA gene sequences from phylogenetically diverse plant species. We have analysed a total of 106 nuclear, 89 plastidial and 38 mitochondrial genomes of plants by tRNAscan-SE software package, and after careful curation of the output data, we developed this database and integrated the data. The information about the tRNA gene sequences obtained, were further enriched with consensus sequence based study of tRNA genes based on their isoacceptors and isodecoders. We have also built covariance models based on the isoacceptors and isodecoders of all the tRNA sequences using infernal tool. The user can also perform BLAST not only against PtRNAdb entries but also against all the tRNA sequences stored in PlantRNA databases; and annotated tRNA genes across the plant kingdom available at NCBI. For the users’ ease, we have also incorporated the tRNAscan-SE tool for tRNA gene prediction, and ViennaRNA package for structural analysis on the home page of PtRNAdb. This resource is believed to be of high utility for plant researchers as well as molecular biologists to carry out further exploration of plant tRNAome on a wider spectrum, as well as for performing comparative and evolutionary studies related to tRNAs and their derivatives across all domains of life.
【檢驗(yàn)】加拿大不列顛哥倫比亞大學(xué)(University of British Columbia):一種新的統(tǒng)計(jì)學(xué)方法PicMin用以發(fā)現(xiàn)對(duì)環(huán)境適應(yīng)的基因
Using genome scans to identify genes used repeatedly for adaptation
Adaptation occurring in similar genes or genomic regions in distinct lineages provides evolutionary biologists with a glimpse at the fundamental opportunities for and constraints to diversification. With the widespread availability of high throughput sequencing technologies and the development of population genetic methods to identify the genetic basis of adaptation, studies have begun to compare the evidence for adaptation at the molecular level among distinct lineages. However, methods to study repeated adaptation are often oriented towards genome-wide testing to identify a set of genes with signatures of repeated use, rather than evaluating the significance at the level of an individual gene. In this study, we propose PicMin, a novel statistical method derived from the theory of order statistics that can test for repeated molecular evolution to estimate significance at the level of an individual gene, using the results of genome scans. This method is generalizable to any number of lineages and indeed, statistical power to detect repeated adaptation increases with the number of lineages that have signals of repeated adaptation of a given gene in multiple lineages. An implementation of the method written for R can be downloaded from https://github.com/TBooker/PicMin.
開(kāi)頭提及的巴黎薩克雷大學(xué)的40個(gè)全獎(jiǎng)phd招聘廣告: