meta data for this page
Differences
This shows you the differences between two versions of the page.
Both sides previous revisionPrevious revisionNext revision | Previous revision | ||
asa:seminar:2025:flash [2025/07/13 16:14] – ingo | asa:seminar:2025:flash [2025/07/16 16:35] (current) – ingo | ||
---|---|---|---|
Line 1: | Line 1: | ||
- | ====== Flash talks 2025 ====== | + | ====== Postersession 2025 ====== |
+ | ===== Abstract Book ===== | ||
+ | <WRAP round box> | ||
+ | {{ : | ||
+ | {{ : | ||
+ | </ | ||
+ | |||
+ | ===== Flash talks 2025 ===== | ||
<WRAP round box> | <WRAP round box> | ||
<hidden Abstract> | <hidden Abstract> | ||
Line 7: | Line 14: | ||
**Quiz:** {{ : | **Quiz:** {{ : | ||
+ | |||
**Team:** Boudouassel, | **Team:** Boudouassel, | ||
+ | {{ : | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | Accurate genome assembly is hampered by repetitive regions. Although long single molecule sequencing reads are better able to resolve genomic repeats than short-read data, most long-read assembly algorithms do not provide the repeat character- ization necessary for producing optimal assemblies. Here, we present Flye, a long-read assembly algorithm that generates arbitrary paths in an unknown repeat graph, called disjointigs, | ||
+ | </ | ||
+ | |||
+ | **Video** -> separate file | ||
+ | |||
+ | **Quiz:** {{ : | ||
+ | |||
+ | **Team:** Sarach, Deng | ||
+ | |||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | Gene prediction has remained an active area of bioinformatics research for a long time. Still, gene prediction in large eukaryotic genomes presents a challenge that must be addressed by new algorithms. The amount and significance of the evidence available from transcriptomes and proteomes vary across genomes, between genes, and even along a single gene. User-friendly and accurate annotation pipelines that can cope with such data heterogeneity are needed. The previously developed annotation pipelines BRAKER1 and BRAKER2 use RNA-seq or protein data, respectively, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Biesecker, Inciler | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | Background Metagenomics is revolutionizing the study of microorganisms and their involvement in biological, biomedical, and geochemical processes, allowing us to investigate by direct sequencing a tremendous diversity of organisms without the need for prior cultivation. Unicellular eukaryotes play essential roles in most microbial communities as chief predators, decomposers, | ||
+ | Results MetaEuk is a toolkit for high-throughput, | ||
+ | Conclusion The open-source (GPLv3) MetaEuk software (https:// | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Quiz:** {{ : | ||
+ | |||
+ | **Team:** Kolos, Tahir | ||
+ | </ | ||
+ | <WRAP round box>A long-read RNA-seq approach to identify novel transcripts of very large genes. Uapinyoying et al. 2020 Genome Res 30: 885-897< | ||
+ | <hidden Abstract> | ||
+ | RNA-seq is widely used for studying gene expression, but commonly used sequencing platforms produce short reads that only span up to two exon junctions per read. This makes it difficult to accurately determine the composition and phasing of exons within transcripts. Although long-read sequencing improves this issue, it is not amenable to precise quantitation, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Le, Fischer | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | The iCLIP and eCLIP techniques facilitate the detection of protein–RNA interaction sites at high resolution, based on diagnostic events at crosslink sites. However, previous methods do not explicitly model the specifics of iCLIP and eCLIP truncation patterns and possible biases. We developed PureCLIP (https:// | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Barie, Dieter | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | New genome assemblies have been arriving at a rapidly increasing pace, thanks to decreases in sequencing costs and improvements in third-generation sequencing technologies1, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Bernshausen, | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | Proteins are essential to life, and understanding their structure can facilitate a mechanistic understanding of their function. Through an enormous experimental effort1, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Alkanat, Paraparan | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | Genomic analyses are sensitive to contamination in public databases caused by incorrectly labeled reference sequences. Here, we describe Conterminator, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Berger, Voss | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | Proteins are key to all cellular processes and their structure is important in understanding their function and evolution. Sequence-based predictions of protein structures have increased in accuracy1, and over 214 million predicted structures are available in the AlphaFold database2. However, studying protein structures at this scale requires highly efficient methods. Here, we developed a structural-alignment-based clustering algorithm—Foldseek cluster—that can cluster hundreds of millions of structures. Using this method, we have clustered all of the structures in the AlphaFold database, identifying 2.30 million non-singleton structural clusters, of which 31% lack annotations representing probable previously undescribed structures. Clusters without annotation tend to have few representatives covering only 4% of all proteins in the AlphaFold database. Evolutionary analysis suggests that most clusters are ancient in origin but 4% seem to be species specific, representing lower-quality predictions or examples of de novo gene birth. We also show how structural comparisons can be used to predict domain families and their relationships, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Batman, Zeng | ||
+ | </ | ||
+ | <WRAP round box>Fast and sensitive taxonomic assignment to metagenomic contigs. Mirdita et al. Bioinformatics 37(18): | ||
+ | <hidden Abstract> | ||
+ | **Summary** | ||
+ | MMseqs2 taxonomy is a new tool to assign taxonomic labels to metagenomic contigs. It extracts all possible protein fragments from each contig, quickly retains those that can contribute to taxonomic annotation, assigns them with robust labels and determines the contig’s taxonomic identity by weighted voting. Its fragment extraction step is suitable for the analysis of all domains of life. MMseqs2 taxonomy is 2–18× faster than state-of-the-art tools and also contains new modules for creating and manipulating taxonomic reference databases as well as reporting and visualizing taxonomic assignments. | ||
+ | **Availability and implementation** | ||
+ | MMseqs2 taxonomy is part of the MMseqs2 free open-source software package available for Linux, macOS and Windows at https:// | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Quiz:** {{ : | ||
+ | **Team:** Ashirov, Qin | ||
+ | </ | ||
+ | <WRAP round box> | ||
+ | <hidden Abstract> | ||
+ | The vast majority of missense variants observed in the human genome are of unknown clinical significance. We present AlphaMissense, | ||
+ | </ | ||
+ | {{ : | ||
+ | |||
+ | **Team:** Li, Sahin | ||
</ | </ |