An Easy-to-Follow Pipeline for Long Noncoding RNA Identification: A Case Study in Diploid Strawberry Fragaria vesca.

An Easy-to-Follow Pipeline for Long Noncoding RNA Identification: A Case Study in Diploid Strawberry Fragaria vesca. Methods Mol Biol. 2019;1933:223-243 Authors: Kang C, Liu Z Abstract Long noncoding RNAs (lncRNAs), defined as transcripts longer than 200 nucleotides without coding potential, are a new class of regulatory molecules with roles in diverse biological processes. New lncRNAs can readily be identified by mining RNA-seq data from a wide range of plant species. However, challenges remain as to how one can distinguish functional lncRNAs from mRNAs coding for small peptides or products of pseudogenes without any function. In this chapter, stepwise instruction is provided using RNA-seq datasets of developing wild strawberry fruit to illustrate each step. The workflow can be divided into three parts. Part I concerns standard RNA-seq data processing and analysis; part II describes lncRNA identification; part III describes several approaches aimed at shedding lights on lncRNA function. The description is intended for beginners with easy-to-follow steps. Text boxes provide codes and explanations. While it is relatively easy to identify lncRNAs, it is difficult to infer their function in the absence of coding information. Multiple RNA-seq libraries across tissues and stages are useful resources for deducing possible function of lncRNAs based on their expression and co-regulation. PMID: 30945188 [PubMed - indexed for MEDLINE]
Source: Mol Biol Cell - Category: Molecular Biology Authors: Tags: Methods Mol Biol Source Type: research