EST序列
表达序列标签(expressed sequence tags,ESTs)是一组短的、由大量随机取出的 cDNA 库克隆、经从 5' 或 3' 末端一次测序得到的组织或细胞基因组的一段 cDNA 序列, 一般长 60~500 bp 。这一概念最早由 Venter于1991年提出。近年来由此形成的技术路线被广泛应用于基因识别、绘制基因表达图谱、寻找新基因等研究领域,并且取得了显著成效。
基本信息
- 中文名
表达序列标签
- 外文名
expressed sequence tags
- 别称
EST序列
- 缩写
ESTs
简介
At a time when the genomes of many species have been sequenced completely, a fundamental
resource expected by many researchers is a simple list of all of an organism's genes. A gene list,
together with associated physical reagents and electronic information, allows one to begin to
investigate the ways in which many genes interact in the complex system of the organism. However,
many species of medical and agricultural importance have not yet been prioritized for
genomic sequencing, and expressed cDNAs have provided the primary source of gene
sequences. Furthermore, when the genomic sequence of an organism becomes available, a collection
of cDNA sequences provides the best tool for identifying genes within the DNA sequence.
Thus, we can anticipate that the sequencing of transcribed products will remain a significant area
of interest well into the future.
The eara of high-throughput cDNA sequencing was initiated in 1991 by a landmark study from
Venter and his colleagues. The basic strategy involves selecting cDNA clones at random and
performing a single, automated, sequencing read from one or both ends of their inserts. They
introduced the term EST to refer to this new class of sequence, which is characterized by being
short (typically about 400–600 bases) and relatively inaccurate (around 2% error). The use of
single-pass sequencing was an important aspect of making the approach cost effective. In most
cases, there is no initial attempt to identify or characterize the clones. Instead, they are identified
using only the small bit of sequence data obtained, comparing it to the sequences of known genes and other ESTs. It is fully expected that many clones will be redundant with others already