304:
When a specific nucleotide is added, if the DNA polymerase incorporates it in the growing chain, the pyrophosphate is released and converted into ATP by ATP sulfurylase. ATP powers the oxidation of luciferase through the luciferase; this reaction generates a light signal recorded as a pyrogram peak. In this way, the nucleotide incorporation is correlated to a signal. The light signal is proportional to the amount of nucleotides incorporated during the synthesis of the DNA strand (i.e. two nucleotides incorporated correspond to two pyrogram peaks). When the added nucleotides aren't incorporated in the DNA molecule, no signal is recorded; the enzyme apyrase removes any unincorporated nucleotide remaining in the reaction. This method requires neither fluorescently-labelled nucleotides nor gel electrophoresis. Pyrosequencing, which was developed by Pål Nyrén and
Mostafa Ronaghi DNA, has been commercialized by Biotage (for low-throughput sequencing) and 454 Life Sciences (for high-throughput sequencing). The latter platform sequences roughly 100
289:. This method is easier and quicker than the dye primer approach, but may produce more uneven data peaks (different heights), due to a template dependent difference in the incorporation of the large dye chain-terminators. This problem has been significantly reduced with the introduction of new enzymes and dyes that minimize incorporation variability. This method is now used for the vast majority of sequencing reactions as it is both simpler and cheaper. The major reason for this is that the primers do not have to be separately labelled (which can be a significant expense for a single-use custom primer), although this is less of a concern with frequently used 'universal' primers. This is changing rapidly due to the increasing cost-effectiveness of second- and third-generation systems from Illumina, 454, ABI, Helicos, and Dover.
277:
273:
the four dideoxyribonucleotides; the incorporation of the chain terminating nucleotides by the DNA polymerase in a random position results in a series of related DNA fragments, of different sizes, that terminate with a given dideoxiribonucleotide. The fragments are then size-separated by electrophoresis in a slab polyacrylamide gel, or more commonly now, in a narrow glass tube (capillary) filled with a viscous polymer.
1308:
55:
237:, and is named after author Rob Carlson. Carlson accurately predicted the doubling time of DNA sequencing technologies (measured by cost and performance) would be at least as fast as Moore's law. Carlson curves illustrate the rapid (in some cases hyperexponential) decreases in cost, and increases in performance, of a variety of technologies, including DNA sequencing,
1320:
284:
An alternative to the labelling of the primer is to label the terminators instead, commonly called 'dye terminator sequencing'. The major advantage of this approach is the complete sequencing set can be performed in a single reaction, rather than the four needed with the labeled-primer approach. This
272:
deoxynucleotide). The deoxynucleotides lack in the OH group both at the 2' and at the 3' position of the ribose molecule, therefore once they are inserted within a DNA molecule they prevent it from being further elongated. In this sequencer four different vessels are employed, each containing only of
217:
The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in fundamental research into why and how organisms live, as well as in applied subjects. Because of the key importance DNA has to living things, knowledge of
209:
are gaining an increasing share of the sequencing market. More genome data are now being produced by pyrosequencing than Sanger DNA sequencing. Pyrosequencing has enabled rapid genome sequencing. Bacterial genomes can be sequenced in a single run with several times coverage with this technique. This
303:
The pyrosequencing method is based on the detection of the pyrophosphate release on nucleotide incorporation. Before performing pyrosequencing, the DNA strand to sequence has to be amplified by PCR. Then the order in which the nucleotides have to be added in the sequencer is chosen (i.e. G-A-T-C).
324:. Addition of one (or more) nucleotide(s) results in a reaction that generates a light signal that is recorded by the CCD camera in the instrument. The signal strength is proportional to the number of nucleotides, for example, homopolymer stretches, incorporated in a single nucleotide flow.
1163:
507:. In many cases the assembly is not uniquely specified; depending on which enzyme acts, one of several different units may be incorporated. This can lead to a family of similar molecules being formed. This is particularly true for plant polysaccharides. Methods for the
502:
in different ways. However, the main theoretical reason is that whereas the other polymers listed here are primarily generated in a 'template-dependent' manner by one processive enzyme, each individual join in a polysaccharide may be formed by a different
263:
In chain terminator sequencing (Sanger sequencing), extension is initiated at a specific site on the template DNA by using a short oligonucleotide 'primer' complementary to the template at that region. The oligonucleotide primer is extended using a
268:, an enzyme that replicates DNA. Included with the primer and DNA polymerase are the four deoxynucleotide bases (DNA building blocks), along with a low concentration of a chain terminating nucleotide (most commonly a
218:
DNA sequences is useful in practically any area of biological research. For example, in medicine it can be used to identify, diagnose, and potentially develop treatments for genetic diseases. Similarly, research into
418:
are excised. This gives a certain complexity to map the read sequences back to the genome and thereby identify their origin. For more information on the capabilities of next-generation sequencing applied to whole
1108:
494:
are also biopolymers, it is not so common to talk of 'sequencing' a polysaccharide, for several reasons. Although many polysaccharides are linear, many have branches. Many different units (individual
1072:
1011:
906:
339:
Whereas the methods above describe various sequencing methods, separate related terms are used when a large portion of a genome is sequenced. Several platforms were developed to perform
1090:
308:
in a seven-hour run with a single machine. In the array-based method (commercialized by 454 Life
Sciences), single-stranded DNA is annealed to beads and amplified via
1102:
205:. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as
1169:
1116:
1017:
1145:
1180:
474:
If the gene encoding the protein is known, it is currently much easier to sequence the DNA and infer the protein sequence. Determining part of a protein's
1157:
585:
Wheeler, David A.; Srinivasan, Maithreyan; Egholm, Michael; Shen, Yufeng; Chen, Lei; McGuire, Amy; He, Wen; Chen, Yi-Ju; Makhijani, Vinod (2008-04-17).
1151:
738:
374:
the RNA extracted from the sample to generate cDNA fragments. This can then be sequenced as described above. The bulk of RNA expressed in cells are
1139:
916:
406:
therefore indicates cellular activity, particularly desired in the studies of diseases, cellular behaviour, responses to reagents or stimuli.
285:
is accomplished by labelling each of the dideoxynucleotide chain-terminators with a separate fluorescent dye, which fluoresces at a different
1096:
994:
852:
650:
Carlson, Robert H. Biology Is
Technology: The Promise, Peril, and New Business of Engineering Life. Cambridge, MA: Harvard UP, 2010. Print
320:. When free nucleotides are washed over this chip, light is produced as ATP is generated when nucleotides join with their complementary
1122:
366:
molecules. While sequencing DNA gives a genetic profile of an organism, sequencing RNA reflects only the sequences that are actively
1174:
1078:
1045:
1028:
988:
138:
256:
976:
964:
806:
1023:
76:
30:
This article is about the genetics definition of "sequencing". For the sense of "sequencing" used in electronic music, see
1346:
943:
860:
828:
119:
1273:
792:
731:
91:
901:
824:
766:
458:
72:
241:, and a range of physical and computational tools used in protein expression and in determining protein structures.
1351:
881:
39:
98:
65:
1324:
1051:
1005:
43:
1034:
359:
198:
1312:
1278:
921:
774:
724:
508:
344:
105:
1242:
982:
770:
541:
524:
358:
is less stable in the cell, and also more prone to nuclease attack experimentally. As RNA is generated by
317:
382:, detrimental for cellular translation, but often not the focus of a study. This fraction can be removed
1263:
876:
371:
87:
468:
362:
from DNA, the information is already present in the cell's DNA. However, it is sometimes desirable to
1247:
598:
1237:
1164:
International
Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics
1056:
844:
561:
428:
276:
1293:
1197:
1000:
440:
403:
325:
1288:
1227:
802:
677:
624:
616:
463:
453:
309:
250:
164:
35:
1268:
1232:
1040:
669:
606:
536:
479:
340:
202:
959:
512:
367:
211:
175:
which succinctly summarizes much of the atomic-level structure of the sequenced molecule.
31:
602:
112:
747:
566:
516:
495:
491:
298:
265:
234:
206:
184:
971:
Microsoft
Research - University of Trento Centre for Computational and Systems Biology
226:
is a burgeoning discipline, with the potential for many useful products and services.
1340:
1222:
818:
551:
499:
478:
sequence (often one end) by one of the above methods may be sufficient to identify a
420:
375:
312:. These DNA-bound beads are then placed into wells on a fiber-optic chip along with
238:
223:
1212:
1207:
1202:
546:
156:
27:
In genetics and biochemistry, determining the structure of an unbranched biopolymer
660:
Carlson, Robert (2003). "The Pace and
Proliferation of Biological Technologies".
395:
54:
673:
1217:
836:
798:
475:
286:
190:
168:
711:
620:
411:
407:
379:
321:
219:
681:
628:
587:"The complete genome of an individual by massively parallel DNA sequencing"
911:
848:
840:
810:
305:
152:
17:
662:
Biosecurity and
Bioterrorism: Biodefense Strategy, Practice, and Science
611:
586:
386:, however, to enrich for the messenger RNA, also included, that usually
886:
832:
788:
784:
780:
762:
556:
446:
424:
399:
363:
313:
255:
1084:
938:
891:
856:
504:
415:
280:
View of the start of an example dye-terminator read (click to expand)
167:(sometimes incorrectly called the primary sequence) of an unbranched
197:
fragment. So far, most DNA sequencing has been performed using the
970:
896:
343:(a subset of all DNA across all chromosomes that encode genes) or
275:
254:
716:
814:
391:
171:. Sequencing results in a symbolic linear depiction known as a
720:
520:
355:
194:
48:
38:. For other uses of the terms "sequencer" and "sequence", see
1073:
African
Society for Bioinformatics and Computational Biology
695:
370:
in the cells. To sequence RNA, the usual method is first to
1012:
Max Planck
Institute of Molecular Cell Biology and Genetics
696:"A practical guide to structural analysis of carbohydrates"
1091:
International
Nucleotide Sequence Database Collaboration
1256:
1190:
1132:
1065:
952:
930:
869:
754:
79:. Unsourced material may be challenged and removed.
783:, database of protein sequences grouping together
210:technique was also used to sequence the genome of
34:. For sequence learning in cognitive science, see
189:DNA sequencing is the process of determining the
1018:US National Center for Biotechnology Information
402:that support particular cellular functions. The
347:(sequencing of the all nuclear DNA of a human).
222:may lead to treatments for contagious diseases.
1103:International Society for Computational Biology
259:Part of a radioactively labelled sequencing gel
233:to describe the biotechnological equivalent of
1170:ISCB Africa ASBCB Conference on Bioinformatics
1117:Institute of Genomics and Integrative Biology
732:
8:
1146:European Conference on Computational Biology
1181:Research in Computational Molecular Biology
1158:International Conference on Bioinformatics
739:
725:
717:
712:https://www.nature.com/subjects/sequencing
641:Life 2.0. (2006, August 31). The Economist
1152:Intelligent Systems for Molecular Biology
610:
139:Learn how and when to remove this message
1140:Basel Computational Biology Conference
577:
316:which produce light in the presence of
229:The Carlson curve is a term coined by
1097:International Society for Biocuration
995:European Molecular Biology Laboratory
7:
1319:
77:adding citations to reliable sources
1123:Japanese Society for Bioinformatics
1085:European Molecular Biology network
410:RNA molecules are not necessarily
25:
1175:Pacific Symposium on Biocomputing
1079:Australia Bioinformatics Resource
1046:Swiss Institute of Bioinformatics
1029:Netherlands Bioinformatics Centre
989:European Bioinformatics Institute
1318:
1307:
1306:
977:Database Center for Life Science
965:Computational Biology Department
853:Arabidopsis Information Resource
53:
823:Specialised genomic databases:
330:True single molecule sequencing
64:needs additional citations for
1024:Japanese Institute of Genetics
390:of interest. Derived from the
1:
944:Rosalind (education platform)
861:Zebrafish Information Network
829:Saccharomyces Genome Database
1274:List of biological databases
793:Protein Information Resource
414:with their DNA template, as
394:these mRNAs are to be later
767:European Nucleotide Archive
459:Peptide mass fingerprinting
1368:
674:10.1089/153871303769201851
438:
296:
248:
182:
40:Sequencer (disambiguation)
29:
1302:
1052:Wellcome Sanger Institute
1006:J. Craig Venter Institute
486:Polysaccharide sequencing
44:Sequence (disambiguation)
1035:Philippine Genome Center
199:chain termination method
1279:Molecular phylogenetics
775:China National GeneBank
509:structure determination
445:Methods for performing
345:whole genome sequencing
163:means to determine the
983:DNA Data Bank of Japan
771:DNA Data Bank of Japan
542:Full genome sequencing
335:Large-scale sequencing
281:
260:
1264:Computational biology
779:Secondary databases:
279:
258:
1347:Biochemistry methods
761:Sequence databases:
525:methylation analysis
482:carrying this gene.
449:sequencing include:
73:improve this article
1057:Whitehead Institute
845:Rat Genome Database
612:10.1038/nature06884
603:2008Natur.452..872W
562:MicroRNA sequencing
498:) can be used, and
429:MicroRNA Sequencing
1294:Sequence alignment
1001:Flatiron Institute
441:protein sequencing
435:Protein sequencing
404:expression profile
372:reverse transcribe
282:
261:
1352:Molecular biology
1334:
1333:
1289:Sequence database
803:Protein Data Bank
797:Other databases:
597:(7189): 872–876.
523:spectroscopy and
464:Mass spectrometry
454:Edman degradation
251:Sanger sequencing
245:Sanger sequencing
193:order of a given
165:primary structure
149:
148:
141:
123:
36:sequence learning
16:(Redirected from
1359:
1322:
1321:
1310:
1309:
1269:List of biobanks
1233:Stockholm format
1041:Scripps Research
741:
734:
727:
718:
700:
699:
692:
686:
685:
657:
651:
648:
642:
639:
633:
632:
614:
582:
537:Exome sequencing
513:oligosaccharides
469:Protease digests
341:exome sequencing
203:Frederick Sanger
144:
137:
133:
130:
124:
122:
81:
57:
49:
21:
1367:
1366:
1362:
1361:
1360:
1358:
1357:
1356:
1337:
1336:
1335:
1330:
1298:
1252:
1186:
1128:
1109:Student Council
1061:
960:Broad Institute
948:
926:
865:
750:
745:
708:
703:
694:
693:
689:
659:
658:
654:
649:
645:
640:
636:
584:
583:
579:
575:
533:
517:polysaccharides
496:monosaccharides
492:polysaccharides
488:
443:
437:
353:
337:
332:
301:
295:
253:
247:
187:
181:
145:
134:
128:
125:
82:
80:
70:
58:
47:
32:music sequencer
28:
23:
22:
15:
12:
11:
5:
1365:
1363:
1355:
1354:
1349:
1339:
1338:
1332:
1331:
1329:
1328:
1316:
1303:
1300:
1299:
1297:
1296:
1291:
1286:
1281:
1276:
1271:
1266:
1260:
1258:
1257:Related topics
1254:
1253:
1251:
1250:
1245:
1240:
1235:
1230:
1225:
1220:
1215:
1210:
1205:
1200:
1194:
1192:
1188:
1187:
1185:
1184:
1178:
1172:
1167:
1161:
1155:
1149:
1143:
1136:
1134:
1130:
1129:
1127:
1126:
1120:
1114:
1113:
1112:
1100:
1094:
1088:
1082:
1076:
1069:
1067:
1063:
1062:
1060:
1059:
1054:
1049:
1043:
1038:
1032:
1026:
1021:
1015:
1009:
1003:
998:
992:
986:
980:
974:
968:
962:
956:
954:
950:
949:
947:
946:
941:
934:
932:
928:
927:
925:
924:
919:
914:
909:
904:
899:
894:
889:
884:
879:
873:
871:
867:
866:
864:
863:
821:
795:
777:
758:
756:
752:
751:
748:Bioinformatics
746:
744:
743:
736:
729:
721:
715:
714:
707:
704:
702:
701:
687:
668:(3): 203–214.
652:
643:
634:
576:
574:
571:
570:
569:
567:Sequence motif
564:
559:
554:
549:
544:
539:
532:
529:
487:
484:
472:
471:
466:
461:
456:
439:Main article:
436:
433:
421:transcriptomes
376:ribosomal RNAs
352:
351:RNA sequencing
349:
336:
333:
331:
328:
299:Pyrosequencing
297:Main article:
294:
293:Pyrosequencing
291:
266:DNA polymerase
249:Main article:
246:
243:
207:pyrosequencing
185:DNA sequencing
183:Main article:
180:
179:DNA sequencing
177:
147:
146:
61:
59:
52:
26:
24:
14:
13:
10:
9:
6:
4:
3:
2:
1364:
1353:
1350:
1348:
1345:
1344:
1342:
1327:
1326:
1317:
1315:
1314:
1305:
1304:
1301:
1295:
1292:
1290:
1287:
1285:
1282:
1280:
1277:
1275:
1272:
1270:
1267:
1265:
1262:
1261:
1259:
1255:
1249:
1246:
1244:
1241:
1239:
1236:
1234:
1231:
1229:
1226:
1224:
1223:Pileup format
1221:
1219:
1216:
1214:
1211:
1209:
1206:
1204:
1201:
1199:
1196:
1195:
1193:
1189:
1182:
1179:
1176:
1173:
1171:
1168:
1165:
1162:
1159:
1156:
1153:
1150:
1147:
1144:
1141:
1138:
1137:
1135:
1131:
1124:
1121:
1118:
1115:
1110:
1107:
1106:
1104:
1101:
1098:
1095:
1092:
1089:
1086:
1083:
1080:
1077:
1074:
1071:
1070:
1068:
1066:Organizations
1064:
1058:
1055:
1053:
1050:
1047:
1044:
1042:
1039:
1036:
1033:
1030:
1027:
1025:
1022:
1019:
1016:
1013:
1010:
1007:
1004:
1002:
999:
996:
993:
990:
987:
984:
981:
978:
975:
972:
969:
966:
963:
961:
958:
957:
955:
951:
945:
942:
940:
936:
935:
933:
929:
923:
920:
918:
915:
913:
910:
908:
905:
903:
900:
898:
895:
893:
890:
888:
885:
883:
880:
878:
875:
874:
872:
868:
862:
858:
854:
850:
846:
842:
838:
834:
830:
826:
822:
820:
819:Gene Ontology
816:
812:
808:
804:
800:
796:
794:
790:
786:
782:
778:
776:
772:
768:
764:
760:
759:
757:
753:
749:
742:
737:
735:
730:
728:
723:
722:
719:
713:
710:
709:
705:
697:
691:
688:
683:
679:
675:
671:
667:
663:
656:
653:
647:
644:
638:
635:
630:
626:
622:
618:
613:
608:
604:
600:
596:
592:
588:
581:
578:
572:
568:
565:
563:
560:
558:
555:
553:
552:Pathogenomics
550:
548:
545:
543:
540:
538:
535:
534:
530:
528:
526:
522:
518:
514:
510:
506:
501:
497:
493:
485:
483:
481:
477:
470:
467:
465:
462:
460:
457:
455:
452:
451:
450:
448:
442:
434:
432:
430:
426:
422:
417:
413:
409:
405:
401:
397:
393:
389:
385:
381:
377:
373:
369:
365:
361:
360:transcription
357:
350:
348:
346:
342:
334:
329:
327:
326:
323:
319:
315:
311:
307:
300:
292:
290:
288:
278:
274:
271:
267:
257:
252:
244:
242:
240:
239:DNA synthesis
236:
232:
231:The Economist
227:
225:
224:Biotechnology
221:
215:
213:
208:
204:
201:developed by
200:
196:
192:
186:
178:
176:
174:
170:
166:
162:
158:
154:
143:
140:
132:
121:
118:
114:
111:
107:
104:
100:
97:
93:
90: –
89:
85:
84:Find sources:
78:
74:
68:
67:
62:This article
60:
56:
51:
50:
45:
41:
37:
33:
19:
1323:
1311:
1283:
1218:Nexus format
1213:NeXML format
1208:FASTQ format
1203:FASTA format
1191:File formats
953:Institutions
690:
665:
661:
655:
646:
637:
594:
590:
580:
547:Genetic code
489:
473:
444:
387:
383:
364:sequence RNA
354:
338:
302:
283:
269:
262:
230:
228:
216:
212:James Watson
188:
172:
160:
157:biochemistry
150:
135:
126:
116:
109:
102:
95:
88:"Sequencing"
83:
71:Please help
66:verification
63:
1198:CRAM format
1119:(CSIR-IGIB)
235:Moore's law
1341:Categories
1284:Sequencing
1248:GTF format
1243:GFF format
1238:VCF format
1228:SAM format
991:(EMBL-EBI)
917:SOAP suite
837:VectorBase
799:BioNumbers
785:Swiss-Prot
573:References
476:amino-acid
408:Eukaryotic
396:translated
380:small RNAs
322:base pairs
287:wavelength
214:recently.
191:nucleotide
169:biopolymer
161:sequencing
129:April 2008
99:newspapers
1111:(ISCB-SC)
1081:(EMBL-AR)
1014:(MPI-CBG)
755:Databases
621:0028-0836
412:co-linear
368:expressed
306:megabases
220:pathogens
18:Sequenced
1313:Category
1183:(RECOMB)
1133:Meetings
1087:(EMBnet)
937:Server:
912:SAMtools
907:PANGOLIN
870:Software
849:PHI-base
841:WormBase
811:InterPro
682:15040198
629:18421352
531:See also
519:include
400:proteins
384:in vitro
173:sequence
153:genetics
1325:Commons
1160:(InCoB)
1105:(ISCB)
1093:(INSDC)
1075:(ASBCB)
979:(DBCLS)
973:(COSBI)
887:Clustal
833:FlyBase
807:Ensembl
781:UniProt
763:GenBank
599:Bibcode
557:RNA-Seq
490:Though
447:protein
425:RNA-Seq
416:introns
314:enzymes
113:scholar
1166:(CIBB)
1154:(ISMB)
1148:(ECCB)
1125:(JSBi)
1031:(NBIC)
1020:(NCBI)
1008:(JCVI)
997:(EMBL)
985:(DDBJ)
939:ExPASy
922:TopHat
902:MUSCLE
892:EMBOSS
882:Bowtie
857:GISAID
817:, and
789:TrEMBL
680:
627:
619:
591:Nature
505:enzyme
500:bonded
115:
108:
101:
94:
86:
1177:(PSB)
1099:(ISB)
1048:(SIB)
1037:(PGC)
967:(CBD)
931:Other
897:HMMER
877:BLAST
706:Links
480:clone
423:see:
392:exons
310:EmPCR
120:JSTOR
106:books
859:and
825:BOLD
815:KEGG
791:and
773:and
678:PMID
625:PMID
617:ISSN
515:and
427:and
155:and
92:news
42:and
670:doi
607:doi
595:452
521:NMR
511:of
398:to
378:or
356:RNA
318:ATP
270:di-
195:DNA
151:In
75:by
1343::
1142:()
855:,
851:,
847:,
843:,
839:,
835:,
831:,
827:,
813:,
809:,
805:,
801:,
787:,
769:,
765:,
676:.
664:.
623:.
615:.
605:.
593:.
589:.
527:.
431:.
388:is
159:,
740:e
733:t
726:v
698:.
684:.
672::
666:1
631:.
609::
601::
142:)
136:(
131:)
127:(
117:·
110:·
103:·
96:·
69:.
46:.
20:)
Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.