Sequencing - Knowledge (XXG)

304:

When a specific nucleotide is added, if the DNA polymerase incorporates it in the growing chain, the pyrophosphate is released and converted into ATP by ATP sulfurylase. ATP powers the oxidation of luciferase through the luciferase; this reaction generates a light signal recorded as a pyrogram peak. In this way, the nucleotide incorporation is correlated to a signal. The light signal is proportional to the amount of nucleotides incorporated during the synthesis of the DNA strand (i.e. two nucleotides incorporated correspond to two pyrogram peaks). When the added nucleotides aren't incorporated in the DNA molecule, no signal is recorded; the enzyme apyrase removes any unincorporated nucleotide remaining in the reaction. This method requires neither fluorescently-labelled nucleotides nor gel electrophoresis. Pyrosequencing, which was developed by Pål Nyrén and Mostafa Ronaghi DNA, has been commercialized by Biotage (for low-throughput sequencing) and 454 Life Sciences (for high-throughput sequencing). The latter platform sequences roughly 100

289:. This method is easier and quicker than the dye primer approach, but may produce more uneven data peaks (different heights), due to a template dependent difference in the incorporation of the large dye chain-terminators. This problem has been significantly reduced with the introduction of new enzymes and dyes that minimize incorporation variability. This method is now used for the vast majority of sequencing reactions as it is both simpler and cheaper. The major reason for this is that the primers do not have to be separately labelled (which can be a significant expense for a single-use custom primer), although this is less of a concern with frequently used 'universal' primers. This is changing rapidly due to the increasing cost-effectiveness of second- and third-generation systems from Illumina, 454, ABI, Helicos, and Dover. 277: 273:

the four dideoxyribonucleotides; the incorporation of the chain terminating nucleotides by the DNA polymerase in a random position results in a series of related DNA fragments, of different sizes, that terminate with a given dideoxiribonucleotide. The fragments are then size-separated by electrophoresis in a slab polyacrylamide gel, or more commonly now, in a narrow glass tube (capillary) filled with a viscous polymer.

1308: 55: 237:, and is named after author Rob Carlson. Carlson accurately predicted the doubling time of DNA sequencing technologies (measured by cost and performance) would be at least as fast as Moore's law. Carlson curves illustrate the rapid (in some cases hyperexponential) decreases in cost, and increases in performance, of a variety of technologies, including DNA sequencing, 1320: 284:

An alternative to the labelling of the primer is to label the terminators instead, commonly called 'dye terminator sequencing'. The major advantage of this approach is the complete sequencing set can be performed in a single reaction, rather than the four needed with the labeled-primer approach. This

272:

deoxynucleotide). The deoxynucleotides lack in the OH group both at the 2' and at the 3' position of the ribose molecule, therefore once they are inserted within a DNA molecule they prevent it from being further elongated. In this sequencer four different vessels are employed, each containing only of

217:

The sequence of DNA encodes the necessary information for living things to survive and reproduce. Determining the sequence is therefore useful in fundamental research into why and how organisms live, as well as in applied subjects. Because of the key importance DNA has to living things, knowledge of

209:

are gaining an increasing share of the sequencing market. More genome data are now being produced by pyrosequencing than Sanger DNA sequencing. Pyrosequencing has enabled rapid genome sequencing. Bacterial genomes can be sequenced in a single run with several times coverage with this technique. This

303:

The pyrosequencing method is based on the detection of the pyrophosphate release on nucleotide incorporation. Before performing pyrosequencing, the DNA strand to sequence has to be amplified by PCR. Then the order in which the nucleotides have to be added in the sequencer is chosen (i.e. G-A-T-C).

324:. Addition of one (or more) nucleotide(s) results in a reaction that generates a light signal that is recorded by the CCD camera in the instrument. The signal strength is proportional to the number of nucleotides, for example, homopolymer stretches, incorporated in a single nucleotide flow. 1163: 507:. In many cases the assembly is not uniquely specified; depending on which enzyme acts, one of several different units may be incorporated. This can lead to a family of similar molecules being formed. This is particularly true for plant polysaccharides. Methods for the 502:

in different ways. However, the main theoretical reason is that whereas the other polymers listed here are primarily generated in a 'template-dependent' manner by one processive enzyme, each individual join in a polysaccharide may be formed by a different

263:

In chain terminator sequencing (Sanger sequencing), extension is initiated at a specific site on the template DNA by using a short oligonucleotide 'primer' complementary to the template at that region. The oligonucleotide primer is extended using a

268:, an enzyme that replicates DNA. Included with the primer and DNA polymerase are the four deoxynucleotide bases (DNA building blocks), along with a low concentration of a chain terminating nucleotide (most commonly a 218:

DNA sequences is useful in practically any area of biological research. For example, in medicine it can be used to identify, diagnose, and potentially develop treatments for genetic diseases. Similarly, research into

418:

are excised. This gives a certain complexity to map the read sequences back to the genome and thereby identify their origin. For more information on the capabilities of next-generation sequencing applied to whole

1108: 494:

are also biopolymers, it is not so common to talk of 'sequencing' a polysaccharide, for several reasons. Although many polysaccharides are linear, many have branches. Many different units (individual

1072: 1011: 906: 339:

Whereas the methods above describe various sequencing methods, separate related terms are used when a large portion of a genome is sequenced. Several platforms were developed to perform

1090: 308:

in a seven-hour run with a single machine. In the array-based method (commercialized by 454 Life Sciences), single-stranded DNA is annealed to beads and amplified via

1102: 205:. This technique uses sequence-specific termination of a DNA synthesis reaction using modified nucleotide substrates. However, new sequencing technologies such as 1169: 1116: 1017: 1145: 1180: 474:

If the gene encoding the protein is known, it is currently much easier to sequence the DNA and infer the protein sequence. Determining part of a protein's

1157: 585:

Wheeler, David A.; Srinivasan, Maithreyan; Egholm, Michael; Shen, Yufeng; Chen, Lei; McGuire, Amy; He, Wen; Chen, Yi-Ju; Makhijani, Vinod (2008-04-17).

1151: 738: 374:

the RNA extracted from the sample to generate cDNA fragments. This can then be sequenced as described above. The bulk of RNA expressed in cells are

1139: 916: 406:

therefore indicates cellular activity, particularly desired in the studies of diseases, cellular behaviour, responses to reagents or stimuli.

285:

is accomplished by labelling each of the dideoxynucleotide chain-terminators with a separate fluorescent dye, which fluoresces at a different

1096: 994: 852: 650:

Carlson, Robert H. Biology Is Technology: The Promise, Peril, and New Business of Engineering Life. Cambridge, MA: Harvard UP, 2010. Print

320:. When free nucleotides are washed over this chip, light is produced as ATP is generated when nucleotides join with their complementary 1122: 366:

molecules. While sequencing DNA gives a genetic profile of an organism, sequencing RNA reflects only the sequences that are actively

1174: 1078: 1045: 1028: 988: 138: 256: 976: 964: 806: 1023: 76: 30:

This article is about the genetics definition of "sequencing". For the sense of "sequencing" used in electronic music, see

1346: 943: 860: 828: 119: 1273: 792: 731: 91: 901: 824: 766: 458: 72: 241:, and a range of physical and computational tools used in protein expression and in determining protein structures. 1351: 881: 39: 98: 65: 1324: 1051: 1005: 43: 1034: 359: 198: 1312: 1278: 921: 774: 724: 508: 344: 105: 1242: 982: 770: 541: 524: 358:

is less stable in the cell, and also more prone to nuclease attack experimentally. As RNA is generated by

317: 382:, detrimental for cellular translation, but often not the focus of a study. This fraction can be removed 1263: 876: 371: 87: 468: 362:

from DNA, the information is already present in the cell's DNA. However, it is sometimes desirable to

1247: 598: 1237: 1164:

International Conference on Computational Intelligence Methods for Bioinformatics and Biostatistics

1056: 844: 561: 428: 276: 1293: 1197: 1000: 440: 403: 325: 1288: 1227: 802: 677: 624: 616: 463: 453: 309: 250: 164: 35: 1268: 1232: 1040: 669: 606: 536: 479: 340: 202: 959: 512: 367: 211: 175:

which succinctly summarizes much of the atomic-level structure of the sequenced molecule.

31: 602: 112: 747: 566: 516: 495: 491: 298: 265: 234: 206: 184: 971:

Microsoft Research - University of Trento Centre for Computational and Systems Biology

226:

is a burgeoning discipline, with the potential for many useful products and services.

1340: 1222: 818: 551: 499: 478:

sequence (often one end) by one of the above methods may be sufficient to identify a

420: 375: 312:. These DNA-bound beads are then placed into wells on a fiber-optic chip along with 238: 223: 1212: 1207: 1202: 546: 156: 27:

In genetics and biochemistry, determining the structure of an unbranched biopolymer

660:

Carlson, Robert (2003). "The Pace and Proliferation of Biological Technologies".

395: 54: 673: 1217: 836: 798: 475: 286: 190: 168: 711: 620: 411: 407: 379: 321: 219: 681: 628: 587:"The complete genome of an individual by massively parallel DNA sequencing" 911: 848: 840: 810: 305: 152: 17: 662:

Biosecurity and Bioterrorism: Biodefense Strategy, Practice, and Science

611: 586: 386:, however, to enrich for the messenger RNA, also included, that usually 886: 832: 788: 784: 780: 762: 556: 446: 424: 399: 363: 313: 255: 1084: 938: 891: 856: 504: 415: 280:

View of the start of an example dye-terminator read (click to expand)

167:(sometimes incorrectly called the primary sequence) of an unbranched 197:

fragment. So far, most DNA sequencing has been performed using the

970: 896: 343:(a subset of all DNA across all chromosomes that encode genes) or 275: 254: 716: 814: 391: 171:. Sequencing results in a symbolic linear depiction known as a 720: 520: 355: 194: 48: 38:. For other uses of the terms "sequencer" and "sequence", see 1073:

African Society for Bioinformatics and Computational Biology

695: 370:

in the cells. To sequence RNA, the usual method is first to

1012:

Max Planck Institute of Molecular Cell Biology and Genetics

696:"A practical guide to structural analysis of carbohydrates" 1091:

International Nucleotide Sequence Database Collaboration

1256: 1190: 1132: 1065: 952: 930: 869: 754: 79:. Unsourced material may be challenged and removed. 783:, database of protein sequences grouping together 210:technique was also used to sequence the genome of 34:. For sequence learning in cognitive science, see 189:DNA sequencing is the process of determining the 1018:US National Center for Biotechnology Information 402:that support particular cellular functions. The 347:(sequencing of the all nuclear DNA of a human). 222:may lead to treatments for contagious diseases. 1103:International Society for Computational Biology 259:Part of a radioactively labelled sequencing gel 233:to describe the biotechnological equivalent of 1170:ISCB Africa ASBCB Conference on Bioinformatics 1117:Institute of Genomics and Integrative Biology 732: 8: 1146:European Conference on Computational Biology 1181:Research in Computational Molecular Biology 1158:International Conference on Bioinformatics 739: 725: 717: 712:https://www.nature.com/subjects/sequencing 641:Life 2.0. (2006, August 31). The Economist 1152:Intelligent Systems for Molecular Biology 610: 139:Learn how and when to remove this message 1140:Basel Computational Biology Conference‎ 577: 316:which produce light in the presence of 229:The Carlson curve is a term coined by 1097:International Society for Biocuration 995:European Molecular Biology Laboratory 7: 1319: 77:adding citations to reliable sources 1123:Japanese Society for Bioinformatics 1085:European Molecular Biology network 410:RNA molecules are not necessarily 25: 1175:Pacific Symposium on Biocomputing 1079:Australia Bioinformatics Resource 1046:Swiss Institute of Bioinformatics 1029:Netherlands Bioinformatics Centre 989:European Bioinformatics Institute 1318: 1307: 1306: 977:Database Center for Life Science 965:Computational Biology Department 853:Arabidopsis Information Resource 53: 823:Specialised genomic databases: 330:True single molecule sequencing 64:needs additional citations for 1024:Japanese Institute of Genetics 390:of interest. Derived from the 1: 944:Rosalind (education platform) 861:Zebrafish Information Network 829:Saccharomyces Genome Database 1274:List of biological databases 793:Protein Information Resource 414:with their DNA template, as 394:these mRNAs are to be later 767:European Nucleotide Archive 459:Peptide mass fingerprinting 1368: 674:10.1089/153871303769201851 438: 296: 248: 182: 40:Sequencer (disambiguation) 29: 1302: 1052:Wellcome Sanger Institute 1006:J. Craig Venter Institute 486:Polysaccharide sequencing 44:Sequence (disambiguation) 1035:Philippine Genome Center 199:chain termination method 1279:Molecular phylogenetics 775:China National GeneBank 509:structure determination 445:Methods for performing 345:whole genome sequencing 163:means to determine the 983:DNA Data Bank of Japan 771:DNA Data Bank of Japan 542:Full genome sequencing 335:Large-scale sequencing 281: 260: 1264:Computational biology 779:Secondary databases: 279: 258: 1347:Biochemistry methods 761:Sequence databases: 525:methylation analysis 482:carrying this gene. 449:sequencing include: 73:improve this article 1057:Whitehead Institute 845:Rat Genome Database 612:10.1038/nature06884 603:2008Natur.452..872W 562:MicroRNA sequencing 498:) can be used, and 429:MicroRNA Sequencing 1294:Sequence alignment 1001:Flatiron Institute 441:protein sequencing 435:Protein sequencing 404:expression profile 372:reverse transcribe 282: 261: 1352:Molecular biology 1334: 1333: 1289:Sequence database 803:Protein Data Bank 797:Other databases: 597:(7189): 872–876. 523:spectroscopy and 464:Mass spectrometry 454:Edman degradation 251:Sanger sequencing 245:Sanger sequencing 193:order of a given 165:primary structure 149: 148: 141: 123: 36:sequence learning 16:(Redirected from 1359: 1322: 1321: 1310: 1309: 1269:List of biobanks 1233:Stockholm format 1041:Scripps Research 741: 734: 727: 718: 700: 699: 692: 686: 685: 657: 651: 648: 642: 639: 633: 632: 614: 582: 537:Exome sequencing 513:oligosaccharides 469:Protease digests 341:exome sequencing 203:Frederick Sanger 144: 137: 133: 130: 124: 122: 81: 57: 49: 21: 1367: 1366: 1362: 1361: 1360: 1358: 1357: 1356: 1337: 1336: 1335: 1330: 1298: 1252: 1186: 1128: 1109:Student Council 1061: 960:Broad Institute 948: 926: 865: 750: 745: 708: 703: 694: 693: 689: 659: 658: 654: 649: 645: 640: 636: 584: 583: 579: 575: 533: 517:polysaccharides 496:monosaccharides 492:polysaccharides 488: 443: 437: 353: 337: 332: 301: 295: 253: 247: 187: 181: 145: 134: 128: 125: 82: 80: 70: 58: 47: 32:music sequencer 28: 23: 22: 15: 12: 11: 5: 1365: 1363: 1355: 1354: 1349: 1339: 1338: 1332: 1331: 1329: 1328: 1316: 1303: 1300: 1299: 1297: 1296: 1291: 1286: 1281: 1276: 1271: 1266: 1260: 1258: 1257:Related topics 1254: 1253: 1251: 1250: 1245: 1240: 1235: 1230: 1225: 1220: 1215: 1210: 1205: 1200: 1194: 1192: 1188: 1187: 1185: 1184: 1178: 1172: 1167: 1161: 1155: 1149: 1143: 1136: 1134: 1130: 1129: 1127: 1126: 1120: 1114: 1113: 1112: 1100: 1094: 1088: 1082: 1076: 1069: 1067: 1063: 1062: 1060: 1059: 1054: 1049: 1043: 1038: 1032: 1026: 1021: 1015: 1009: 1003: 998: 992: 986: 980: 974: 968: 962: 956: 954: 950: 949: 947: 946: 941: 934: 932: 928: 927: 925: 924: 919: 914: 909: 904: 899: 894: 889: 884: 879: 873: 871: 867: 866: 864: 863: 821: 795: 777: 758: 756: 752: 751: 748:Bioinformatics 746: 744: 743: 736: 729: 721: 715: 714: 707: 704: 702: 701: 687: 668:(3): 203–214. 652: 643: 634: 576: 574: 571: 570: 569: 567:Sequence motif 564: 559: 554: 549: 544: 539: 532: 529: 487: 484: 472: 471: 466: 461: 456: 439:Main article: 436: 433: 421:transcriptomes 376:ribosomal RNAs 352: 351:RNA sequencing 349: 336: 333: 331: 328: 299:Pyrosequencing 297:Main article: 294: 293:Pyrosequencing 291: 266:DNA polymerase 249:Main article: 246: 243: 207:pyrosequencing 185:DNA sequencing 183:Main article: 180: 179:DNA sequencing 177: 147: 146: 61: 59: 52: 26: 24: 14: 13: 10: 9: 6: 4: 3: 2: 1364: 1353: 1350: 1348: 1345: 1344: 1342: 1327: 1326: 1317: 1315: 1314: 1305: 1304: 1301: 1295: 1292: 1290: 1287: 1285: 1282: 1280: 1277: 1275: 1272: 1270: 1267: 1265: 1262: 1261: 1259: 1255: 1249: 1246: 1244: 1241: 1239: 1236: 1234: 1231: 1229: 1226: 1224: 1223:Pileup format 1221: 1219: 1216: 1214: 1211: 1209: 1206: 1204: 1201: 1199: 1196: 1195: 1193: 1189: 1182: 1179: 1176: 1173: 1171: 1168: 1165: 1162: 1159: 1156: 1153: 1150: 1147: 1144: 1141: 1138: 1137: 1135: 1131: 1124: 1121: 1118: 1115: 1110: 1107: 1106: 1104: 1101: 1098: 1095: 1092: 1089: 1086: 1083: 1080: 1077: 1074: 1071: 1070: 1068: 1066:Organizations 1064: 1058: 1055: 1053: 1050: 1047: 1044: 1042: 1039: 1036: 1033: 1030: 1027: 1025: 1022: 1019: 1016: 1013: 1010: 1007: 1004: 1002: 999: 996: 993: 990: 987: 984: 981: 978: 975: 972: 969: 966: 963: 961: 958: 957: 955: 951: 945: 942: 940: 936: 935: 933: 929: 923: 920: 918: 915: 913: 910: 908: 905: 903: 900: 898: 895: 893: 890: 888: 885: 883: 880: 878: 875: 874: 872: 868: 862: 858: 854: 850: 846: 842: 838: 834: 830: 826: 822: 820: 819:Gene Ontology 816: 812: 808: 804: 800: 796: 794: 790: 786: 782: 778: 776: 772: 768: 764: 760: 759: 757: 753: 749: 742: 737: 735: 730: 728: 723: 722: 719: 713: 710: 709: 705: 697: 691: 688: 683: 679: 675: 671: 667: 663: 656: 653: 647: 644: 638: 635: 630: 626: 622: 618: 613: 608: 604: 600: 596: 592: 588: 581: 578: 572: 568: 565: 563: 560: 558: 555: 553: 552:Pathogenomics 550: 548: 545: 543: 540: 538: 535: 534: 530: 528: 526: 522: 518: 514: 510: 506: 501: 497: 493: 485: 483: 481: 477: 470: 467: 465: 462: 460: 457: 455: 452: 451: 450: 448: 442: 434: 432: 430: 426: 422: 417: 413: 409: 405: 401: 397: 393: 389: 385: 381: 377: 373: 369: 365: 361: 360:transcription 357: 350: 348: 346: 342: 334: 329: 327: 326: 323: 319: 315: 311: 307: 300: 292: 290: 288: 278: 274: 271: 267: 257: 252: 244: 242: 240: 239:DNA synthesis 236: 232: 231:The Economist 227: 225: 224:Biotechnology 221: 215: 213: 208: 204: 201:developed by 200: 196: 192: 186: 178: 176: 174: 170: 166: 162: 158: 154: 143: 140: 132: 121: 118: 114: 111: 107: 104: 100: 97: 93: 90: – 89: 85: 84:Find sources: 78: 74: 68: 67: 62:This article 60: 56: 51: 50: 45: 41: 37: 33: 19: 1323: 1311: 1283: 1218:Nexus format 1213:NeXML format 1208:FASTQ format 1203:FASTA format 1191:File formats 953:Institutions 690: 665: 661: 655: 646: 637: 594: 590: 580: 547:Genetic code 489: 473: 444: 387: 383: 364:sequence RNA 354: 338: 302: 283: 269: 262: 230: 228: 216: 212:James Watson 188: 172: 160: 157:biochemistry 150: 135: 126: 116: 109: 102: 95: 88:"Sequencing" 83: 71:Please help 66:verification 63: 1198:CRAM format 1119:(CSIR-IGIB) 235:Moore's law 1341:Categories 1284:Sequencing 1248:GTF format 1243:GFF format 1238:VCF format 1228:SAM format 991:(EMBL-EBI) 917:SOAP suite 837:VectorBase 799:BioNumbers 785:Swiss-Prot 573:References 476:amino-acid 408:Eukaryotic 396:translated 380:small RNAs 322:base pairs 287:wavelength 214:recently. 191:nucleotide 169:biopolymer 161:sequencing 129:April 2008 99:newspapers 1111:(ISCB-SC) 1081:(EMBL-AR) 1014:(MPI-CBG) 755:Databases 621:0028-0836 412:co-linear 368:expressed 306:megabases 220:pathogens 18:Sequenced 1313:Category 1183:(RECOMB) 1133:Meetings 1087:(EMBnet) 937:Server: 912:SAMtools 907:PANGOLIN 870:Software 849:PHI-base 841:WormBase 811:InterPro 682:15040198 629:18421352 531:See also 519:include 400:proteins 384:in vitro 173:sequence 153:genetics 1325:Commons 1160:(InCoB) 1105:(ISCB) 1093:(INSDC) 1075:(ASBCB) 979:(DBCLS) 973:(COSBI) 887:Clustal 833:FlyBase 807:Ensembl 781:UniProt 763:GenBank 599:Bibcode 557:RNA-Seq 490:Though 447:protein 425:RNA-Seq 416:introns 314:enzymes 113:scholar 1166:(CIBB) 1154:(ISMB) 1148:(ECCB) 1125:(JSBi) 1031:(NBIC) 1020:(NCBI) 1008:(JCVI) 997:(EMBL) 985:(DDBJ) 939:ExPASy 922:TopHat 902:MUSCLE 892:EMBOSS 882:Bowtie 857:GISAID 817:, and 789:TrEMBL 680: 627: 619: 591:Nature 505:enzyme 500:bonded 115: 108: 101: 94: 86: 1177:(PSB) 1099:(ISB) 1048:(SIB) 1037:(PGC) 967:(CBD) 931:Other 897:HMMER 877:BLAST 706:Links 480:clone 423:see: 392:exons 310:EmPCR 120:JSTOR 106:books 859:and 825:BOLD 815:KEGG 791:and 773:and 678:PMID 625:PMID 617:ISSN 515:and 427:and 155:and 92:news 42:and 670:doi 607:doi 595:452 521:NMR 511:of 398:to 378:or 356:RNA 318:ATP 270:di- 195:DNA 151:In 75:by 1343:: 1142:() 855:, 851:, 847:, 843:, 839:, 835:, 831:, 827:, 813:, 809:, 805:, 801:, 787:, 769:, 765:, 676:. 664:. 623:. 615:. 605:. 593:. 589:. 527:. 431:. 388:is 159:, 740:e 733:t 726:v 698:. 684:. 672:: 666:1 631:. 609:: 601:: 142:) 136:( 131:) 127:( 117:· 110:· 103:· 96:· 69:. 46:. 20:)

Text is available under the Creative Commons Attribution-ShareAlike License. Additional terms may apply.

Index