| Abe,Mamitsuka, Predicting protein secondary structure using stochastic tree grammars, 1997 | :: | 1 |
| Altman,Raychaudhuri, Whole-genome expression analysis: challenges beyond clustering, 2001 | :: | 1 |
| Baldi,Brunak, Bioinformatics: The Machine Learning Approach, 2001 | :: | 1 |
| Barbrook,Howe,Blake,Robinson, The phylogeny of the Canterbury Tales, 1998 | :: | 2 |
| Barnbrook, Language and Computers, 1996 | :: | 1 |
| Binongo,Smith, The application of principal component analysis to stylometry, 1999 | :: | 1 |
| Brendel,Busse, Genome structure described by formal languages, 1984 | :: | 1 |
| Brown,Wilson, RNA pseudoknot modeling using intersections of stochastic context free grammars with applications to database search, 1996 | :: | 1 |
| Brown, Small subunit ribosomal RNA modeling using stochastic context-free grammars, 2000 | :: | 1 |
| Burge,Karlin, Prediction of complete gene structures in human genomic DNA, 1997 | :: | 1 |
| Campbell, Historical Linguistics: An Introduction, 1999 | :: | 1 |
| Chomsky, Syntactic Structures, 1957 | :: | 36 |
| Collado-Vides, A transformational-grammar approach to the study of the regulation of gene expression, 1989 | :: | 1 |
| Darwin, The Descent of Man, 1871 | :: | 5 |
| Dawkins, The Selfish Gene, 1976 | :: | 43 |
| Dong,Searls, Gene structure prediction by linguistic methods, 1994 | :: | 1 |
| Durbin,Krogh,Mitchison,Eddy, Biological Sequence Analysis: Probabilistic Models of Proteins and Nucleic Acids, 1998 | :: | 1 |
| Ferrer, Hypertextual representation of literary working papers, 1995 | :: | 1 |
| Garcia-Vallve,Romeu,Palau, Horizontal gene transfer in bacterial and archaeal complete genomes, 2000 | :: | 1 |
| Harrison,Gerstein, Studying genomes through the aeons: protein families, pseudogenes and proteome evolution, 2002 | :: | 1 |
| Head, Formal language theory and DNA: an analysis of the generative capacity of specific recombinant behaviors, 1987 | :: | 1 |
| Holmes,Forsyth, The Federalist revisited: new directions in authorship attribution, 1995 | :: | 1 |
| Holmes,Rubin, Pairwise RNA structure comparison with stochastic context-free grammars, 2002 | :: | 1 |
| Hoorn,Frank,Kowalczyk,van der, Neural network identification of poets using letter sequences, 1999 | :: | 1 |
| Hoover, Statistical stylistics and authorship attribution: an empirical investigation, 2001 | :: | 1 |
| Hoyle,Rattray,Jupp,Brass, Making sense of microarray data distributions, 2002 | :: | 1 |
| Huynen,van Nimwegen, The frequency distribution of gene family sizes in complete genomes, 1998 | :: | 1 |
| Jeong, The large-scale organization of metabolic networks, 2000 | :: | 1 |
| Jung,Lee, Circularly permuted proteins in the protein structure database, 2001 | :: | 1 |
| Jurafsky,Martin, Speech and Language Processing, 2000 | :: | 1 |
| Knudsen,Hein, RNA secondary structure prediction using stochastic context-free grammars and evolutionary history, 1999 | :: | 1 |
| Leopold,Kindermann, Text categorization with support vector machines, 2002 | :: | 1 |
| Leung,Mellish,Robertson, Basic Gene Grammars and DNA-ChartParser for language processing of Escherichia coli promoter DNA sequences, 2001 | :: | 1 |
| Lin,Gerstein, Whole-genome trees based on the occurrence of folds and orthologs: implications for comparing genomes on different levels, 2000 | :: | 1 |
| Lupas,Ponting,Russell, On the evolution of protein folds: are similar motifs in different protein folds the result of convergence, insertion, or relics of an ancient peptide world?, 2001 | :: | 1 |
| Lyngso,Pedersen, RNA pseudoknot prediction in energy-based models, 2000 | :: | 1 |
| Mandelbrot, The Fractal Geometry of Nature, 1983 | :: | 3 |
| Mantegna, Linguistic features of noncoding DNA sequences, 1994 | :: | 1 |
| Marcotte, Detecting protein function and protein-protein interactions from genome sequences, 1999 | :: | 1 |
| McWhorter, The Power of Babel: A Natural History of Language, 2001 | :: | 2 |
| Mushegian, The minimal genome concept, 1999 | :: | 1 |
| Nowak,Komarova,Niyogi, Computational and evolutionary aspects of language, 2002 | :: | 31 |
| Park,Lappe,Teichmann, Mapping protein family interactions: intramolecular and intermolecular protein family interaction repertoires in the PDB and yeast, 2001 | :: | 1 |
| Pellegrini, Assigning protein functions by comparative genome analysis: protein phylogenetic profiles, 1999 | :: | 1 |
| Pennock, Tower of Babel: The Evidence against the New Creationism, 1999 | :: | 3 |
| Platnick,Cameron, Cladistic methods in textual, linguistic, and phylogenetic analysis, 1977 | :: | 2 |
| Popov,Segal,Trifonov, Linguistic complexity of protein sequences as compared to texts of human languages, 1996 | :: | 1 |
| Przytycka,Srinivasan,Rose, Recursive domains in proteins, 2002 | :: | 1 |
| Qian,Luscombe,Gerstein, Protein family and fold occurrence in genomes: power-law behaviour and evolutionary model, 2001 | :: | 1 |
| Reese,Kulp,Tammana,Haussler, Genie--gene finding in Drosophila melanogaster, 2000 | :: | 1 |
| Rivas,Eddy, The language of RNA: a formal grammar that includes pseudoknots, 2000 | :: | 1 |
| Rivas,Eddy, Noncoding RNA gene detection using comparative sequence analysis, 2001 | :: | 1 |
| Rosenblueth, Syntactic recognition of regulatory regions in Escherichia coli, 1996 | :: | 1 |
| Rudman, The state of authorship attribution studies: some problems and solutions, 1998 | :: | 1 |
| Rzhetsky,Gomez, Birth of scale-free molecular networks and the number of distinct DNA and protein domains per genome, 2001 | :: | 1 |
| Sakakibara, Stochastic context-free grammars for tRNA modeling, 1994 | :: | 1 |
| Schultz,Milpetz,Bork,Ponting, SMART, a simple modular architecture research tool: identification of signalling domains, 1998 | :: | 1 |
| Schuster,Fontana,Stadler,Hofacker, From sequences to shapes and back: a case study in RNA secondary structures, 1994 | :: | 1 |
| Searls, The linguistics of DNA, 1992 | :: | 1 |
| Searls, String Variable Grammar: a logic grammar formalism for DNA sequences, 1995 | :: | 1 |
| Searls, Linguistic approaches to biological sequences, 1997 | :: | 1 |
| Searls, From Jabberwocky to genome: Lewis Carroll and computational biology, 2001 | :: | 1 |
| Searls, Mining the bibliome, 2001 | :: | 1 |
| Searls, Reading the book of life, 2001 | :: | 1 |
| Shieber, Evidence against the context-freeness of natural language, 1985 | :: | 3 |
| Smadja, Retrieving collocations from text: XTRACT, 1993 | :: | 1 |
| Snel,Bork,Huynen, Genome phylogeny based on gene content, 1999 | :: | 1 |
| Spenser,Howe, Estimating distances between manuscripts based on copying errors, 2001 | :: | 1 |
| Swadesh, Lexicostatistical dating of prehistoric ethnic contacts: with special reference to North American Indians and Eskimos, 1952 | :: | 1 |
| Tanselle, Literature and Artifacts, 1998 | :: | 1 |
| Tatusov,Galperin,Natale,Koonin, The COG database: a tool for genome-scale analysis of protein functions and evolution, 2000 | :: | 1 |
| Tekaia,Lazcano,Dujon, The genomic tree as revealed from whole proteome comparisons, 1999 | :: | 1 |
| Trifonov, Interfering contexts of regulatory sequence elements, 1996 | :: | 1 |
| Uemura,Hasegawa,Kobayashi,Yokomori, Tree-adjoining grammars for RNA structure prediction, 1999 | :: | 1 |
| Warnow, Mathematical approaches to comparative linguistics, 1997 | :: | 4 |
| Westhead,Slidel,Flores,Thornton, Protein structural topology: automated analysis and diagrammatic representation, 1999 | :: | 1 |
| White, A quality control algorithm for DNA sequencing projects, 1993 | :: | 1 |
| Yandell,Majoros, Genomics and natural language processing, 2002 | :: | 1 |
| Zipf, Human Behavior and the Principle of Least Effort, 1949 | :: | 11 |