Genomics is an interdisciplinary field of biology that studies the structure, function, mapping, and editing of genomes. A genome is an organism's complete set of DNA, including all of its genes.

The global genomics industry is expected to reach $27.6 billion by 2025. As part of it, the genetic testing market is projected to be worth over $22 billion by 2024.

Machine Learning in Genomics

Current and Future Applications

  • Gene editing
  • Genome sequencing
  • Clinical workflows
  • Consumer genomics products
  • Disease prediction with genomics
  • Pharmacogenomics
  • Genetic screening of newborns
  • Agriculture

Some Examples:


Clustering (Unsupervised learning)

  • Binning of metagenomic contigs
  • Identification of plasmids and chromosomes
  • Clustering reads by chromosome for better assembly
  • Clustering reads as a preprocessing step before assembly
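
To make the binning idea concrete, here is a minimal sketch that groups contigs by their k-mer composition using a small hand-written k-means. Everything in it is an assumption for illustration: the contig names and sequences are made-up toy data, the helpers kmer_profile and kmeans are hypothetical, and k=2 is chosen only to keep the example tiny (binning tools often work with longer k-mers such as tetranucleotides, usually alongside coverage information).

```python
from itertools import product
import math

def kmer_profile(seq, k=2):
    """Return normalized k-mer frequencies over the ACGT alphabet."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # skip windows containing N or other symbols
            counts[kmer] += 1
    total = sum(counts.values()) or 1  # avoid division by zero
    return [counts[km] / total for km in kmers]

def kmeans(points, n_clusters, n_iter=20):
    """Plain k-means with deterministic farthest-point initialization."""
    centers = [points[0]]
    while len(centers) < n_clusters:
        # Pick the point farthest from all centers chosen so far.
        centers.append(
            max(points, key=lambda p: min(math.dist(p, c) for c in centers))
        )
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: nearest center by Euclidean distance.
        labels = [
            min(range(n_clusters), key=lambda j: math.dist(p, centers[j]))
            for p in points
        ]
        # Update step: each center becomes the mean of its members.
        for j in range(n_clusters):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centers[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels

# Toy contigs (made up): two AT-rich and two GC-rich sequences.
contigs = {
    "contig_1": "ATATTTAAATATTAATATAT",
    "contig_2": "TTAATATATAAATTTATATA",
    "contig_3": "GCGGCCGCGCCGGCGCGGCC",
    "contig_4": "CCGCGGCGCGCCGGGCGCGC",
}

profiles = [kmer_profile(seq) for seq in contigs.values()]
labels = kmeans(profiles, n_clusters=2)

for name, label in zip(contigs, labels):
    print(name, "-> bin", label)
```

On this toy input the two AT-rich contigs end up in one bin and the two GC-rich contigs in the other. Real data would need longer sequences, a larger k, and a principled way to choose the number of bins.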

Classification (Supervised learning)

  • Classifying short sequences into taxonomic ranks (phylum, genus, species, etc.)
  • Phylogenetic inference of the sequences
  • Detection of plasmids and chromosomes
  • Finding coding regions
  • Chromosome prediction in human genomics
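
As a rough illustration of the supervised case, the sketch below classifies short sequences with a nearest-centroid rule on k-mer frequencies. It is only one simple option among many, not a recommended method. The class names taxon_A and taxon_B, the training sequences, and the helpers kmer_profile, fit_centroids, and predict are all hypothetical and exist only for this example.

```python
from collections import defaultdict
from itertools import product
import math

def kmer_profile(seq, k=2):
    """Return normalized k-mer frequencies over the ACGT alphabet."""
    kmers = ["".join(p) for p in product("ACGT", repeat=k)]
    counts = dict.fromkeys(kmers, 0)
    seq = seq.upper()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in counts:  # ignore windows with ambiguous bases
            counts[kmer] += 1
    total = sum(counts.values()) or 1  # avoid division by zero
    return [counts[km] / total for km in kmers]

def fit_centroids(sequences, labels):
    """Average the k-mer profiles of each class into one centroid."""
    grouped = defaultdict(list)
    for seq, label in zip(sequences, labels):
        grouped[label].append(kmer_profile(seq))
    return {
        label: [sum(col) / len(profiles) for col in zip(*profiles)]
        for label, profiles in grouped.items()
    }

def predict(centroids, seq):
    """Return the label whose centroid is closest to the sequence profile."""
    profile = kmer_profile(seq)
    return min(centroids, key=lambda label: math.dist(profile, centroids[label]))

# Made-up labeled training data: one AT-rich class and one GC-rich class.
train_sequences = [
    "ATATTTAAATATTAATATAT",
    "TTAATATATAAATTTATATA",
    "GCGGCCGCGCCGGCGCGGCC",
    "CCGCGGCGCGCCGGGCGCGC",
]
train_labels = ["taxon_A", "taxon_A", "taxon_B", "taxon_B"]

centroids = fit_centroids(train_sequences, train_labels)

prediction_at = predict(centroids, "ATTATAATTA")
prediction_gc = predict(centroids, "GGCGCCGCGG")
print(prediction_at, prediction_gc)
```

The same feature vectors could be passed to any standard classifier; the nearest-centroid rule is used here only because it fits in a few lines and needs no external library.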