Chado: the GMOD Database Schema
Chado is a relational database schema that underlies many GMOD installations. It is capable of representing many of the general classes of data frequently encountered in modern biology such as sequence, sequence comparisons, phenotypes, genotypes, ontologies, publications, and phylogeny. It has been designed to handle complex representations of biological knowledge and is the most sophisticated relational schemas currently available in molecular biology.
GMOD
The Generic Model Organism Database project or GMOD is a collection of open source software tools for managing, visualising, storing, and disseminating genetic and genomic data.
GMOD includes software such as:
- JBrowse: Super-fast genome annotation viewer
- Galaxy Project: Data analysis and integration
- Tripal: Chado web interface
- MAKER: Genome annotation pipeline
- CMap: Comparative map viewer
- Pathway Tools: Metabolic, regulatory pathways
- Canto: Literature annotation tool
All these tools use Chado as an underlying database schema.
References
- Github repo Chado: GMOD/Chado
- Documentation: GMOD
- Introduction to Chado: wiki Chado
Chado use case: storing genomic, genetic and breeding data of Rosaceae and Gossypium crops in Chado
Sook Jung, Taein Lee, Stephen Ficklin, Jing Yu, Chun-Huai Cheng, Dorrie Main
The Journal of Biological Databases and Curation, March 2016. DOI: 10.1093/database/baw010
A Chado case study: an ontology-based modular schema for representing genome-associated biological information
Christopher Mungall and David Emmert
Bioinformatics, July 2007. DOI: 10.1093/bioinformatics/btm189
The FlyBase database of the Drosophila genome projects and community literature
The FlyBase Consortium
Nucleic Acids Research, January 2003. DOI: 10.1093/nar/gkg094
Relevant Tags
About the Author
Latest Articles
-
Chado: the GMOD Database Schema
Chado is a relational database schema that underlies many GMOD installations. It is capable of representing many of the general classes of data frequently encountered in modern biology such as sequence, sequence comparisons, phenotypes, genotypes, ontologies, publications, and phylogeny. It has been designed to handle complex representations of biological knowledge and is the most sophisticated relational schemas currently available in molecular biology.JAN 2025 · PIERRE-EDOUARD GUERIN -
Error Messages with a CLI
I am an anxious person. So error messages always makes my heart beat faster. Hopefully, following the Pareto Principle, 80% of error messages are mild while 20% are the really tough one. The point is to solve the first kind as quickly as possible and effortless. To do so, allow the user to solve the issue by himself with clear messages and hints (in the case of errors related to input files or parameters). Clear presentation of the context and precise localization of the error in the code will save a lot of useless and tedious work to the developer. The time spared on the easy errors just by having better messages, then can be reallocated to the second kind of errors, the troublemakers.NOV 2024 · PIERRE-EDOUARD GUERIN -
Generative AI: Integrate openAI API with Python
I was fortunate to follow the course of Sven Warris about software tools to integrate genAI into your own work and applications. The course is aimed at data scientists and bioinformaticians.MAY 2024 · PIERRE-EDOUARD GUERIN