FlyBase : Database of Drosophila Genes and Genomes
Drosophila, FlyBase, biocuration, ontologies
FlyBase is the premier database of genetic, genomic and functional data for Drosophila species, with a focus on the model organism Drosophila melanogaster. FlyBase integrates information from the research literature and various external databases and projects, and makes these data available to the research community via our public website.
The FlyBase project is a collaboration between four groups based at Harvard University, USA (PI Bill Gelbart); Indiana University, USA (PIs Thom Kaufman and Kathy Matthews); The University of New Mexico, USA (PI Maggie Werner-Washburne); and The University of Cambridge, UK (PI Nick Brown).
The FlyBase-Cambridge division comprises a team of literature curators, ontologists and a software developer. Our primary task is to identify genetic data from scientific articles concerning Drosophila and to record these data in a systematic manner. Relevant data include all genetic entities (genes, alleles, transgenic constructs, etc.) and certain types of associated information such as phenotypes, genetic interactions and gene product functions. Curated data are subsequently integrated into the FlyBase database and website.
Other responsibilities of FlyBase-Cambridge include the production and maintenance of the FlyBase bibliography, and the development and maintenance of controlled vocabularies (ontologies) used in literature curation.
FlyBase-Cambridge staff give regular presentations and workshops within the University of Cambridge and around the UK, and we also attend the Annual Drosophila Research Conference in the USA and the European Drosophila Research Conference. We are also affiliated with the International Society for Biocuration.
Enquiries regarding FlyBase-Cambridge should be sent to the Group Manager, Steven Marygold. General enquires and feedback about FlyBase should be communicated using the 'Contact FlyBase' form.
5 key publications
- Vasilevsky N A, Brush M H, Paddock H, Ponting L, Tripathy S J, Larocca G M, Haendel M A (2013) On the reproducibility of science: unique identification of research resources in the biomedical literature. PeerJ. 1:e148. [PMID: 24032093]
- Marygold S J, Leyland P C, Seal R L, Goodman J L, Thurmond J, Strelets V B, Wilson R J; FlyBase consortium (2013) FlyBase: improvements to the bibliography. Nucleic Acids Res. 41(D1):D751-D757 [PMID: 23125371]
- McQuilton P; FlyBase Consortium (2012) Opportunities for text mining in the FlyBase genetic literature curation workflow. Database 2012:bas039 [PMID: 23160412]
- Bunt S M, Grumbling G B, Field H I, Marygold S J, Brown N H, Millburn G H; the FlyBase Consortium (2012) Directly e-mailing authors of newly published papers encourages community curation. Database. 2012(0):bas024 [PMID: 22554788]
- Fang R, Schindelman G, Van Auken K, Fernandes J, Chen W, Wang X, Davis P, Tuli M A, Marygold S, Millburn G, Matthews B, Zhang H, Brown N, Gelbart W M, Sternberg P W (2012) Automatic categorization of diverse experimental information in the bioscience literature. BMC Bioinformatics. 13(1):16 [PMID: 22280404]