Rat Strain Ontology: structured controlled vocabulary designed to facilitate access to strain data at RGD
© Nigam et al.; licensee BioMed Central Ltd. 2013
Received: 31 May 2013
Accepted: 2 October 2013
Published: 22 November 2013
The Rat Genome Database (RGD) (http://rgd.mcw.edu/) is the premier site for comprehensive data on the different strains of the laboratory rat (Rattus norvegicus). The strain data are collected from various publications, direct submissions from individual researchers, and rat providers worldwide. Rat strain, substrain designation and nomenclature follow the Guidelines for Nomenclature of Mouse and Rat Strains, instituted by the International Committee on Standardized Genetic Nomenclature for Mice. While symbols and names aid in identifying strains correctly, the flat nature of this information prohibits easy search and retrieval, as well as other data mining functions. In order to improve these functionalities, particularly in ontology-based tools, the Rat Strain Ontology (RS) was developed.
The Rat Strain Ontology (RS) reflects the breeding history, parental background, and genetic manipulation of rat strains. This controlled vocabulary organizes strains by type: inbred, outbred, chromosome altered, congenic, mutant and so on. In addition, under the chromosome altered category, strains are organized by chromosome, and further by type of manipulations, such as mutant or congenic. This allows users to easily retrieve strains of interest with modifications in specific genomic regions. The ontology was developed using the Open Biological and Biomedical Ontology (OBO) file format, and is organized on the Directed Acyclic Graph (DAG) structure. Rat Strain Ontology IDs are included as part of the strain report (RS: ######).
As rat researchers are often unaware of the number of substrains or altered strains within a breeding line, this vocabulary now provides an easy way to retrieve all substrains and accompanying information. Its usefulness is particularly evident in tools such as the PhenoMiner at RGD, where users can now easily retrieve phenotype measurement data for related strains, strains with similar backgrounds or those with similar introgressed regions. This controlled vocabulary also allows better retrieval and filtering for QTLs and in genomic tools such as the GViewer.
The Rat Strain Ontology has been incorporated into the RGD Ontology Browser (http://rgd.mcw.edu/rgdweb/ontology/view.html?acc_id=RS:0000457#s) and is available through the National Center for Biomedical Ontology (http://bioportal.bioontology.org/ontologies/1150) or the RGD ftp site (ftp://rgd.mcw.edu/pub/ontology/rat_strain/).
The use of the rat for genetics studies in Europe can be traced back to the first half of the eighteenth century. Experimentally, Crampe et al. mated an albino female to a wild gray male in 1880. In the F1 offspring, three mutant genes were phenotypically observed: c (albino), a (non-agouti), and h (hooded). An early effort to track new strains and substrains, focused on when rats were transferred from one lab to another, resulting in new substrains that could affect animals both phenotypically and genotypically by the resultant changes in environment, dietary conditions or breeding strategy, as well as spontaneous genetic variations. The list of codes used to designate laboratories developing and maintaining rat colonies was first published in 1973. Efforts have also been made to capture differences in phenotype by integrating microsatellite markers into the genetic linkage maps and radiation hybrid maps. In order to make significant comparisons, determine relationships amongst strains, and select an appropriate model for biomedical studies, knowledge of the different rat strains and their breeding approaches is crucial. The first attempt to create a phylogenetic tree for 13 inbred strains (homozygous strain produced by brother-sister mating for at least 20+ generations) using genetic markers was done by Canzian et al.. This was followed by an enhanced version comprising 63 inbred strains and 214 substrains (genetically diverse inbred strains due to separation after 20 generations or separated due to any genetic difference), which was plotted using the percentage of genotypic differences. Thomas et al. presented phylogenetic relationships of 48 inbred strains, using the allele size of each strain at each microsatellite locus. A phylogenetic tree is also available at The National BioResource Project for the Rat in Japan, for 132 rat strains. Maximum parsimony analysis was used to calculate this tree (http://www.anim.med.kyoto-u.ac.jp/nbr/phylo.aspx). Leveraging these efforts to represent relationships amongst strains, the Rat Genome Database (RGD) has created standardized data formats for capturing strain background and breeding variations to represent all registered strains in a format that is hierarchical and computable.
RGD: a unique resource for registering rat strains
RGD is a universally accessible database that has an exclusive collection of rat genetic and genomic data curated from current research publications and direct data submission by rat researchers and rat providers. RGD currently has a catalogue of more than 2900 strains and substrains. RGD provides official assignment of rat strain symbols and names, and encourages researchers to submit strain data prior to publication through an online strain registration form (http://www.rgd.mcw.edu/tu/strains/#StrainRegistration), to ensure proper identification of their strains in their manuscripts. RGD validates the nomenclature of the submitted strains following the nomenclature guidelines laid out by the International Committee on Standardized Genetic Nomenclature for Mouse and the Rat Genome and Nomenclature Committee[9, 10]. The registered symbol and name of the strain along with a unique identifier, the RGD ID, are assigned and sent to submitters for reference in their publication. Strains from major rat resources such as the PhysGen Program for Genomic Applications (PhysGen,http://pga.mcw.edu/), Rat Resource and Research Center (RRRC,http://www.rrrc.us/), National BioResource Project (NBRP,http://www.anim.med.kyoto-u.ac.jp/nbr/Default.aspx) in Japan, and commercial rat providers such as Charles River (CRL,http://www.criver.com), Harlan Laboratories (http://www.harlan.com/), Sigma Advanced Genetic Engineering Labs (SAGE,http://www.sageresearchmodels.com) and Transposagen (http://www.transposagenbio.com) regularly submit strains to RGD for nomenclature and ID assignment and the creation of strain reports. These distributors mention the specific nomenclature on their websites, reminding researchers to use the correct nomenclature in their publications so that the information can be extracted and attached to the appropriate strain.
Results and discussion
Rat Strain Ontology
Organization of the ontology
Since techniques used to alter the chromosomes also play a crucial role in determining the strains, mutant strains are further divided. For example, mutants created by N-ethyl-N-nitrosourea (ENU)[11, 12], zinc-finger nucleases (ZFN) and transcription activator-like effector nuclease (TALEN) have separate nodes under the parental strains. The technique used to create the specific mutant is mentioned in parenthesis. The mutant strain in which a particular gene is mutated is placed under the relative chromosome number; for example, gene Tgfb1 (transforming growth factor, beta 1) maps to chromosome 1 in the rat, so the heterozygous mutant strain SS-Tgfb1em3Mcwi-/+ (RS:0003129) is placed under SS/JrHsdMcwi Heterozygous (ZFN) mutants (Figure 3B). This strain, having a mutation in chromosome 1, is also under SS/JrHsdMcwi (ZFN) mutants (chr 1) a sub-branch of "chromosome 1 mutant" under "chromosome altered". These substantial improvements have helped in making this vocabulary more robust and usable. In addition, users can now view all the homozygous, heterozygous and wild type strains under a single node.
Searching a strain in the ontology
The RGD Ontology Browser (http://rgd.mcw.edu/rgdweb/ontology/search.html) can easily be used to search a strain. When a desired strain symbol; for example, BN is searched in the browser the result page displays the number of terms that match the searched term in all the different ontologies. Clicking on the ontology name, "RS: Rat Strains" or on the number of terms displays all the strains that have the searched term "BN" in them in the Rat Strain Ontology. By first clicking on the tree sign adjacent to "BN mutants", then BN/NHsdMcwi mutants and finally BN/NHsdMcwi (ENU) mutants, a list of all the strains is generated by this technique using the parental BN strain. A click on any individual strain takes the user to the ontology report page, where the term is displayed in a "driller" format, with the searched term in the middle column along with siblings, parents in the left column and children in the right (Figure 4). Clicking the "View Strain Report" option takes the user to the respective RGD strain report page which displays the RS ID. This option is available for the curated strains and not for placeholders. If a strain is searched using the general keyword search in RGD, then the result page lists all the strains. A click on the strain symbol goes to the strain report page which has the RS ID of the strain mentioned as ontology ID. Using the same example, if BN/NHsdMcwi is searched in keyword search of RGD, then the report page shows all the strains that have the searched term BN/NHsdMcwi in them. A click on the strain symbol takes the user to the individual strain report page that has the RS: 0000145, this ID links to the RGD Ontology Browser showing the different substrains derived from it. The tree view of the Rat Strain Ontology at NCBO BioPortal[16, 17] also displays the hierarchy of the strains in a similar fashion.
As RGD has a vast collection of strains, selecting an appropriate strain from a list of over 2900 strains is not easy; it is here that the RS Ontology has a vital role. Users can scroll down the lists of different types of strains, or restrict their choices by "chromosome altered" and then by the different strain types. RGD’s robust usage of the RS Ontology for classifying strains makes it valuable for biologists using rats in their research, as it helps them in predicting the genomic contents a particular strain may have inherited from the parental strains. The RS Ontology is included in the RGD Ontology Browser, and the strain report pages, which have comprehensive descriptions of characteristics, origin, disease, phenotype and physiological information, behavior, drug reactions and reproductive notes make the RS Ontology annotations an important navigational tool. The rat strains that are curated from published articles are annotated by Mammalian Phenotype Ontology and MEDIC disease ontology which are used to conduct effective searches for strains based on disease and phenotypes. These annotations help in assigning strains to their respective disease portals.
RGD tools, such as PhenoMiner, display experimental records associated with phenotypic measurements of rat strains used in experiments. PhenoMiner has 18580 records with quantified phenotype values attached to consomic strains, 11524 values attached to inbred strains, 2870 to congenic strains, 2204 to all mutant (ZFN) strains and 2063 to all mutant (ENU) strains as of September 2013. These are entered into PhenoMiner by using the RS Ontology and three other ontologies, namely, clinical measurement (CMO), measurement method (MMO), and experimental condition ontologies (XCO). All rat QTLs are annotated to the RS Ontology to facilitate querying, retrieval and filtering of QTL data. All the congenic and consomic, strains that have an introgressed segment can be visualized in GViewer which can be accessed from the disease portals. QTL report pages have a link that leads to a narrower region which can be visualized by zooming in with GBrowse[25, 26] which displays the congenic and congenic substrains that have the desired region. As stated earlier, this information is captured in the chromosome altered node of the RS Ontology.
The Rat Strain Ontology is a new tool for annotating rat strains in a standardized manner which reflects the breeding history and genetic makeup of the strains to facilitate querying and retrieval, analysis and comparisons amongst strains. The latest version of the Rat Strain Ontology has been revised to classify all of the wild type, heterozygous, and homozygous strains, with the mutants further grouped under these strain subtypes. As the development process continues, new strains are continually being added and application of this vocabulary is continually expanding to allow investigators to integrate, consolidate and compare phenotypic measurement data from diverse sources.
Development of the ontology
This ontology is developed using OBO-Edit[27, 28], a Java based tool that uses a graph-oriented approach to display and edit the ontologies. RGD currently uses OBO-Edit2 for editing and adding new strain information. In some instances, strain symbols are used as placeholders for the graph nodes in order to maintain the relationship and hierarchical structure of the ontology. For example no details are known for the parent strain ACI (RS:0000012), whereas details are known about the substrains ACI/N, ACI/Kun, ACI/SegHsd etc. So, in these cases, the parent term ACI was used as a placeholder so that the children terms could be added. Textual synonyms including the RGD ID are entered via Term Editor.
This ontology is free and available to all users. This can be viewed in the RGD Ontology Browser athttp://rgd.mcw.edu/rgdweb/ontology/search.html, as well as at the National Center for Biomedical Ontology (NCBO) BioPortal websitehttp://bioportal.bioontology.org/ontologies/1150. Systematic versions can be downloaded from the RGD ftp siteftp://rgd.mcw.edu/pub/ontology/rat_strain/.
RGD is funded by the National Heart, Lung, and Blood Institute on behalf of the National Institutes of Health (HL64541). PhenoMiner is funded by the National Heart, Lung, and Blood Institute on behalf of the National Institutes of Health (HL094271).
- Castle WE: The domestication of the rat. Proc Natl Acad Sci U S A. 1947, 33: 109-117. 10.1073/pnas.33.5.109.View ArticleGoogle Scholar
- Listing F: Stardardized nomenclature for inbred strains of rats. Transplantation. 1973, 16: 221-245. 10.1097/00007890-197309000-00010.View ArticleGoogle Scholar
- Jacob HJ, Brown DM, Bunker RK, Daly MJ, Dzau VJ, Goodman A, Koike G, Kren V, Kurtz T, Lernmark A, et al: A genetic linkage map of the laboratory rat, rattus norvegicus. Nat Genet. 1995, 9: 63-69. 10.1038/ng0195-63.View ArticleGoogle Scholar
- Kwitek AE, Gullings-Handley J, Yu J, Carlos DC, Orlebeke K, Nie J, Eckert J, Lemke A, Andrae JW, Bromberg S, Pasko D, Chen D, Scheetz TE, Casavant TL, Soares MB, Sheffield VC, Tonellato PJ, Jacob HJ: High-density rat radiation hybrid maps containing over 24,000 sslps, genes, and ests provide a direct link to the rat genome sequence. Genome Res. 2004, 14: 750-757. 10.1101/gr.1968704.View ArticleGoogle Scholar
- Canzian F, Ushijima T, Pascale R, Sugimura T, Dragani TA, Nagao M: Construction of a phylogenetic tree for inbred strains of rat by arbitrarily primed polymerase chain reaction (ap-pcr). Mamm Genome. 1995, 6: 231-235. 10.1007/BF00352406.View ArticleGoogle Scholar
- Canzian F: Phylogenetics of the laboratory rat rattus norvegicus. Genome Res. 1997, 7: 262-267. 10.1101/gr.7.3.262.View ArticleGoogle Scholar
- Thomas MA, Chen CF, Jensen-Seaman MI, Tonellato PJ, Twigger SN: Phylogenetics of rat inbred strains. Mamm Genome. 2003, 14: 61-64. 10.1007/s00335-002-2204-5.View ArticleGoogle Scholar
- Serikawa T, Mashimo T, Takizawa A, Okajima R, Maedomari N, Kumafuji K, Tagami F, Neoda Y, Otsuki M, Nakanishi S, Yamasaki K, Voigt B, Kuramoto T: National bioresource project-rat and related activities. Exp Anim. 2009, 58: 333-341. 10.1538/expanim.58.333.View ArticleGoogle Scholar
- Carter TC, Dunn LC, Falconer DS, et al: Committee on standardized nomenclature for inbred strains of mice. Cancer Res. 1952, 12: 602-613.Google Scholar
- Staats J: Standardized nomenclature for inbred strains of mice: seventh listing for the international committee on standardized genetic nomenclature for mice. Cancer Res. 1980, 40: 2083-2128.Google Scholar
- van Boxtel R, Gould MN, Cuppen E, Smits BM: Enu mutagenesis to generate genetically modified rat models. Methods Mol Biol. 2010, 597: 151-167. 10.1007/978-1-60327-389-3_11.View ArticleGoogle Scholar
- Smits BM, Mudde JB, van de Belt J, Verheul M, Olivier J, Homberg J, Guryev V, Cools AR, Ellenbroek BA, Plasterk RH, Cuppen E: Generation of gene knockouts and mutant models in the laboratory rat by enu-driven target-selected mutagenesis. Pharmacogenet Genomics. 2006, 16: 159-169.Google Scholar
- Geurts AM, Cost GJ, Freyvert Y, Zeitler B, Miller JC, Choi VM, Jenkins SS, Wood A, Cui X, Meng X, Vincent A, Lam S, Michalkiewicz M, Schilling R, Foeckler J, Kalloway S, Weiler H, Menoret S, Anegon I, Davis GD, Zhang L, Rebar EJ, Gregory PD, Urnov FD, Jacob HJ, Buelow R: Knockout rats via embryo microinjection of zinc-finger nucleases. Science. 2009, 325: 433-10.1126/science.1172447.View ArticleGoogle Scholar
- Bard JB, Rhee SY: Ontologies in biology: design, applications and future challenges. Nat Rev Genet. 2004, 5: 213-222. 10.1038/nrg1295.View ArticleGoogle Scholar
- Laulederkind SJ, Tutaj M, Shimoyama M, Hayman GT, Lowry TF, Nigam R, Petri V, Smith JR, Wang SJ, de Pons J, Dwinell MR, Jacob HJ: Ontology searching and browsing at the rat genome database. Database (Oxford). 2012, 2012: bas016.View ArticleGoogle Scholar
- Musen MA, Noy NF, Shah NH, Whetzel PL, Chute CG, Story MA, Smith B, team N: The national center for biomedical ontology. J Am Med Inform Assoc. 2012, 19: 190-195. 10.1136/amiajnl-2011-000523.View ArticleGoogle Scholar
- Rubin DL, Lewis SE, Mungall CJ, Misra S, Westerfield M, Ashburner M, Sim I, Chute CG, Solbrig H, Storey MA, Smith B, Day-Richter J, Noy NF, Musen MA: National center for biomedical ontology: advancing biomedicine through structured organization of scientific knowledge. OMICS. 2006, 10: 185-198. 10.1089/omi.2006.10.185.View ArticleGoogle Scholar
- Smith CL, Goldsmith CA, Eppig JT: The mammalian phenotype ontology as a tool for annotating, analyzing and comparing phenotypic information. Genome Biol. 2005, 6: R7-10.1186/gb-2005-6-5-p7.View ArticleGoogle Scholar
- Davis AP, Wiegers TC, Rosenstein MC, Mattingly CJ: Medic: a practical disease vocabulary used at the comparative toxicogenomics database. Database (Oxford). 2012, 2012: bar065.Google Scholar
- Laulederkind SJ, Liu W, Smith JR, Hayman GT, Wang SJ, Nigam R, Petri V, Lowry TF, de Pons J, Dwinell MR, Shimoyama M: Phenominer: quantitative phenotype curation at the rat genome database. Database (Oxford). 2013, 2013: bat015.View ArticleGoogle Scholar
- Shimoyama M, Nigam R, McIntosh LS, Nagarajan R, Rice T, Rao DC, Dwinell MR: Three ontologies to define phenotype measurement data. Front Genet. 2012, 3: 87.View ArticleGoogle Scholar
- Nigam R, Laulederkind SJ, Hayman GT, Smith JR, Wang SJ, Lowry TF, Petri V, de Pons J, Tutaj M, Liu W, Jayaraman P, Munzenmaier DH, Worthey EA, Dwinell MR, Shimoyama M, Jacob HJ: Rat genome database: a unique resource for rat, human and mouse quantitative trait locus (qtl) data. Physiol Genomics. 2013, 18: 809-816.View ArticleGoogle Scholar
- Garrett MR, Rapp JP: Two closely linked interactive blood pressure qtl on rat chromosome 5 defined using congenic dahl rats. Physiol Genomics. 2002, 8: 81-86.View ArticleGoogle Scholar
- Cowley AW, Roman RJ, Jacob HJ: Application of chromosomal substitution techniques in gene-function discovery. J Physiol. 2004, 554: 46-55. 10.1113/jphysiol.2003.052613.View ArticleGoogle Scholar
- Laulederkind SJ, Hayman GT, Wang SJ, Lowry TF, Nigam R, Petri V, Smith JR, Dwinell MR, Jacob HJ, Shimoyama M: Exploring genetic, genomic, and phenotypic data at the rat genome database. Curr Protoc Bioinformatics. 2012, Chapter 1: Unit1 14.Google Scholar
- Shimoyama M, Smith JR, Hayman T, Laulederkind S, Lowry T, Nigam R, Petri V, Wang SJ, Dwinell M, Jacob H, Team RGD: Rgd: a comparative genomics platform. Hum Genomics. 2011, 5: 124-129.View ArticleGoogle Scholar
- Day-Richter J, Harris MA, Haendel M, Lewis S, Gene Ontology OBOEWG: Obo-edit–an ontology editor for biologists. Bioinformatics. 2007, 23: 2198-2200. 10.1093/bioinformatics/btm112.View ArticleGoogle Scholar
- Smith B, Ashburner M, Rosse C, Bard J, Bug W, Ceusters W, Goldberg LJ, Eilbeck K, Ireland A, Mungall CJ, Consortium OBI, Leontis N, Rocca-Serra P, Ruttenberg A, Sansone SA, Scheuermann RH, Shah N, Whetzel PL, Lewis S: The obo foundry: coordinated evolution of ontologies to support biomedical data integration. Nat Biotechnol. 2007, 25: 1251-1255. 10.1038/nbt1346.View ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.