Foundry MIGS table

From MIBBI

Jump to: navigation, search
Index Item name Definition Examples
1. MIGS Investigation
1.1. Investigation
1.1.1. Project name or identifier
1.2. Submit to trace archives and INSDC Depending on the study (large-scale e.g. done with next generation sequencing technology, or small-scale) sequences have to be submitted to SRA (Short Read Archives), ENA (European Nucleotide Archive), DRA (DDBJ Read Archive) or via the classical Webin/Sequin systems to Genbank, EMBL and DDBJ
1.3. Investigation type Choose from: eukaryote, bacteria_archaea, plasmid, virus, organelle, metagenome, miens-survey, miens-culture
2. MIGS Organism
2.1. Subspecific genetic lineage Information about the genetic distinctness of this lineage; i.e., biovar, serovar, serotype, biovar, or any relevant genetic typing schemes like Group I plasmid. Can also be alternative taxonomic information. Ideally, use subspecific genetic lineage information from CABRI (www.cabri.org).
2.2. Known pathogenicity Those organisms to which the entity is pathogenic. human, animal, plant, fungi, bacteria, etc.
2.3. Observed biotic relationship Free-living or in a host (with named relationship to that host) pathogen, commensal, symbiont, opportunist
2.4. Specific host Where the organism was isolated from a living or dead host, please provide a taxid and report whether it is a lab animal.
2.5. Host specificity or range List of hosts potentially usable by the organism.
2.6. Host disease status Health or disease status of the specific host at the time of collection
2.7. Trophic level The position held by the organism in it's food chain. autotroph, carboxydotroph, chemoautotroph, chemoheterotroph, chemolithoautotroph, chemolithotroph, chemoorganoheterotroph, chemoorganotroph, chemosynthetic, chemotroph, copiotroph, diazotroph, facultative, autotroph, heterotroph, lithoautotroph, lithoheterotroph, lithotroph, methanotroph, methylotroph, mixotroph, obligate, chemoautolithotroph, oligotroph, organoheterotroph, organotroph, photoautotroph, photoheterotroph, photolithoautotroph, photolithotroph, photosynthetic, phototroph
2.8. Propagation This will involve different vocabularies for different taxa (virus) lytic, lysogenic, temperate, obligately lytic; (plasmid) incompatibility group; (eukaryote) asexual, sexual.
2.9. Relationship to oxygen Whether the organism is aerobic or anaerobic aerobe, anaerobe
2.10. Encoded traits Should include key traits like antibiotic resistance or xenobiotic degradation phenotypes for plasmids, converting genes for phage.
3. Organismal nucleic acid component
3.1. Ploidy Ploidy level of genome for given organism or cell type allopolyploid, haploid, tetraploid
3.2. Number of replicons Chromosome count for eukaryotes (haploid), bacteria; segments for viruses
3.3. Extrachromosomal elements
3.4. Estimated size
4. Environment
4.1. Biome Biomes are defined based on factors such as plant structures, leaf types, plant spacing, and other factors like climate. desert, taiga, deciduous woodland, coral reef
4.2. Feature The relevant part of the local environment harbour, cliff, lake
4.3. Matter That which was displaced by the sample, prior to the sampling event. Environmental matter terms are generally mass nouns. air, soil, water
5. Geographic location
5.1. Country The geographical origin of the sample by country. Ideally, country names should be chosen from the INSDC list (http://www.insdc.org/page.php?page=country), or the GAZ ontology (http://bioportal.bioontology.org/visualize/40651)
5.2. Sample site morphology State whether a point, transect or area.
5.3. Latitude and longitude Provide as appropriate, given sample site morphology. Values should be reported in decimal degrees under WGS84.
5.4. Altitude or elevation
5.5. Depth Provide where appropriate. Distance from surface for samples collected underground or underwater.
6. Sampling event
6.1. Date
6.1.1. Date given as YYYY-MM-DD
6.2. Time
6.2.1. Time, given as HH:MM in 24-hour format
6.2.2. State whether UTC or local (give time difference from UTC for local)
6.3. Device or method The apparatus or protocol employed to collect the sample (short phrase only). biopsy, niskin bottle, push core
7. Sample description
7.1. Amount or size of sample collected The gross sample described as appropriate (mass, volume, number, etc.)
8. Sample processing
8.1. Description of process Processing applied to the sample during or after retrieving the sample its environment. Ideally, use an ontology such as OBI (see http://bioportal.bioontology.org/visualize/40832). filtering of sea water, storage in ethanol
9. Sample provenance
9.1. isolation and growth conditions Publication reference in the form of a PubMed ID (pmid), digital object identifier (DOI) or URL describing the isolation and growth conditions for the organism/material.
9.2. Reference for biomaterial The primary publication, if isolated before genome publication; otherwise, the primary genome report.
9.3. Source material identifiers The name of the culture collection, the holder of the voucher or an institution. Could enumerate a list of common resources, as done by the American Type Culture Collection (ATCC), German Collection of Microorganisms and Cell Cultures (DSMZ), etc. Can state 'not deposited'. Ideally use an ontology such as OBI where appropriate; for a browser of OBI terms please see http://bioportal.bioontology.org/visualize/40547
10. Study design
10.1. Design description Provide a short phrase at most. Ideally, use an ontology such as OBI (see http://bioportal.bioontology.org/visualize/40832).
10.2. Factors Use only where appropriate for a given study design. A factor is a parameter that is deliberately varied to study its influence on the outcome, in contrast to confounders, or covariates such as temperature, pH or humidity. Ideally use an ontology such as OBI (see http://bioportal.bioontology.org/visualize/40832).
11. Nucleic acid sequencing
11.1. Nucleic acid preparation
11.1.1. Extraction method Link to a literature reference, electronic resource or a standard operating procedure (SOP) through a DOI, PMID or URL.
11.1.2. Amplification method Link to a literature reference, electronic resource or a standard operating procedure (SOP) through a DOI, PMID or URL.
11.1.3. Cleanup Link to a literature reference, electronic resource or a standard operating procedure (SOP) through a DOI, PMID or URL.
11.2. Library construction
11.2.1. Library size The total number of clones in the library prepared for the project.
11.2.2. Library reads sequenced The total number of clones sequenced.
11.2.3. Library vector The cloning vector types used to construct the library.
11.2.4. Library screening strategy Specific enrichment or screening methods applied before and/or after creating clone libraries. enriched, screened, normalized
11.3. PCR amplification
11.3.1. Target gene Targeted gene or locus name for marker gene studies.
11.3.2. Target subfragment Name of subfragment of a gene or locus. It is important, for example, to identify special regions on marker genes like V6 on 16S rRNA.
11.3.3. PCR primers PCR primers, barcodes and adaptors that were used to amplify the sequence of the targeted gene or locus. If multiple forward or reverse primers are present in a single PCR reaction, describe all the primers used. Primer sequence should be reported in uppercase, except barcode or adaptor sequence, which should be in lowercase.
11.3.4. Multiplex identifiers Molecular barcodes, called Multiplex Identifiers (MIDs), that are used to specifically tag samples in a sequencing run. Their sequence should be reported in uppercase.
11.3.5. PCR conditions
11.3.5.1. Denaturation Provide time and temperature for this step.
11.3.5.2. Annealing Provide time and temperature for this step.
11.3.5.3. Elongation Provide time and temperature for this step.
11.3.5.4. Final elongation Provide time and temperature for this step.
11.4. Sequencing
11.4.1. Sequencing method The type of sequencing used. dideoxysequencing, pyrosequencing, polony
11.4.2. Sequence quality check Indicate if the sequence has been called automatically or has undergone a manual editing procedure (e.g., by inspecting the raw data or chromatograms). Applies only to sequences that are not submitted to SRA,ENA or DRA.
12. Nucleic acid sequencing data processing
12.1. Chimera check State whether the data have been checked for chimeric sequence, and how.
12.2. Assembly
12.2.1. Assembly method How was the assembly done (e.g. with a text based assembler like phrap or a flowgram assembler).
12.2.2. Estimated error rate The estimated error rate associated with the finished sequences (e.g., error rate of 1 in 1000 bp).
12.2.3. Method of calculation of error rate
12.3. Finishing strategy
12.3.1. Status Was the genome project intended to produce a complete or draft genome.
12.3.2. Coverage The fold coverage of the sequencing expressed as 2x, 3x, 18x etc.
12.3.3. Number of contigs Number of contigs produced for the genome.
12.4. Additional information
12.4.1. Relevant Standard Operating Procedures (SOPs)
12.4.2. Relevant electronic resources
Personal tools