Professional Documents
Culture Documents
Wilson, and the Association for Molecular Pathology Training and Education Committee
Author information Article notes Copyright and License information This article has been corrected. See J Mol Diagn. 2010 September 1; 11(5): 494. This article has been cited by other articles in PMC.
ABSTRACT
The complexity and inherent variation of the human genome sequence have placed unprecedented demands on bioinformatics resources to assure organized data management.1 Genomic data are continuously translated into clinical molecular tests, and laboratory reports are generated for patient management and clinical and epidemiological studies. The consistent use of uniform nomenclature in the management of DNA sequence data is especially critical for concise communication of diagnostic testing and genetic risk assessment. Just as standards were established early in the Human Genome Project for uniform documentation and collation of sequence data, conventions for standardized nomenclature of variant sequencesmutations and polymorphismshave been developed and promulgated.2,3,4,5 Although in this article we use the term mutation to imply a deleterious genetic sequence variation, our discussion here is relevant to all small genetic sequence variations, whether neutral or deleterious. Despite the nominal acceptance of these standards, clinical mutation testing and screening for major genetic disorders still suffer from the use of nonstandard and variable mutation nomenclature. In fact, colloquial designations for mutations of clinical importance are used so broadly that many geneticists and molecular diagnosticians are probably unaware that they are nonstandard. This may cause confusion when cross-referencing between the original literature and modern databases. In an effort to clarify the nomenclature recommendations of the Human Genome Variation Society (HGVS), we first briefly illustrate how to name a particular sequence variant (either novel or known) using standard nomenclature. Recommendations for methods of interpreting sequence variants, whether deleterious or neutral, have been reviewed elsewhere.6 We next raise issues of standard and nonstandard nomenclature in a limited number of examples of genes that have been commonly used for molecular diagnostics. It is noteworthy that nomenclature problems exist not only for germline mutations and polymorphisms but also for somatic alterations in genes associated with cancer. Go to:
and noncoding untranslated regions [5 untranslated region (UTR) and 3-UTR]; splice variants may lack one or more of the coding exons. Nucleotide numbering is in relation to the translation initiation codon, starting with number 1 at the A of the ATG. Standard mutation nomenclature based on coding DNA reference sequences and protein-level amino acid sequences requires prefixes c. and p., respectively, as in Figure 2. Standard nomenclature based on genomic DNA reference sequences and RNA reference sequences is not shown. Genomic DNA reference sequence simply indicates any human DNA sequence in the database that is not based on a cDNA sequence. Standard mutation nomenclature based on a genomic DNA reference sequence requires a prefix g. and numbering starts with number 1 for the first nucleotide in the file.
Figure 1 Example of nucleotide numbering based on a coding DNA sequence. Exonic sequences are numbered sequentially from the initiation codon to the stop codon. Untranslated sequences in the 5- and 3-UTRs, as well as in intronic sequences, are ...
Figure 2 Example of standard mutation nomenclature based on a coding DNA sequence. Note that the amino acid change for c.1A>T is described as p.0? because amino acid changes secondary to codon 1 mutations are frequently ... Figure 3 illustrates the process for finding a reference sequence that describes a novel mutation or for searching for the sequence surrounding a particular mutation. As shown in Figure 3, it is essential to find and use the gene symbol approved by the Human Genome Organisation (HUGO) Gene Nomenclature Committee (HGNC; http://www.gene.ucl.ac.uk/nomenclature/index.html).7,8 A major problem has been the highly variable use of gene nomenclature in the literature, producing multiple symbols and names for one and the same gene9,10 or one gene/protein symbol that stands for completely different genes or proteins.11,12,13,14 Up to one third of human genes may have been affected by the homonym problem,15 mainly because of the nonuse of HGNC-approved official gene symbols.
Figure 3 How to find a DNA reference sequence and HGNC-approved gene symbol. BLAST, Basic Local Alignment Search Tool; HUGO, Human Genome Organisation; NCBI, National Center for Biotechnology Information.
In addition to the use of the HGNC-approved gene symbol, one needs to find the most appropriate reference sequence for a novel mutation. The most appropriate reference sequence may be a coding DNA sequence based on full-length mRNA or a genomic DNA reference sequence. Even if one finds the mutation based on a reference sequence, it may not be the most updated or the most appropriate reference sequence. For example, the reference sequence that has been used to identify a novel exonic mutation might comprise the sequence of only one exon of the gene. In this case, it is appropriate to search for a coding DNA reference sequence based on full-length cDNA. Go to:
Table 1 Standard and Colloquial Nomenclature for CFTR Mutations and Variants
Go to:
of CFTR using both genomic reference sequences and the coding DNA reference sequence (NM_000492.3) in Table 1. Go to:
demonstrated, because those amino acid changes likely indicate pathogenic mechanisms of the mutations and provide clinically important information. Go to:
diagnostic reagent manufacturers and has set a standard in these fields. Consequently, the fact that not all journals or medical societies insist on consistent use of HGVSrecommended nomenclature may cause additional confusion; for example, a novel mutation inCFTR might be described in one report by standard nomenclature and in another report by the traditional description. Therefore, the use of HGVS standard nomenclature can be accompanied by the colloquial term in parentheses. To avoid confusion in the future, the HGVS standard nomenclature should be used for newly discovered CFTR mutations as well as those in other genes of clinical or research interest. Many reports describe new mutations only in terms of the amino acid change, despite the degeneracy of the amino acid code. It is conceivable that future therapies targeted to specific alterations in DNA would be different for different mutations that cause the same amino acid change. Mutation nomenclature should be unequivocal and should be described at the DNA level as discussed in the previous section. Go to:
SUMMARY
We have raised issues of standard and nonstandard nomenclature of gene variants and mutations, using a limited number of commonly tested genes as examples, particularly CFTR. Similar issues and problems exist for many other genes. The confusion surrounding nomenclature has potential far-reaching impact, and care needs to be taken to communicate accurately. As we move forward in defining nomenclature rules, it is important for us (laboratory scientists) to educate ourselves and other health care professionals so that standard nomenclature of gene variants and mutations is used uniformly across a wide variety of medical specialties. Go to:
NOTE:
Standardized report format, including standard nomenclature for gene sequence variants, is important in laboratory medicine and clinical practice. College of American Pathologists Molecular Pathology Committee has been compiling comprehensive recommendations for molecular diagnostics laboratories (ML Gulley, RM Braziel, KC Halling, ED Hsi, JA Kant, MN Nikiforova, JA Nowak, S Ogino, A Oliveira, HF Polesky, L Silverman, RR Tubbs, VM Van Deerlin, GV Vance, J Versalovic, for the College of American Pathologists Molecular Pathology Resource Committee; Clinical Laboratory Reports in Molecular Pathology, revised version submited to Arch Pathol Lab Med). Go to:
ACKNOWLEDGMENTS
We thank Wayne Grody, S. Terence Dunn, Marsha Speevak, Daniel Farkas, Victoria Pratt, Jeffrey Kant, and the Council, Publications Committee and Genetics Subdivision of the
Association for Molecular Pathology for helpful discussion and critical reading of the manuscript. Go to:
FOOTNOTES
The 2005 Association for Molecular Pathology Training and Education Committee consists of Deborah Payne (Chair), Mary C. Lowery Nordberg, Jerald Z. Gong, Amy E. Krafft, Shuji Ogino, Timothy S. Uphoff, Peter Donahue, Jennifer Hunt, and Gladys Garrison. Standard of practice is not being defined by this article, and there may be alternatives
Nomenclatura mutaiilor
Nomenclatura mutaiilor a fost stabilit de ctre Genome Database Nomenclature Commitee i poate fi accesat la adresahttp://www.gene.ucl.ac.uk/nomenclature/ Mutaiile pot fi descrise n dou modaliti: prin efectul lor sau prin descrierea secvenei modificrii. Nomenclatura pe baza efectului alelei mutante este folosit mai ales la organismele de laborator. O alel nul sau amorf nu produce nici-un efect; o alel hipomorf produce o cantitate sau o activitate redus a produsului genei; o alel hipermorf determin o cantitate sau activitate crescut de produs; o alel neomorf este o alel cu activitate sau produs nou; o alel antimorf este o alel a crei activitate sau produs antagonizeaz activitatea produsului normal. Nomenclatura pentru descrierea mutaiei este ampl (vezi Antonarakis et al.,1998). Vom prezenta aici cteva repere. Substituiile de amino acizi. Reziduurile de aminoacizi (desemnai prin codul de trei litere sau cel cu o singur liter[1]; vezi tabelul 4.2) sunt numrate ncepnd cu metionina (corespunztoare codonului iniiator) considerat numrul 1. Astfel mutaia cu sens greit n codonul 6 al genei pentru -globin i substituia acidului glutamic cu valina ce produce sicklemia se scrie: Glu6Val sau E6V. O mutaie non sens ce nlocuiete glutamina din poziia 39 a -globinei cu un codon stop, producnd o form trunchiat a globinei i determinnd o-talasemia, se scrie Gln39X sau Q39X Substituiile nucleotidice. n descriere se va scrie mai nti baza original, numrul nucleotidului cu aceast baz, simbolul mai mare (>) i apoi noul nucleotid din poziia respectiv. Numrul nucleotidului se calculeaz de la primul nucleotid A din codonul iniiator (start) ATG care este +1 (nucleotidul precedent este -1; nu exist nucleotid 0). Dac este necesar se va folosi g. i c. pentru a indica o secven de ADN genomic sau ADN complementar. De exemplu, o mutaie cu sens greit se poate scrie G1444>A (uneori se mai scrie 1444G>A) Pentru modificri n introni, se va specifica numrul intronului (desemnat cu iniialele IVS de la intervening sequence), numrul nucleotidului (+1, +2) ncepnd cu situl donor 5 sau invers (-1, -2) de la situsul 3acceptor. D ex., IVS33+2T>A reprezint o mutaie n intronul 33 n situsul donor, n care T este nlocuit cu A.
Deleiile i inseriile. Deleiile mici sunt indicate prin termenul del scris dup numerele nucleotidelor deletate; de ex., 1524-1527del sau 1524-1527delAGTA deleia a 4 nucleotide (AGTA). Pentru a preciza modificarea aminoacizilor atunci se scrie mai nti simbolul lor. De ex., F508del semnific deleia fenilalaninei n poziia 508 (aceast modificare frecvent n mucoviscidoz s-a scris iniial F508, semnul delta semnificnd deleie). Inseriile mici sunt desemnate similar cu termenul ins dup cele dou nucleotide ntre care se produce inseria, urmat de nucleotidele inserate. De ex., 1277-1278insTATC denot inseria a patru baze ntre nucleotidele 1277 i 1278, o cauz comun a bolii Tay-Sachs
[1] Codul cu o liter: A-alanina, C-cisteina, D-acidul aspartic, E.acidul glutamic, F-fenilalanina, G-lizina, Hhistidina, I-isoleucina, K-lizina, M-metionina, N-asparagina, P-prolina, Q-glutamina, R-arginina, S-serina, Ttreonina, V-valina, W-triptofanul, Y-tirozina; X-semnific un codon stop.
[2] n unele boli, din motive neclare, aceast probabilitate crete considerabil; de ex. n
acondroplazie mutaia CpG din gena pentru FGFR3 este de cel puin 100 de ori mai frecvent dect cea interesnd alte dinucleotide.
[3] Efectul depinde de poziia codonului stop, la nceputul genei mai grav dect n regiunea terminal
a genei.
[4] S-a stabilit prin convenie (1993) c nomenclatura tripletelor repetitive s se fac n or dinea
produc i expansiuni i regresii (contracii) dar, pentru motive neclare, predomin expansiunile.
[6] Imediat sub prag repetiia este nepatogen, imediat peste prag ea produce boal. [7] Unii autori consider c tractul poliglutaminic din produsul genei determin agregarea proteinei n
anumite celule, pe care le omoar; alii cred c regiunea poliglutaminic ar interfera activitatea enzimei glicolitice gliceraldehid-3-fosfat dehidrogenaza determinnd o reducere masiv a produciei de energie din glucoz; stresul metabolic este sever n neuroni, care obin energie numai din glucoz.
[8] Existena dezechilibrului de nlnuire exclude posibilitatea ca expansiunile trinucleotidice s
surse medicale) iar doza de dublare (valoarea de iradiere dincolo de care proporia mutaiilor se dubleaz) este de 30-80 remi per generaie.
[12] Metilarea este un proces postreplicativ i catena nou sintetizat nu este imediat metilat.
[13] Cea mai abundent dintre ADN-glicozilaze este uracil ADN-glicozilaza (UNG) care produce ndeprtarea reziduurilor de uracil ce rezult prin dezaminarea citozinei.
[14] Complexul BASC conine proteinele BRCA1, ATM, complexul NBS1-MRE11-RAD50, proteinele MSH2/MSH6 i MLH1 care intervin i n calea MMR i helicaza BLM . [15] ISCN-(International System of Chromosomal Nomenclature) Sistemul Internaional de
Nomenclatur a Cromosomilor
[16] Unii autori mpart anomaliile de numr n poliploidii, aneuploidii i mixoploidii (mozaicuri i
himere).
[17] Celulele normale sunt disomice pentru fiecare autosom i cromosomul X la femeie. [18] Efectul fenotipic al unui astfel de mozaic poate fi mai grav n condiiile n care prin pierderea
cromosomului suplimentar rezult o disomie uniparental, iar cromsomul resepctiv are loci amprentai genetic (vezi capitolul 5.C i 5.D)
[20] Prezena ambelor centromere rmase active determin anomalii de ataare la fibrele fusului de
diviziune i dificulti de segregare n cursul diviziunii, ceea ce conduce la dispariia frecvent a lor.
[21] S-a discutat i reducerea competenei imunologice dintre mam i ft odat cu cretere vrstei
materne, fapt ca ar permite supravieuira embrionilor trisomici; lipsesc probe pentru aceast ipotez.
[22] Mecanismul afectrii hepatice, diferit de afectarea pulmonar, implic agregarea i stocarea
[A1]De adaugat o completare privind clusterul serpinelor si, la sfrit agregarea hepatic precum si mutaia ce determin alterarea coagulrii; vezi Wilson, paginile 173-174