Presentation on theme: "Tissue Microarray Data Standards"— Presentation transcript:
1 Tissue Microarray Data Standards Jules J. Berman, Ph.D., M.D. Program Director, Pathology Informatics Cancer Diagnosis Program National Cancer Institute *All opinions herein are Dr. Berman’s and do not represent those of any federal agency.
2 Letter Format from U of Colorado at Boulder Style Guide LETTERHEAD HEREJuly 14, 2000Joe Doe1234 Main StreetHometown, CO 80000Dear Mr. Doe:When typing formal letters on CU letterhead, leave at least one inch of white space below the letterhead and use one-inchmargins on the left, right, and bottom sides. Align all type flush left with no paragraph indentations. Double space betweenparagraphs.As for other spacing, it's traditional to triple space between the date and the address block, double space between the addressblock and the salutation, double space after the salutation, and leave four lines for the signature. However, it's fine to onlydouble space after the date, especially if doing do will save the letter from running on to a second page.This block style is the current business letter writing style standard and has been approved by efficiency experts because itsaves keystrokes. To readers, this flush left style looks more contemporary than the indented paragraph style. Finally, adoptingthis style will help convey a consistent image for all CU correspondence.Sincerely,Ann WrightStyle Guide Editor
3 Valid XML document for letter <?xml version="1.0"?><document name="letter"><style name="block"></style><creation_date>July 14, 2000</creation_date><addressee>Joe Doe</addressee><addressee_address>1234 Main Street Hometown, CO 80000</addressee_address><salutation>Dear Mr. Doe:</salutation><letter_body>When typing formal letters on CU letterhead, leave at leastone inch of white space below the letterhead and use one-inchmargins on the left, right, and bottom sides. Align all typeflush left with no paragraph indentations. Double space betweenparagraphs.</letter_body><closing>Sincerely,</closing><sender_name>Ann Wright</sender_name><sender_title>Style Guide Editor</sender_title></document>
4 Other letter data elements may include: <cc><sender address><date_received><time_received><time_sent><typist><hidden comment> (<!--)
5 There will be two dominant uses of XML parsers: 1. To parse on any valid XML document in order to find the Common Data Elements2. To process specific XML document types:Example:Gene array XML documentsTissue Microarray XML documentsSurgical Pathology Report XML documents
6 #!/usr/bin/perl -wuse XML::Parser;$p1 = new XML::Parser(Style => 'Debug');$p1->parsefile('xmlshort.xml');exit;
7 C:\newinfo>perl xmlshort.pl \\ (name letter)document || #10;document \\ (name block)document //document \\ ()document creation_date || July 14, 2000document addressee || Joe Doedocument addressee_address || 1234 Main Street Hometown, CO 80000document salutation || Dear Mr. Doe:document letter_body || #10;document letter_body || When typing formal letters on CU letterhead, leave at least one inch of white space below the letterhead and use one-inch margins on theleft, right, and bottom sides. Align all type flush left with no paragraph indentations. Double space between paragraphs.document closing || Sincerely,document sender_name || Ann Wrightdocument sender_title || Style Guide Editor//C:\newinfo>
8 CDE for Date of Birth |birthdate| September 15, 1970 |birthday| September 15, 1970|D.O.B.| September 15, 1970|d.o.b.| September 15, 1970|date of birth| September 15, 1970|date-of-birth| September 15, 1970|date_of_birth| September 15, 1970|dob| September 15, 1970|DOB| September 15, 1970
9 Representation of CDE |date_of_birth| September 15, 1970 |date_of_birth| some delta time
10 CDEs become XML tags<date_of_birth>10/17/00</date_of_birth>
11 CDEs become self-attributing XML tags <date_of_birth defn=“http://www.cde.org”>10/17/00 </date_of_birth>
12 The difference between Uniform Standards and Uniform Requirements 1. Standards are about giving people an opportunity to cooperate and share. All good standards are publicly available and free.2. Requirements are about forcing people to act a certain way.
13 Minimum information about a [GENE] microarray experiment - MIAME Draft March 21, 2001, based on November 17, 2000The goal of the MIAME is to specify the minimum information that must be reported about a microarray (or any DNA array) based gene expression monitoring experiment in order to ensure the interpretability, as well as potential verification of the results by third parties. The background aim is to facilitate the establishing public repositories and data exchange format for microarray based gene expression data. The MGED group will be encouraging the scientific journals and funding agencies to adopt policies requiring data submissions to repositories, once MIAME compliant repositories are established.
14 a.array related information unique ID as used in part 1 (for commercial or standard arrays aunique ID given by the provider may be used)array design name (e.g., "Stanford Human 10K set")platform type: insitu synthesized or spottedarray provider (source)surface type: glass, membrane, othersurface type namearray support (e.g. slide) dimensionsnumber of elements on the arraya reference system allowing to locate each element (spot) on thearray (in the simplest case the number of columns and rows issufficient)production protocol (obligatory if applicable)optional "qualifier, value, source" list (see Introduction)