Top Banner
September 26 & 27, 2011
81

September 26 & 27, 2011

Feb 26, 2016

Download

Documents

ehren

September 26 & 27, 2011. No Smoking!. (yes, they are serious about this). Facilities. Restrooms at the ends of the hall and upstairs. Lunch will be on your own. The Natcheteria is upstairs, and they have been warned that we are here. Details on box lunches will come. - PowerPoint PPT Presentation
Welcome message from author
This document is posted to help you gain knowledge. Please leave a comment to let me know what you think about it! Share it to your friends and learn new things together.
Transcript
Page 1: September 26 & 27, 2011

September 26 & 27, 2011

Page 2: September 26 & 27, 2011

No Smoking!(yes, they are serious about this)

Page 3: September 26 & 27, 2011

Facilities• Restrooms at the ends of the hall and upstairs.• Lunch will be on your own. The Natcheteria is

upstairs, and they have been warned that we are here. Details on box lunches will come.

• Coffee Breaks will be in the back of the room (and most likely spilling out into the hall).

• Parking• We are close here, so please be a good

neighbor.

Page 4: September 26 & 27, 2011

Program

• Opportunities for questions and discussion after each presentation.

• Please use the microphones.• And tell us who you are.

• There will be an open session on Tuesday afternoon. Bring your suggestions, questions, and gripes.

Page 5: September 26 & 27, 2011

Thank You to the Peer Reviewers

• Steve DeRose• Jeff Fisher• Daniel Grossberg• Mirko Janc• Sheila Morrissey• Evan Owens• Bruce Rosenblum• Sasha Schwarzman• Kim Tryka

Page 6: September 26 & 27, 2011

And to the Program Committee

• Laura Kelly• Deborah Lapeyre• B. Tommie Usdin

(Who were also peer reviewers)

Page 7: September 26 & 27, 2011

September 26 & 27, 2011

Page 8: September 26 & 27, 2011

Interesting things that have happened since we last met

• NISO Z39.96 JATS version 0.4 was released as a Draft Standard for Trial Use on March 30, 2011

• NCBI hosted a meeting in August 2011 to update the NCBI Book DTD

Page 9: September 26 & 27, 2011

NISO JATS version 0.4

• Release March 30, 2011.• As a Draft Standard for Trial Use• 6-month comment period ends September 30• Project name has shifted from “NLM DTD” to

“Journal Article Tag Suite (JATS)” or “NISO JATS”

Page 10: September 26 & 27, 2011

Why the name change?• Because the standard doesn't define a “DTD”

it defines a set of XML elements and attributes and three Article models (Tag Sets) in plain text: - Archiving and Interchange Tag Set,- Journal Publishing Tag Set- Article Authoring Tag Set.

• Coincidentally, these Tag Sets correspond to the three “old NLM DTDs”- Archiving and Interchange DTD,- Journal Publishing DTD- Article Authoring DTD.

Page 11: September 26 & 27, 2011

In plain text?

Page 12: September 26 & 27, 2011
Page 13: September 26 & 27, 2011
Page 14: September 26 & 27, 2011

New home for JATS files at NLM

• New home website created for the NISO JATS non-normative supporting documents:

http://jats.nlm.nih.gov

This site includes schema versions (DTD, XSD, RNG) for each of the Tag Sets along with the documentation (Tag Library) for each.

Page 15: September 26 & 27, 2011

http://jats.nlm.nih.gov/publishing/0.4/JATS-journalpublishing0.dtd

Page 16: September 26 & 27, 2011

http://jats.nlm.nih.gov/publishing/tag-library/0.4

Page 17: September 26 & 27, 2011

And dtd.nlm.nih.gov?

• Still includes all documentation for older versions, NLM DTD 1.0 through 3.0.

• At the Tag Set level, the old site and new site are linked together pretty well.

Page 18: September 26 & 27, 2011
Page 19: September 26 & 27, 2011

What happens next?

After the Comment Period closes on September 30, the NISO Working Group will address all comments and produce a new version of the Standard Document (NISO JATS 1.0) along with all of the non-normative supporting documents (schemas and documentation).

Page 20: September 26 & 27, 2011

Important Conference Info!

Any suggestions that come from the floor of this meeting will not be considered official comments.

(similarly for any suggestions made to one of the Working Group members in the hall or in an email or in a note passed at recess.)

All comments must be made through the NISO website to be considered by the Working Group http://www.niso.org/apps/group_public/document.php?document_id=6135).

Page 21: September 26 & 27, 2011

• Then Version 1.0 will be voted on by the NISO members.

• The Standard will be in Continuous Maintenance, which means we will continue to accept comments from users and continue to update the Standard Document and the Supporting Documentation.

Page 22: September 26 & 27, 2011

The Book Model

• NCBI hosted a meeting in August 2011 to update the NCBI Book DTD.

• The goal was to build a general book model from the JATS.

Page 23: September 26 & 27, 2011

Some decisions included:

• The new book model is not intended for ALL books. – Technical monographs, government

reports, multivolume monographic series, reference books, encyclopedias, college-level textbooks, and professional books were all declared to be in scope.

– Children's books, legislative materials, scripture and sacred literature, dictionaries, magazine-like books, and K-12 textbooks were all declared to be out of scope.

Page 24: September 26 & 27, 2011

• Like the JATS, the New Book Model will focus on tagging content rather than formatting or layout.

• The new model is intended for publishers who have journal content in JATS who wish to tag their book content in a similar way.

• A book "chapter" body is very similar in structure to a journal article body.

Page 25: September 26 & 27, 2011

• The book model will use the same structures and element names from JATS where it makes sense.

• Books may exist as a single XML document or a collection of book-part- (e.g. chapter)-level documents.

• The book model will be based on NISO JATS 1.0(so, it will not be released until AFTER NISO Z39.96 JATS 1.0 is official)

Page 26: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0

Page 27: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Page 28: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

Page 29: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

This was when the Article Archiving and Journal Publishing models became more open and we added the Authoring model.

Page 30: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

Page 31: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

Decision to formalize standard with NISO

Page 32: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

Page 33: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NLM

DTDs v 3.1

NLM DTD Working Group is dissolved, and the NISO Journal Article Tag Suite Working Group is created.

Page 34: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NISO

Z39.96 JATS v 0.4M

arch 2011

Page 35: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NISO

Z39.96 JATS v 0.4M

arch 2011

September 30, 2011

End of public comment period

Page 36: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NISO

Z39.96 JATS v 0.4M

arch 2011

September 30, 2011

End of public comment period

FUTURE!

Page 37: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NISO

Z39.96 JATS v 0.4M

arch 2011

September 30, 2011

End of public comment period

FUTURE!

NISO

Z39.96 JATS v 1.0

Page 38: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NISO

Z39.96 JATS v 0.4M

arch 2011

September 30, 2011

End of public comment period

FUTURE!

NISO

Z39.96 JATS v 1.0

Page 39: September 26 & 27, 2011

March 2003

NLM

DTDs v 1.0N

ovember 2003

NLM

DTDs v 1.1

Novem

ber 2004 N

LM DTDs v 2.0

September 2005

NLM

DTDs v 2.1

NLM

DTDs v 2.2June 2006

NLM

DTDs v 2.3M

arch 2007

NLM

DTDs v 3.0N

ovember 2008

Backward-incompatible release

NISO

Z39.96 JATS v 0.4M

arch 2011

September 30, 2011

End of public comment period

FUTURE!

NISO

Z39.96 JATS v 1.0

Expect to release new NCBI/NLM Book Model just after NISO JATS 1.0 is released

Page 40: September 26 & 27, 2011

What’s the new book model called?

• Some names either considered and rejected or not considered:– JATSBook– DocJATs– BATS– Buttercup

• Book Interchange Tag Suite (BITS)

Page 41: September 26 & 27, 2011

Best Practices or Common Practice?

Page 42: September 26 & 27, 2011
Page 43: September 26 & 27, 2011
Page 44: September 26 & 27, 2011
Page 45: September 26 & 27, 2011
Page 46: September 26 & 27, 2011
Page 47: September 26 & 27, 2011
Page 48: September 26 & 27, 2011
Page 49: September 26 & 27, 2011
Page 50: September 26 & 27, 2011
Page 51: September 26 & 27, 2011
Page 52: September 26 & 27, 2011
Page 53: September 26 & 27, 2011
Page 54: September 26 & 27, 2011
Page 55: September 26 & 27, 2011
Page 56: September 26 & 27, 2011

• Sometimes people want to be told what to do.

Page 57: September 26 & 27, 2011

• Working on the JATS project, we get a lot of requests for "Best Practices”

“How do I tag …?”

Page 58: September 26 & 27, 2011

• But what are you asking for when you are asking for “Best Practices”?

• And how do you feel about someone who knows what’s best for you?

Page 59: September 26 & 27, 2011
Page 60: September 26 & 27, 2011

• The JATS Tag Sets - especially the Archiving and Interchange and even the Journal Publishing are very flexible models that allow content to be tagged in different ways

Page 61: September 26 & 27, 2011

The DTD Spectrum

Conversion Creation

Is Journal Publishing meeting our needs?

Not really. It is too restrictive for some users, and not prescriptive enough to be a good Authoring model.

Page 62: September 26 & 27, 2011

The DTD Spectrum

Conversion Creation

Article Authoring DTD

Page 63: September 26 & 27, 2011

A reasonable question• (1) It seems from the element reference page for <chem-struct-wrap> that one could

omit explicit labels because "A <chem-struct-wrap> may also be numbered, automatically by a formatting application or by preserving the number inside a <label> element." Having seen this, but not found similar comments about "automatic numbering" for other elements that may typically be numbered/labelled, I would like to know what the assumption is about omitting labels in general for these (e.g. chemical structures, equations, figures, tables, etc.): is a formatting application expected by default to generate a number/label? If so, is there a way to suppress numbering for some occurrences?

• (2) Relatedly, what is the expected behaviour for an <xref> element that has no content (e.g. one that (a) references an element for which automatic numbering has been assumed and which therefore lacks a <label>, or (b) one that references an element possessing a <label>)?

• Message from Simon Newton to [email protected] on September 7, 2011

Page 64: September 26 & 27, 2011

A reasonable question• (1) It seems from the element reference page for <chem-struct-wrap> that one could

omit explicit labels because "A <chem-struct-wrap> may also be numbered, automatically by a formatting application or by preserving the number inside a <label> element." Having seen this, but not found similar comments about "automatic numbering" for other elements that may typically be numbered/labelled, I would like to know what the assumption is about omitting labels in general for these (e.g. chemical structures, equations, figures, tables, etc.): is a formatting application expected by default to generate a number/label? If so, is there a way to suppress numbering for some occurrences?

• (2) Relatedly, what is the expected behaviour for an <xref> element that has no content (e.g. one that (a) references an element for which automatic numbering has been assumed and which therefore lacks a <label>, or (b) one that references an element possessing a <label>)?

• Message from Simon Newton to [email protected] on September 7, 2011

Page 65: September 26 & 27, 2011

A reasonable question• (1) It seems from the element reference page for <chem-struct-wrap> that one could

omit explicit labels because "A <chem-struct-wrap> may also be numbered, automatically by a formatting application or by preserving the number inside a <label> element." Having seen this, but not found similar comments about "automatic numbering" for other elements that may typically be numbered/labelled, I would like to know what the assumption is about omitting labels in general for these (e.g. chemical structures, equations, figures, tables, etc.): is a formatting application expected by default to generate a number/label? If so, is there a way to suppress numbering for some occurrences?

• (2) Relatedly, what is the expected behaviour for an <xref> element that has no content (e.g. one that (a) references an element for which automatic numbering has been assumed and which therefore lacks a <label>, or (b) one that references an element possessing a <label>)?

• Message from Simon Newton to [email protected] on September 7, 2011

Page 66: September 26 & 27, 2011

• Simon is asking for “Best Practices”

• So I was thrilled to see the following response:

Page 67: September 26 & 27, 2011

I don't think any assumptions are made regarding when and exactly how numbering should be automated; there is only a recognition that it commonly done in publishing systems, and JATS is designed to support this (or no numbering at all) or not, depending on local policies.

Neither is there any expectation that by default, a formatting application will number things.

This means you have both the opportunity and the burden to define a policy that makes the most sense for your data and workflow.

Message from Weldell Piez to [email protected] on September 8, 2011

Page 68: September 26 & 27, 2011

I don't think any assumptions are made regarding when and exactly how numbering should be automated; there is only a recognition that it commonly done in publishing systems, and JATS is designed to support this (or no numbering at all) or not, depending on local policies.

Neither is there any expectation that by default, a formatting application will number things.

This means you have both the opportunity and the burden to define a policy that makes the most sense for your data and workflow.

Message from Weldell Piez to [email protected] on September 8, 2011

Page 69: September 26 & 27, 2011

Best Practices must be scoped

• They must make sense with your content.… with your workflow… and for any users of your content down

the line.

Page 70: September 26 & 27, 2011

A “quick” question

Question on usage sent in an email earlier this week from a large medical journal on "how the NLM uses the JATS Tags":

“We have just a quick question for NLM: Do you use the article-type attribute values? If we were to use our own attribute [value]s, would that pose problems for aggregators?”

Page 71: September 26 & 27, 2011

Parse the question

What are the Best Practices for @article-type values?

Will it give our users problems if we just made something up?

Page 72: September 26 & 27, 2011

I had to answer from 2 points of view:

Factual content about the @article-type attribute:

The @article-type attribute was a controlled list of values in the Journal Publishing model until we moved to version 3.0 at the end of 2008, which should give you an idea of how important the NLM DTD Working Group thought that that attribute is.

Page 73: September 26 & 27, 2011

More facts

The one thing I would like to remind you is that the @article-type attribute is really for identifying the article type to your processors – it is not "Content". The area we've designed to display the article headings (like TOC headings for example) is the <article-categories> element with the <subj-group> structure inside.

Page 74: September 26 & 27, 2011

More facts

So, for example. If you have three journals and each one displayed corrections with a different heading, you could use @article-type="correction" in all of the files so that your processing systems wouldn't need to know that you make this distinction. But then you could have one journal that uses <subject>Correction</subject>, and two that use <subject>Erratum</subject>.

Page 75: September 26 & 27, 2011

And then my opinion about how the article-type should be used (based on what we do in PMC) from the point of view of Aggregators.

The short best answer is that you probably need to speak with your aggregators about the values that you will be using and they will be accepting. Aggregators are going to want to normalize the list of article types so that they don't have to process many different values that mean the same thing in their system. This can happen in two ways.

Page 76: September 26 & 27, 2011

The easiest way for the aggregator is for them to give you a set of values to use and anything outside of that set is not accepted. The way that a lot of aggregators work is that they normalize these values on the way in to their preferred set. But this means you will need to have a conversation with them to map your values to their values. In reality at PMC, we see things a little of both ways.

Page 77: September 26 & 27, 2011

We have the PMC Tagging Guidelines (http://www.ncbi.nlm.nih.gov/pmc/pmcdoc/tagging-guidelines/article/style.html) – which is essentially a "Best Practices" for tagging articles in NLM XML for submission to PMC. We have a list of @article-type attribute values there that we will accept. If we find something outside of this set in a submission that can obviously map to one of our values, we will normalize it on ingest. For example, we use @article-type="correction" in PMC.

Page 78: September 26 & 27, 2011

When we get a submission with @article-type="erratum" or @article-type="errata", we convert it to @article-type="correction". But if we get a submission with @article-type="expression-of-embarassment" or @aricle-type="do-over", we will contact the submitter to be sure we have the mapping right.

Page 79: September 26 & 27, 2011

Essentially - If you were sending this to PMC, this is what we would do. OR "Your data in MY workflow can be handled in these ways".

The “Best Practices” are related to the project.

Page 80: September 26 & 27, 2011

• Which is just restating Wendell's position. She has both the burden and the opportunity to define a policy that makes sense for her data and her workflow.

• So, Best Practices are great - within the scope of a project.

Page 81: September 26 & 27, 2011

Time to get on with the conference.