ISO Archiving Standards - Ninth US Workshop- Minutes
National Archives - Center for Electronic Records
College Park, MD 20740-6001 USA
September 30 - October 1, 1997
The Ninth US Workshop on Data Archive Standards was held at the
National Archives and Records Administration's (NARA) Archives II Complex
in College Park, MD
on September 30 - October 1, 1997.
The list of attendees is available at:
http://ssdoo.gsfc.nasa.gov/nost/isoas/us09/participants.html.
Action Items
Materials Distributed
Future Meetings
Discussion Items
Action Items
ACTION ITEMS
| AI #
| Description
| Date
| Actionee
| Status
| Comments
|
| U/9710/01
| Read RLG draft Workplan Document
| TBD
| All
| Open
|
|
| U/9710/02
| Incorporate three suggestions
| TBD
| Don Sawyer
| Open
|
|
| U/9710/03
| Section 1.1; bullet #1 - Rework to identify why each of the three terms appear in the OAIS title
| TBD
| Randy Davis
| Open
|
|
| U/9710/04
| Delete Figure 2-4
| TBD
| Don Sawyer
| Open
|
|
| U/9710/05
| Rework Representation Information Section (in Section 4)
| TBD
| Randy Davis
| Open
|
|
| U/9710/06
| Write Scenario of Cataloging, etc.
| TBD
| Randy Davis
| Open
|
|
| U/9710/07
| Send comments re. section 4.3 to Lou Reich
| 1998-10-03
| All
| Open
|
|
| U/9710/08
| Send revised section 4.3 to members for comments with or without above comments)
| TBD
| Lou Reich
| Open
|
|
| U/9710/09
| Provide edits to WB
| 1998-10-03
| Paul Grunberger
| Open
|
|
| U/9710/10
| Update Functions, with Paul Grunberger material considered
| TBD
| Mike Martin
| Open
|
|
| U/9710/11
| Make statements re. nature of federation and how we can Model it
| TBD
| Paul Grunberger
| Open
|
|
| U/9710/12
| Send Randy Davis comments re. transformation cases, other
| 1997-10-03
| Lou Reich
| Open
|
|
| U/9710/13
| Upgrade classification material
| TBD
| Mike Martin
| Open
|
|
| U/9710/14
| Look for "orphan flows" in Paul Grunberfer diagrams
| TBD
| All
| Completed
|
|
Materials Distributed
| Item #
| Description
| By
|
| 01
| Presentation Charts - Developing an ISO Ref Model for an OAIS
| Don Sawyer
|
| 02
| RLG Preservation WG on Digital Archiving - Draft Report
| Don Sawyer
|
| 03
| Open Archival Information System (OAIS) Reference Model (RM)
White Book, Version 1.2
| Don Sawyer
|
| 04
| Annex A: Scenarios of Existing Archives
| Don Sawyer
|
| 05
| Guide for Managing Electronic Records from an Archival Perspective
| Bruce Ambacher
|
Future Meetings
| Date
| Meeting
| Location
| Subject
|
| Oct 27-29, 1997
| Fifth International Workshop
| ESRIN
Frascati, Italy
| Updates to RM White Book-2
CIP Work Plan
|
| Oct 27-Nov 5, 1997
| International CCSDS Panel 2 Workshop
| ESRIN
Frascati, Italy
| All active Panel 2 Work Packages
|
| January 20-22, 1998
| Tenth US Workshop
| NARA
College Park, MD
| Updates to RM White Book-2
|
| March/April 1998
| Eleventh US Workshop
| NARA
College Park, MD
| Finalize RM draft Red Book
|
| May 1998
| Sixth International Workshop
| likely Houston, TX
| Approve RM as CCSDS Red Book and ISO Draft Standard
|
Discussion Items
Agenda
The meeting agenda, available at
http://ssdoo.gsfc.nasa.gov/nost/isoas/us09/agenda.html,
was accepted and followed during the meeting.
Activities Since Last US Meeting
Don Sawyer itemized and described the major activities which had taken
place since the last US meeting in July.
Society of American Archivists (SAA)
- Bruce Ambacher and Don Sawyer attended this meeting in Chicago and gave papers
- Meeting seen as interesting with over 1000 participants consisting of
corporate people, agencies, museums, academia, volunteers.
- Impression that community is, as a whole, just getting their feet wet
with electronic forms of data despite considerable experience at some
archives such as NARA
- Digital libraries appear to be the major archival thrust for forming
views on preservation of electronic forms
- Statement made that there is no difference between archives and digital
libraries when it comes to digital information
Research Libraries Group (RLG)
- This group had a booth at above SAA meeting
- Don Sawyer has not yet again contacted Nancy Elkington
- Scott Roley/NARA was present at the SAA and has since invited Don Sawyer to
present at the NAGARA meeting in July. The proposed session is
"Distributed Archives: Models for Standards, Licensing, and Oversight"
and an RLG perspective will also be given in this session.
- Bruce Ambacher handed out a copy of a draft RLG work plan which showed
areas in which we have interest and should try to coordinate with
- One such activity relates to a Recommended Practices guide (which we
are considering as a New Work Item.)
- Mike Martin has worked with RLG as part of his PDS effort. He has not felt
they have accomplished much; probably running into copyright
restrictions
CEOS ATT
- Don Sawyer developed draft charts for Claude Huc to present at CEOS Archive
Task Team in September
- Claude since reported that they do want to work with us and have a
strong action to review the OAIS document
Status of US08 Action Items
Action Items from the July US DA Workshop were reviewed.
The following action items remain open.
OPEN ACTION LIST SUMMARY - U.S. DA Workshops
(as of Sept 30 1997)
| AI #
| Description
| Actionee
| Date
| Status
| Comments
TBD
Review of WB changes
Develop work plans for new work
Write memo to RLG regarding new work plans
|
| U/9708/02
| Check meaning of Migration in "Preservation" Paper
| All
| TBD
| Open
|
|
| U/9708/03
| Rewrite text re Physical Migration per discussion
| Mike Martin
| TBD
| Open
| some of this is included in new text
|
| PREVIOUS ACTIONS
|
| U/9707/01
| Follow up with Elkington re RLG activities
(url
http://www.rlg.org/preserve)
| Don Sawyer
| TBD
| Open
| Don has yet to get back to Nancy
|
| U/9707/03
| Contact Gerry Gibbon regarding Cross Reference material
| Don Sawyer
| TBD
| Open
| Gerry has not responded to Don's E-Mail
|
| U/9707/06
| Analyze and report on SOMO data Archive study
| Mike Martin
| TBD
| Open
|
|
| U/9707/07
| Upgrade OAIS classification descriptions as per SOMO survey
| Don Sawyer
Mike Martin
| TBD
| Open
|
|
| U/9707/16
| Develop Concept paper on differentiating
federated archive vs a distributed archive
| Paul Grunberger
| TBD
| Open
|
|
| U/9707/17
| Develop NARA scenario on special collection
| Bruce Ambacher
| TBD
| Open
|
|
Current Work Plan
This table shows the current work plan as amended during this meeting.
| Date
| Description
| Comments
|
| May 1997
| Submitted OAIS RM White Book 1.0 to ISO as Committee Draft
| Submitted via CCSDS Management Council and ISO/TC20 SC13
|
| Oct 10, 1997
| Issue White Book-2 of RM
| Will reflect inputs from US Workshop #9 and
all international comments
|
| Oct 27-29, 1997
| International DA Workshop #5
| at Esrin in Frascati, Italy
|
| Jan 20-22, 1998
| US Workshop #10
| at NARA
|
| Mar/Apr 1998
| US Workshop #11
| at NARA
|
| May 1998
| International DA Workshop #6
|
|
| May 1998
| Publish RM as CCSDS Red Book and ISO Draft International Standard
|
|
| Nov 1998
| International DA Workshop #7
|
|
| Nov 1998
| Publish RM as CCSDS Blue Book and ISO International Standard
|
|
Summary of Changes to Reference Model (Item 3)
- Don Sawyer produced and distributed an upgraded document
- He itemized the areas still OPEN as found in the 'Dear Reader'
note of version 1.2
- Don Sawyer then reviewed the revised Table of Contents and indicated areas
where the major changes have been made:
- Section 1: Don Sawyer noted there were changes in the Purpose and Scope and
hoped this section was settling down
- Section 2: Sections 2.1 and 2.2, which involved moving the environment
figure forward and improving readability of information model
discussions
- Section 3, now "Responsibilities", has had a number of changes in 3.1,
3.3, 3.4 and 3.5. Section 3.4 is completely rewritten.
- Section 4.1: Mike Martin commented on his update of this section:
- He noted that most of Paul Grunberger's flows are still there
- Paul Grunberger has further modified the Matrix
- Mike Martin has tried to make his nomenclature more consistent with that used by
the rest of the document
- Paul Grunberger feels he still has some orphan flows which need to be
searched out
- Section 4.2
- Don Sawyer noted that the Information Model organization is basically all new
- There are still some questions as to whether this is the right
organization
- Changes in model feed back to affect terminology in the various
sections
- Lou Reich plans to further smooth the writing
- Section 5 - Migration
- The special writing session addressed mainly Migration
- Randy Davis has rewritten this section
- Section 6-Randy Davis has rewritten the scenario to be more generic and complete
- Annex A-There are new scenarios: one from Life Sciences and one from CNES
on their archive
Note:
There may need to be a new section on Federated Archives. Paul Grunberger has not
yet produced draft material but hopes to next weekend.
Since this material had only been put on the web the previous night,
no one had read it. Therefore, a reading session was held with a focus
on sections 1.1, 2.1, 2.2, 4.2, and 4.1.
Detailed Discussion of Reference Model - White Book-1.2 (Item 3)
Section 1 - Introduction
Section 1.1 - Purpose and Scope
Action Item:
Don Sawyer
will update Section 1 - Introduction
by 971003.
Section 2 - OAIS Concepts
The International Workshop asked for less detail in section.
Keep a high level conceptual view.
Sections 2.1 and 2.2
- Paul Grunberger felt 2.2.1 and 2.2.1.1 should be at the same level,
along with material immediately under 2.1
- Mike Martin sugested that Fig 2-2 is vague and the OAIS term
"More meaningful Information" should use the OAIS term
"Content Information."
- Don Sawyer said this was to be about information in general,
so it could apply to all the information objects,
and should be written in that manner
- Paul Grunberger felt the text should include a qualifying sentence
to this affect
- Randy Davis would like to see the distinction among: data, knowledge,
and information
- There were different opinions as to the meaning of the words and
discussion followed
- Don Sawyer feels there is no useful information without a knowledge
entity to understand the information
- Paul Grunberger feared that too much treatment would bog down the reader
in this area
- Randy Davis liked (data + representation) plus representation.... equals
Information object. He would show in some manner the recursion aspect
- Randy Davis would like to be able to express all this as an equation
- Don Sawyer indicated that some Representation Data in itself may need other
Representation Data
- Suggestion made to include a more extensive example consisting
of ASCII data in a table.
- Information will be introduced with a short description relating to
knowledge exchange to bring out the importance of the 'knowledge base'
that a person or entity needs to be able to understand any information.
- More Meaningful Information will be replace with just 'Information'.
Action Item:
Don Sawyer
will incorporate these last three suggestions
by 971003.
Sect 2.2.1
Section 3 - Detailed Models
Section 3.1
Action Item:
Section 3.1.1
Section 4 - Migration Perspectives
Sect 4.2 - Information Model
- After reviewing attempt to reorganize this to work better with Int'l
actions affecting section 2 reorganization, a decision was made to try
to merge and reorganize material in 4.2.1 and 4.2.2 with minimum
refernce to Section 2.
- Don Sawyer: What is dividing line between 4.2.2 and 4.2.3?
- Lou Reich: Sect 4.2.2 concentrates on information objects to be stored- CI,
preservation info., etc., while 4.2.3 talks more of archive operations.
One builds on an information model while the other builds on the
activity assuming the model exists
- Some inconsistencies in style noted by Randy Davis (Bold, capitalize, etc)
- Mike Martin did not like basic "currency" phrase
- Lou Reich will combine 4.2.1 and 4.2.2
- Don Sawyer feels this will require a bit more text explaining the Information
Object figure - digital/physical split, digital bit sequences,
representation recursion.
- Lou Reich feels this has been treated in section 2, more might be viewed as
redundant. He also indicated that he plans some changes in the flow of
the text, including following the IO with the taxonomy view, which was
seen as smoother by the group
- Mike Martin felt more treatment of the Transfer Info Package was advisable
- Don Sawyer asked what was the significance of the word Transfer since all
IPs get transferred somewhere
- The following differentiation among AIP, AIU and AIC terms was offered
- -Originally, the AIP was to be the basic storage (atomic) item
- -The Int'ls preferred AIU as the "unit" to be stored, not
liking to view any package as an atomic unit. So AIU was
created
- -AIC is a collection of AIUs
- -AIP is a sort of short-hand term that can represent either.
- Randy Davis: more description is needed about Information Object
- Randy Davis: need example of physical object and example of digital object
- Lou Reich: feels this will all be resolved in the revised write-up
- Randy Davis suggested that this text should be interspersed with scenarios
- Randy Davis asked for more explanation as to just what constitutes a digital
object. (Is a CD-ROM full of files one, or many, digital objects. The
answer given was, "depends on how these are addressed."
- Content information is just a special case of an information object
- Randy Davis again mentioned the advantage of giving examples for
these figures
- Lou Reich will move figures 4-5 and 4-8 and modify figures 4-6 and 4-7
- page 4-18: Need to make more clear just what is Representation
Information
- Mike Martin: In Representation Information section, the text seems
repetitive
- Randy Davis will rework Representation Information section
- Need to write description of differentiation between physical-digital
and physical-containing digital terms and decide where it
should fit
- Need a mapping from SIP to DIP
- Need to Update Migration Text
- Agreed to drop concept of TIP superclass
Section 4.2.2.3
- Randy Davis asked how rigid are these requirements to be observed, since
not all documentation will contain all these materials
- Lou Reich feels now these are objectives and with time will become more
observed in practice. We may eventually even have more categories
- It was noted that these are shown as subtypes in one place and in
another as container objects.
- Lou Reich stated that perhaps the subtypes should be Preservation
Description IOs
- Add "objects" to Figure 4-10
- Discussion on Catalog Information in PDI followed
- Catalog Information may be generated and collected later, eventually
becoming part of Context, or Reference, or Content Information, or
a separate package to be pointed to. Provenance would need to be
updated. Does every change have to go back through Ingest?
- The same thing could be said about derived products, but how do these
point backward to source products?
- Lou Reich would like to break down Packaging Info into Delimitation and
Identification.
- Randy Davis to write Scenario on Cataloguing, Software and Derived Products
Section 4.2.2
- There was additional discussion re Physical vs digital objects
- Randy Davis: Re software - he used to feel we need to archive software but has
changed his mind. It should be ancillary information.
- Don Sawyer: S/W could be part of Provenance if it includes the transformation
algorithm relating to a previous product. It may be a part of Context,
or referenced by Context, if it is provided as a 'possible aid to
access'.
- Don Sawyer believes that a query for CI or PDI, in an object sense, must
logically go thru PI information and its associated methods.
- Lou Reich You don't query the AIP directly, you query the container object and
let the system decide how to respond
o There was extended discussion about the meaning, significance, etc of
all the structures being introduced in this section. Generally, the
inheritance diagrams were not understood. A mistake in Figure 4-16,
where AIP should have been AIU, compounded the problem.
Section 4.2.3
- The extra model shown is ONLY within the Archive
- Lou Reich will rework this entire area
- To Lou Reich, PI is not contained, as is PDI and other entities
- Randy Davis: All this aggregation of AIPs into AIUs/AICs is not too
clear and wondered if it were necessary
- In Fig 4-16, this should be an AIU
- Randy Davis feels we need to ellucidate an overall architecture of the
archive OR get a more complete description of the activity
- Need to build up to Figures 4-17 and 4-18
- Randy Davis would like to see the term AIP disappear and just talk about
AIUs and AICs.
- Don Sawyer likes the term AIP because it allows one to talk about common
features of the 'package to be preserved' without getting into units
and collections of units. This is convenient to give an overview such
as is needed and given in section 2.
- Don Sawyer views that the producer and the Archive negotiate what is to
be in a SIP, and how these are to be submitted. Multiple SIPs (in
some form) provide all this necessary information, but any one SIP
may contain only parts of required categories (e.g., CI, Rep. Info,
PDI). It was agreed that we should 'finesse' the information
modeling issue of how a SIP
really inherited from an IP to avoid too much complexity.
- John Garrett would like to break the SIP into two parts, the
Content Information and the PI
- Randy Davis: In BNF: Info Package ::= Content Info[PDI] or PDI
- It was agreed to retain IP, but look to drop AIP.
- It was agreed to try PI outside the IP, but with reluctance by some
overcome by the need to allow the review to proceed.
Section 4.3
- Lou Reich: Send comments re section 4.3 to Lou Reich by Oct 3 (end of week)
- Lou Reich: Send revised section 4.3 to members for comments
(These two activities may be done in parallel)
- Lou Reich suggested we split chapter 4 into two sections, between
Functional Model and Information Model. We may want the
Information Model to come
first, but then some functional model terms need to be introduced
earlier.
OAIS Version 2.0
- Don Sawyer noted that we need to put the next-version WB on the Web by
mid-Oct so the P2 international people can have a chance to review
it for discussion during the Oct 27-29 DA meeting at ESRIN.
- Mike Martin to upgrade Functions material, with Paul Grunberger edits considered
- Paul Grunberger to provide electronic edits by 3 Oct.
Section (New) - Federated Archives
- There was discussion as just what Federated Archives meant to help
Paul Grunberger start writing
- Paul Grunberger should cover the level of autonomy of each node
- Is the management breakdown different in Federated versus Distributed?
- Lou Reich: Currently accepted perceptions of the practice are most
likely not correct
- Is there a single centralized management or a single decentralized
management among the nodes?
- If you have a local catalog schema in a federated environment, can one
reach the catalog from a remote location or another node
- Paul Grunberger to make statements re nature of federation and how we
can model it
Section 5 - Archive Classifications
Action Item:
Section 6 - Archive Classifications
- Don Sawyer said we agreed to look at each class and provide rationalization
as to why this particular classification is important (if it is).
- Lou Reich doesn't mind going with the current classification text " as is"
- Mike Martin discussed expanding the utility of each type of archive
classification.
He feels there will not be any great differentiating significances among
the classifications we have now.
- Mike Martin to upgrade classification section
Section 7 - Illustrative Scenarios
- No comments at this time.
Annex A - Scenarios of Existing Archives
- No comments at this time.
Annex B - Migration Issues
- This Annex is to be dropped since we believe the important concepts
are now in the migration section
- Lou Reich did not feel transformation text in migration section covered
enough cases
- Lou Reich will send comments to Randy Davis by end of week
Annex C - Compatability with Other Standards
- Lou Reich: Delete current text but keep the annex with new text.
Annex D - Brief Guide to OMT
The Workshop was adjourned at 1700 hours, Wednesday, October 1, 1997.
Wider Views
Overview of the Ninth US Workshop
Overview of US Effort
Overview of International Effort
URL: http://ssdoo.gsfc.nasa.gov/nost/isoas/us09/minutes.html
A service of
NOST at
NSSDC.
Access statistics for this web are available.
Comments and suggestion are always welcome.
Editor: Robert Stephens (stephens@us.net) +1.301.949-0965
Curator: John Garrett (garrett@ncf.gsfc.nasa.gov) +1.301.286.3575
Responsible Official: Code 633.2 / Don Sawyer (Donald.Sawyer@gsfc.nasa.gov) +1.301.286.2748
Last Revised: November 19, 1997, John Garrett (March 25, 1998, John Garrett)