Recommended Formats Statement 2020-2021

Transcription

Library of Congress
Recommended Formats Statement
2021-2022

For online version, see Recommended Formats Statement - 2021-2022 (link)

Introduction to the 2021-2022 revision
I. Textual Works
    i. Textual Works – Print (books, etc.)
    ii. Textual Works – Digital
    iii. Textual Works – Electronic serials
II. Still Image Works
    i. Photographs – Print
    ii. Photographs – Digital
    iii. Other Graphic Images – Print (posters, postcards, fine prints)
    iv. Other Graphic Images – Digital
    v. Microforms
III. Moving Image Works
    i. Motion Pictures – Digital and Physical Media
    ii. Video – File Based and Physical Media
IV. Audio Works
    i. Audio – On Tangible Media (digital and analog)
    ii. Audio – Media-independent (digital)
V. Musical Scores
    i. Musical Scores – Print
    ii. Musical Scores – Digital
VI. Datasets
    i. Datasets
    ii. Databases
VII. GIS, Geospatial and Non-GIS Cartographic
    i. Geographic Information System (GIS) – Vector Data
    ii. GIS Vector and Raster Combined
    iii. GIS Raster and Georeferenced Images
    iv. Non-GIS Cartographic
VIII. Design and 3D
    i. 2D and 3D Computer Aided Design
    ii. Design (schematics, architectural drawings) - Print
    iii. Scanned 3D Objects (output from photogrammetry scanning)
IX. Software and Video Games
X. Web Archives

Introduction to the 2021-2022 revision

The success of the Recommended Formats Statement since it was first launched in 2014 and the way in which it has become such an important tool for the community have encouraged the Library to take the opportunity to look more closely at it for this edition of the RFS. The Library has moved beyond the standard review process which it has undertaken annually in order to keep the RFS current and relevant. Over the course of the past year, the Library has engaged in a more thorough examination, both of the organizational structure of the RFS and of the processes through which each year’s version is revised. This root-and-branch analysis has already resulted in significant changes implemented in 2020, including the new layout of the RFS to include categories for Design and 3D; GIS, Geospatial and Non-GIS Cartographic; and Musical Scores.

Underpinning these changes has been the establishment of a new internal model, which the Library has used in this instance to assess the digital file formats in the RFS. This model is based on the conceptual framework of Levels of Service. The Levels of Service concept helps define the degree to which the Library can manage specific formats throughout the lifecycle by considering both global/community criteria and local/institutional criteria. This allows for a more structured and transparent analysis of the file formats and a clear record of that analysis in a matrix workbook with each content category on a separate worksheet.

The global/community criteria for digital file formats have been based on the seven sustainability factors developed for the Library’s Sustainability of Digital Formats website: Disclosure, Adoption, Transparency, Self-documentation, External dependencies, Impact of patents and Technical protection mechanisms. Each of these factors may have different emphasis or importance depending on the community of practice and content type. Some may not be applicable or essential for every format.

The local/institutional factors estimate the level of resources at the Library of Congress available to preserve and manage the digital file formats over time. These include Staff experience and expertise, Software/Hardware/Operating System availability (including appropriate number of licenses), Representation/extent in LC collections/storage and Established workflow/functionality.

The use of this evaluation model has enabled the Library to sharpen and focus its analysis of the digital file formats in the Recommended Formats Statement. In providing a consistent review structure across all content categories, it now serves as a means to document improvements over the years as well as identify gaps that need to be filled.

Overall, the analysis has allowed us to establish clearer definitions of ‘Preferred’ and ‘Acceptable’ when categorizing digital file formats in the RFS:

Preferred formats:
- Global/community: Meets or exceeds benchmarks for all relevant sustainability factors.
- Local/institutional: The Library of Congress has the skills, experience, workflows, tools and systems to manage and preserve these formats in current systems with confidence.

Acceptable formats:
- Global/community: Meets minimum acceptability across benchmarks or does not meet all relevant sustainability factors.
- Local/institutional: The Library of Congress can manage this format at a basic level of acquisition, management and preservation; and a greater ability for management and preservation is within the Library’s capacity with further investment.
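The two definitions above can be read as a small decision rule. The following Python sketch is only an illustration of that reading, not the Library's actual workbook or method: the class and field names, the boolean simplification of each criterion, and the third "Not listed" outcome are all assumptions introduced here.

```python
from dataclasses import dataclass
from enum import Enum


class Category(Enum):
    PREFERRED = "Preferred"
    ACCEPTABLE = "Acceptable"
    NOT_LISTED = "Not listed"  # hypothetical outcome; the RFS defines only the first two


@dataclass
class FormatAssessment:
    """Hypothetical summary of one format's assessment (names are assumptions)."""
    name: str
    meets_all_relevant_factors: bool   # global/community: all relevant benchmarks met
    meets_minimum_acceptability: bool  # global/community: minimum bar met
    full_local_capability: bool        # local/institutional: managed with confidence
    basic_local_capability: bool       # local/institutional: basic level only


def categorize(a: FormatAssessment) -> Category:
    """Apply the Preferred/Acceptable definitions as a simple two-step rule."""
    # Preferred needs both the global and the local criterion at the top level.
    if a.meets_all_relevant_factors and a.full_local_capability:
        return Category.PREFERRED
    # Acceptable needs at least the lower bar on both criteria.
    global_ok = a.meets_all_relevant_factors or a.meets_minimum_acceptability
    local_ok = a.full_local_capability or a.basic_local_capability
    if global_ok and local_ok:
        return Category.ACCEPTABLE
    return Category.NOT_LISTED


if __name__ == "__main__":
    example = FormatAssessment("format-x", False, True, False, True)
    print(example.name, "->", categorize(example).value)
```

In practice each sustainability factor would be weighed separately and may not apply to every format, as the text notes, so a real assessment would carry more than four booleans.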

The success in using this model in evaluating and assessing the digital file formats in the Recommended Formats Statement opens the possibility of adapting it to apply to those other characteristics of creative works, both physical and digital, which the RFS covers in its remit to address all types of creative works.

The Recommended Formats Statement is not intended to serve as an answer to all the questions raised in preserving and providing long-term access to creative content. It does not provide instructions for receiving material into repositories, managing that content or undertaking the many ongoing tasks which will be necessary to maintain this content so that it may be used well into the future. Tackling each of those aspects is a project in and of itself as each form of content has a unique set of facets and nuances. The RFS provides guidance on identifying sets of formats which are not drawn so narrowly as to discourage creators from working within them, but will instead encourage creators to use them to produce works in formats which will make preserving them and making them accessible simpler. The Library hopes that the RFS will help make it realistic to build, grow and save creative output for our individual and collective benefit for generations to come.

The Library of Congress, realizing its unique position, is pleased to be able to contribute a resource like the Recommended Formats Statement for the benefit of all involved with creative works. The commitment of time and resources to the ongoing revision and indeed improvement of the RFS reflects the priority the Library places on working collaboratively to ensure that all might succeed in our common goal to share and disseminate creative output and to benefit the nation and the world at large.

I. Textual Works

NOTE: See also Musical Scores

i. Textual Works – Print (books, etc.)

A. Paper
  Preferred
    1. Archival quality paper (ISO 11108: 1996 for Archival Paper)

B. Printing Process, in order of preference
  Preferred
    1. Lithography (offset printing press)
    2. Electrophotography (digital press)
    3. Inkjet (inkjet printer using stable pigment or dye-based inks)

C. Binding and Packaging
  Preferred
    1. Slip-cased, if available
    2. Binding, in descending order of preference:
       a. Hard cover
          i. Library binding (NISO Z39.78-2000)
          ii. Sewn
          iii. Glued only
       b. Soft cover
          i. Sewn
          ii. Glued only
          iii. Spiral- or plastic-bound
          iv. Stapled
       c. Loose-leaf (including all binders and indexes published as part of the deposit and offered for sale and distribution)

D. Size
  Preferred
    1. Larger-sized editions (Note: large-type editions are not preferred over editions with conventional size typefaces)
    2. For broadsides and musical compositions, the Library prefers items:
       a. In protective folders
       b. Rolled (rather than folded)

E. Rarity, Special Features, Illustrations
  Preferred
    1. Limited editions (including those with special binding or special features)
    2. Editions with the greatest number of unique features (such as pop-ups, overlaps, magnifiers, overlays, tabs, notches, etc.)
    3. Illustrated editions; original color illustrations preferred over black and white reproductions

F. Completeness
  Preferred
    1. Complete work. For items published in a finite number of separate components, all elements published as part of the work and offered for sale or distribution must be submitted.
    2. All updates, supplements, releases, and supersessions published as part of the work and offered for sale or distribution must be submitted. Insertions (including all binders and indexes) must be received in a regular and timely manner for proper maintenance of the deposit.

G. Metadata
  Preferred
    1. As displayed on item:
       a. Title
       b. Creator
       c. Creation Date or Start Date/End Date
       d. Place of Publication
       e. Publisher/Producer/Distributor
       f. ISBN
    2. As displayed on item, if available:
       a. Other relevant identifiers (e.g., DOI, LCCN, etc.)
       b. Edition
       c. Subject descriptors
       d. Abstracts

ii. Textual Works – Digital

A. Technical Characteristics, in order of preference
  Preferred
    Character encoding, in descending order of preference:
    1. UTF-8, UTF-16 (with BOM), US-ASCII
    2. ISO 8859
  Acceptable
    Other character encodings not listed in Preferred section

B. Formats, in order of preference
  Preferred
    1. XML-based markup formats, with included or accessible DTD/schema, XSD/XSL presentation stylesheet(s), and explicitly stated character encoding
       a. EPUB3-compliant. (Other versions of EPUB are also preferred formats but EPUB3 is the most common.)
       b. BITS-compliant (NLM Book DTD)
       c. Other widely-used book DTDs/schemas (e.g., TEI, DocBook, etc.)
    2. Page-layout formats
       a. PDF/UA (ISO 14289-1-compliant)
       b. PDF/A (ISO 19005-compliant)
       c. PDF (highest quality available, with features such as searchable text, embedded fonts, lossless compression, high resolution images, device-independent specification of colorspace, content tagging; includes document formats such as PDF/X)
  Acceptable
    1. Other structured or markup formats
       a. XHTML or HTML, with DOCTYPE declaration and presentation stylesheet(s)
       b. XML-based document formats (widely-used and publicly documented), with presentation stylesheet(s) if applicable. Includes DOCX/OOXML 2012 (ISO 29500), ODF (ISO/IEC 26300) and OOXML (ISO/IEC 29500).
       c. SGML, with included or accessible DTD
       d. Other XML-based non-proprietary formats, with presentation stylesheet(s)
       e. XML-based formats that use proprietary DTDs or schemas, with presentation stylesheet(s)
    2. Page-layout formats
       a. PDF (web-optimized)
    3. Other formats
       a. Rich text format (RTF)
       b. Plain text
       c. Widely-used proprietary word processing formats

C. Rarity and Special Features
  Preferred
    1. Limited editions (including those with special features such as high resolution images)
    2. Editions with the greatest number of unique features (such as additional content, multimedia, interactive elements, etc.)

D. Completeness
  Preferred
    - Complete work. For items published in a finite number of separate components, all elements published as part of the work and offered for sale or distribution must be submitted. Includes all associated external files and fonts considered integral to the publication.
    - All updates, supplements, releases, and supersessions published as part of the work and offered for sale or distribution must be submitted and received in a regular and timely manner for proper maintenance of the deposit.

E. Metadata
  Preferred
    1. As supported by format (e.g., standards-based formats such as ONIX for Books, XMP, MODS, or MARCXML either embedded in or accompanying the digital item):
       a. Title
       b. Creator
       c. Creation Date or Start Date/End Date
       d. Place of publication
       e. Publisher/producer/distributor
       f. ISBN
       g. Contact information
    2. Include if available:
       a. Language of work
       b. Other relevant identifiers (e.g., DOI, LCCN, original URL, etc.)
       c. Edition
       d. Subject descriptors
       e. Abstracts

F. Technological Measures
  Preferred
    Files must contain no measures (such as digital rights management technologies or encryption) that control access to or prevent use of the digital work.
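As a rough illustration of how a depositor might pre-check a digital text against the character-encoding preference and the first metadata list in this section, here is a minimal Python sketch. It is not a Library of Congress tool: the function names, the tier labels and the dictionary keys are hypothetical, and the encoding check relies only on Python's standard `codecs` registry to normalize a declared encoding name.

```python
import codecs

# Hypothetical key names for the fields in the "As supported by format" metadata list.
REQUIRED_FIELDS = (
    "title",
    "creator",
    "creation_date",
    "place_of_publication",
    "publisher",
    "isbn",
    "contact_information",
)


def encoding_tier(declared: str) -> str:
    """Map a declared encoding name to the preference tier in this section."""
    try:
        name = codecs.lookup(declared).name  # e.g. "UTF8" -> "utf-8"
    except LookupError:
        # Unknown to Python; still an "other" encoding under the statement.
        return "acceptable"
    if name in ("utf-8", "utf-8-sig", "utf-16", "ascii"):
        return "preferred-1"
    if name.startswith("iso8859"):
        return "preferred-2"
    return "acceptable"


def has_utf16_bom(data: bytes) -> bool:
    """Return True if the data starts with a UTF-16 byte order mark."""
    return data.startswith((codecs.BOM_UTF16_LE, codecs.BOM_UTF16_BE))


def missing_required(record: dict) -> list:
    """List the required fields that are absent or blank in a metadata record."""
    return [f for f in REQUIRED_FIELDS if not str(record.get(f, "")).strip()]


if __name__ == "__main__":
    print(encoding_tier("ISO-8859-1"))
    print(missing_required({"title": "Example", "creator": "A. Author"}))
```

A declared encoding name says nothing about whether the file's bytes are actually valid in that encoding, so a fuller check would also attempt to decode the content.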

iii. Textual Works – Electronic serials

A. Technical Characteristics, in order of preference
  Preferred
    1. Character encoding, in descending order of preference:
       a. UTF-8, UTF-16 (with BOM), US-ASCII
       b. ISO 8859
  Acceptable
    1. Other character encodings not listed in Preferred section

B. Formats, in order of preference
  Preferred
    1. Content compliant with the NISO JATS: Journal Article Tag Suite (ANSI/NISO Z39.96-2015) with XSD/XSL presentation stylesheet(s) and explicitly stated character encoding
    2. Page-layout formats
       a. PDF/UA (ISO 14289-1-compliant)
       b. PDF/A (ISO 19005-compliant)
       c. PDF (highest quality available, with features such as searchable text, embedded fonts, lossless compression, high resolution images, device-independent specification of colorspace; content tagging; includes document formats such as PDF/X)
  Acceptable
    1. Other structured or markup formats:
       a. Widely-used serials or journal non-proprietary XML-based DTDs/schemas with included or accessible DTD/schema, presentation stylesheet(s) and explicitly stated character encoding.
       b. Proprietary XML-based format for serials or journals (with documentation) with DTD/schema and presentation stylesheet(s)
       c. XHTML or HTML, with DOCTYPE declaration and presentation stylesheet(s)
       d. XML-based document formats (widely used and publicly documented). With presentation stylesheets, if applicable. Includes DOCX/OOXML 2012 (ISO 29500), ODF (ISO/IEC 26300) and OOXML (ISO/IEC 29500).
    2. Page-layout formats
       a. PDF (web-optimized with searchable text)
    3. Other formats
       a. Rich text format
       b. Plain text
       c. Widely-used proprietary word processing or page-layout formats
       d. Other text- or graphic-based formats not listed here that represent textual works

C. Completeness
  Preferred
    - Complete work. All elements considered integral to the publication and offered for sale or distribution must be submitted – e.g., articles, table(s) of contents, front matter, back matter, etc. Includes all associated external files and fonts considered integral to the publication.
    - All updates, supplements, releases, and supersessions published as part of the work and offered for sale or distribution must be submitted and received in a regular and timely manner for proper maintenance of the deposit.

D. Metadata
  Preferred
    1. Title-level metadata (e.g., standards-based formats such as ONIX for Books, XMP, MODS, or MARCXML either embedded in or accompanying the digital item):
       a. Serial or journal title
       b. ISSN and ISSN-L
       c. Publisher
       d. Frequency
       e. Place of publication
    2. Article-level metadata as relevant or applicable (e.g., standards-based formats such as ONIX for Books, XMP, MODS, or MARCXML either embedded in or accompanying the digital item):
       a. Volume(s)
       b. Number(s)
       c. Issue date(s)
       d. Article title(s)
       e. Article author(s)
       f. Article identifier (DOI, original URL, etc.)
    3. Include if available:
       a. Other descriptive metadata (e.g., subject heading(s), descriptor(s), abstract(s))

E. Technological Measures
  Preferred
    Files must contain no measures (such as digital rights management technologies or encryption) that control access to or prevent use of the digital work.

II. Still Image Works

i. Photographs – Print

A. Faithful representation of the work
  Preferred
    Equal in quality to the publication version, best edition or master copy

B. Permanence and appearance
  Preferred
    - Unmounted
    - Pigmented inks (if digitally printed)
    - Fixed, well-washed (if wet chemistry method)

C. Size
  Preferred
    - Min: 8 x 10”
    - Max: 28 x 36”
  Acceptable
    Larger sizes may be acceptable if best or only version.

D. Metadata
  Preferred
    1. As supported by format:
       a. Title
       b. Creator
       c. Creation Date
       d. Place of publication
       e. Publisher/producer/distributor
       f. Contact information
    2. Include if available:
       a. Language of work
       b. Other relevant identifiers (e.g., DOI, LCCN, etc.)
       c. Subject descriptors
       d. Abstracts
       e. Key or reference to each data field and technical production information (type of paper, how processed, publisher internal tracking numbers)

ii. Photographs – Digital

A. Faithful representation of the work
  Preferred
    - Equal in quality to the published version, best edition or master copy
    - In the same format as the master copy

B. Technical Characteristics
  Preferred
    - Highest resolution available, not rescaled or interpolated
    - Highest bit depth available, 16 bits per channel if available
    - Embedded color profile or specified color space used in published version
    - Uncompressed
    - Unlayered
  Acceptable
    - Lossless compression or lower compression ratios
    - Discrete wavelet transform (DWT) preferred to discrete cosine transform (DCT)
    - Layered, if supported by preferred or acceptable format

C. Formats
  Preferred
    - TIFF (*.tif)
    - JPEG2000 (*.jp2)
    - PNG (*.png)
    - JPEG/JFIF (*.jpg)
    - BMP (*.bmp)
  Acceptable
    - Photoshop (*.psd, *.psb)
    - JPEG2000 Part 2 (*.jpf, *.jpx)
    - Digital Negative DNG (*.dng)
    - Proprietary Camera Raw formats (*.nef, *.crw)
    - GIF (*.gif)

D. Metadata
  Preferred
    1. As supported by format:
       a. Title
       b. Creator
       c. Creation Date
       d. Place of publication
       e. Publisher/producer/distributor
       f. Contact information
    2. Include if available:
       a. Common embedded schema (e.g., IPTC)
       b. Language of work
       c. Other relevant identifiers (e.g., PLUS ID, DOI, LCCN, etc.)
       d. Subject descriptors
       e. Abstracts
       f. Key or reference to each data field and technical production information (e.g. EXIF metadata from digital camera)
  Acceptable
    Metadata provided separately in external text or XML-based file

E. Technological Measures
  Preferred
    Files must contain no measures (such as digital rights management technologies or encryption) that control access to or prevent use of the digital work.

iii. Other Graphic Images – Print (posters, postcards, fine prints)

NOTE: See also Geospatial Cartographic and Design and 3D

A. Faithful representation of the work
  Preferred
    Equal in quality to the publication version, best edition or master copy

B. Permanence and appearance
  Preferred
    - Packaging materials equivalent to published form (e.g., binding, box/packaging materials)
    - If multiple versions available, provide the most widely distributed edition.
    - If limited edition, provide an unnumbered but otherwise identical copy.
    - For large items, provide rolled, unfolded.

C. Related Materials
  Preferred
    - Includes indexes, study guides or other matter if available
    - Also includes annotations, accompanying tabular or textual matter or other interpretive aids

D. Metadata
  Preferred
    1. As supported by format:
       a. Title
       b. Creator
       c. Creation Date
       d. Place of Publication
       e. Publisher/producer/distributor
       f. Contact Information
    2. Include if available:
       a. Language of work
       b. Other relevant identifiers (e.g., DOI, LCCN, etc.)
       c. Subject descriptors
       d. Abstracts
       e. Key or reference to each data field and technical production information (type of paper, how processed, publisher internal tracking numbers)

iv. Other Graphic Images – Digital

NOTE: See also Geospatial Cartographic and Design and 3D

A. Faithful representation of the work
  Preferred
    - Equal in quality to the published version, best edition or master copy
    - In the same format as the master copy

B. Technical Characteristics
  Preferred
    - Highest resolution available, not rescaled or interpolated
    - Highest bit depth available, 16 bits per channel if available
    - Specified color space used in published version
    - Uncompressed
    - Unlayered
  Acceptable
    - Lower compression ratios
    - Discrete wavelet transform (DWT) preferred to discrete cosine transform (DCT)
    - Layered, if supported by preferred or acceptable format

C. Formats (raster)
  Preferred
    - TIFF (*.tif)
    - JPEG2000 (*.jp2)
    - PNG (*.png)
    - JPEG/JFIF (*.jpg)
    - BMP (*.bmp)
  Acceptable
    - Photoshop (*.psd, *.psb)
    - JPEG2000 Part 2 (*.jpf, *.jpx)
    - MrSID (*.sid)
    - Encapsulated Postscript (*.eps)
    - Digital Negative DNG (*.dng)
    - Proprietary Camera Raw formats
    - GIF (*.gif)

D. Formats (vector)
  Preferred
    - Scalable vector graphics (*.svg)
  Acceptable
    - Computer Graphics Metafile (CGM, WebCGM)
    - Page-layout formats, e.g. PDF/UA (ISO 14289-1-compliant), PDF/A (ISO 19005-compliant), PDF (highest quality available, with features such as searchable text, embedded fonts, lossless compression, high resolution images; includes document formats such as PDF/X)
    - Encapsulated Postscript (*.eps)

E. Related Materials
  Preferred
    - Includes indexes, study guides or other matter if available
    - Also includes annotations, accompanying tabular or textual matter or other interpretive aids

G. Metadata
  Preferred
    1. As supported by format:
       a. Title
       b. Creator
       c. Creation Date
       d. Place of publication
       e. Publisher/producer/distributor
       f. Contact information
    2. Include if available:
       a. Common embedded schema
       b. Language of work
       c. Other relevant identifiers (e.g., DOI, LCCN, etc.)
       d. Subject descriptors
       e. Abstracts
       f. Key or reference to each data field and technical production information (e.g. EXIF metadata from digital camera)
  Acceptable
    Metadata provided separately in external text or XML-based file

H. Technological Measures
  Preferred
    Files must contain no measures (such as digital rights management technologies or encryption) that control access to or prevent use of the digital work.
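The format lists for Other Graphic Images – Digital lend themselves to a simple lookup by file extension. The Python sketch below is an illustrative assumption rather than an official checker: the function name `listing_for` is hypothetical, only extensions that the statement spells out are included (CGM and the page-layout formats are omitted because no extension is given for them here), and an extension says nothing about the technical characteristics such as compression, bit depth or layering.

```python
from pathlib import PurePath
from typing import Optional

# Extensions exactly as given in the raster and vector format lists.
PREFERRED = {".tif", ".jp2", ".png", ".jpg", ".bmp", ".svg"}
ACCEPTABLE = {".psd", ".psb", ".jpf", ".jpx", ".sid", ".eps", ".dng", ".gif"}


def listing_for(filename: str) -> Optional[str]:
    """Return 'Preferred', 'Acceptable', or None if the extension is not listed."""
    ext = PurePath(filename).suffix.lower()  # case-insensitive match on the suffix
    if ext in PREFERRED:
        return "Preferred"
    if ext in ACCEPTABLE:
        return "Acceptable"
    return None


if __name__ == "__main__":
    for name in ("poster_master.TIF", "logo.svg", "layers.psd", "photo.webp"):
        print(name, "->", listing_for(name))
```

Variant spellings such as `.tiff` or `.jpeg` are deliberately left out because the statement does not list them; a practical tool would more likely identify formats from file signatures than from names.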

v. Microforms

A. Faithful representation of the work
  Preferred
    Equal in quality to the publication version, best edition or master copy

B. Permanence and appearance
  Preferred
    - Silver halide
    - Positive polarity
    - Color (when available)
    - Polyester film base

C. Format (newspapers and newspaper-formatted serials)
  Preferred
    Roll microfilm

D. Format (all other materials), in order of preference
  Preferred
    1. Microfiche
    2. Roll microfilm
    3. Microfilm cassettes
    4. Micro-opaque prints

E. Size
  Preferred
    35mm, if roll film
  Acceptable
    16mm film and other sizes that match the primary production master

F. Related Materials
  Preferred
    Include indexes, study guides or other printed matter if available

G. Metadata
  Preferred
    1. As supported by format:
       a. Title
       b. Creator
       c. Creation Date
       d. Place of Publication
       e. Publisher/producer/distributor
       f. Contact Information
    2. Include if available:
       a. Language of work
       b. Other relevant identifiers (e.g., DOI, LCCN, etc.)
       c. Subject descriptors
       d. Abstracts
       e. Key or reference to each data field and technical production information (type of paper, how processed, publisher internal tracking numbers)

III. Moving Image Works

i. Motion Pictures – Digital and Physical Media

A. Motion Pictures - Digital And Physical Media
  Preferred
    - Complete final production/release version of motion picture work in the original production resolution, aspect ratio and frame rate
    - Theatrical release version in original gauge (e.g., 70mm, 35mm, 16mm)
    - Unencrypted interop Digital Cinema Package (DCP) with the following characteristics:
       a. 24- or 48-frame progressive scan
       b. Minimum projector resolution of 2048 by 1080 pixels
       c. Image source compression (if used) conforming to ISO/IEC 15444-1 (JPEG2000)
       d. Image and sound files packaged as either SMPTE or Interop DCPs
       e. DCP formats (SMPTE ST429-2 and related specifications)
  Acceptable
    - Commercially pressed DVD or Blu-ray disc
  Contact archive for guidance regarding master materials (DCDM, DSM, camera original negatives, etc.)

B. Audio
    - Complete final tracks, including any foreign language tracks and descriptive audio, when applicable
    - Each language and mix for the final production version shall be in its original channel structure and audio resolution as it was delivered to the content distributor

C. Metadata
    1. Relevant unique identifiers applicable to the work (EIDR, ISAN)
    2. If unique identifier not available, then
       a) Release title
       b) Release/Production Date
       c) Production Company and/or Producer
       d) Distributor Name
       e) Country of Origin
       f) Language
       g) Duration

D. Technological Measures
    Files must contain no measures (such as digital rights management technologies or encryption) that control access to or prevent use of the digital work.

ii. Video – File Based and Physical Media

A. Video – File-based, in order of preference
  Preferred
    Final production version with the original production resolution and frame rate (e.g., 1080p24, 720p60) and file-based format that was delivered to the content distributor.
    1. Interoperable Master Format (IMF) consisting of
       a. Essence files as MXF tracks including video, audio, data and dynamic metadata essences
       b. Composition playlist
       c. Packaging data XML files (asset map, packing list, volume index)
    2. ProRes
       a. QuickTime (.mov) container
       b. 4444 (XQ), 4444 or 422 HQ codecs
    3. MPEG-2
       a. Compliant with ISO/IEC 13818
    4. XDCAM
       a. MXF
       b. HD422, SHD422, HD codecs
  Acceptable
    - FFV1 (version 3) in Matroska (.mkv) container only for content without closed captions and/or timecode information.
    - Viewing proxy such as
       a) Recordable DVD
       b) Recordable Blu-ray disc
       c) MPEG-4 (.mp4)
  Contact archive for guidance regarding pre-production versions.

B. Video – Physical Media, in order of preference
    1. Complete, final production version with the original production resolution and frame rate (e.g., 1080p24, 720p60)
    2. Content contained in standard physical media in the following order of preference:
       a. HD: HDCAM-SR, HDCAM, HD-D5, Commercially pressed DVD or Blu-ray disc
       b. SD: Digital Betacam, Betacam SP

C. Audio
    Each language and mix for the final production version shall be in its original channel structure and audio resolution as it was delivered to the content distributor

D. Metadata
    1. Relevant unique identifiers applicable to the work (EIDR, ISAN)
    2. If unique identifier not available, then:
       a) Release title
       b) Release/Production Date
       c) Production Company and/or Producer
       d) Distributor Name
       e) Country of Origin
       f) Language
       g) Duration

E. Technological Measures
    Files must contain no measures (such as digital rights management technologies or encryption) that control access to or prevent use of the digital work.

IV. Audio Works

i. Audio – On Tangible Media (digital and analog)

1. Sound Recordings, in order of preference
  Preferred
    1. Final production/release version of content rather than pre-production version
    2. Publis
