WACZ

From Just Solve the File Format Problem
(Difference between revisions)
Jump to: navigation, search
(Created page with "{{FormatInfo |subcat=Archiving |extensions={{ext|wacz}} |pronom={{PRONOM|application/warc}} |wikidata={{wikidata|Q104903124}} |mimetypes={{mimetype|application/x-wacz}} |locfd...")
 
 
Line 22: Line 22:
 
       676  12-19-2023 10:27  datapackage-digest.json
 
       676  12-19-2023 10:27  datapackage-digest.json
 
</pre>
 
</pre>
 +
 +
==Software==
 +
* https://github.com/webrecorder/py-wacz
  
 
==References==
 
==References==

Latest revision as of 22:41, 19 December 2023

File Format
Name WACZ
Ontology
Extension(s) .wacz
MIME Type(s) application/x-wacz
LoCFDD fdd000586
PRONOM application/warc
Wikidata ID Q104903124

A Web Archive Collection Zipped[1][2] is a file format designed to package a standard WARC with accompanying metadata into a single file.[3]

[edit] Format Information

A WACZ file is a ZIP compressed format which can include:

Archive:  sample.wacz
  Length      Date    Time    Name
---------  ---------- -----   ----
    77751  12-19-2023 10:27   pages/pages.jsonl
 19477775  12-19-2023 10:27   archive/data.warc.gz
   525986  12-19-2023 10:27   indexes/index.cdx
      828  12-19-2023 10:27   datapackage.json
      676  12-19-2023 10:27   datapackage-digest.json

[edit] Software

[edit] References

  1. https://specs.webrecorder.net/wacz/latest/
  2. https://webrecorder.net/2023/05/03/an-update-on-wacz.html
  3. https://replayweb.page/docs/wacz-format
Personal tools
Namespaces

Variants
Actions
Navigation
Toolbox