You do not have permission to edit this page, for the following reasons:

This IP address has been blocked from editing Wikipedia.
This does not affect your ability to read Wikipedia pages.
Most people who see this message have done nothing wrong. Some kinds of blocks restrict editing from specific service providers or telecom companies in response to recent abuse or vandalism, and can sometimes affect other users who are unrelated to that abuse. Review the information below for assistance if you do not believe that you have done anything wrong.

The IP address or range 54.92.0.0/16 has been blocked by ‪Blablubbs‬ for the following reason(s):

The IP address that you are currently using has been blocked because it is believed to be a web host provider or colocation provider. To prevent abuse, web hosts and colocation providers may be blocked from editing Wikipedia.
You will not be able to edit Wikipedia using a web host or colocation provider because it hides your IP address, much like a proxy or VPN.
We recommend that you attempt to use another connection to edit. For example, if you use a proxy or VPN to connect to the internet, turn it off when editing Wikipedia. If you edit using a mobile connection, try using a Wi-Fi connection, and vice versa. If you are using a corporate internet connection, switch to a different Wi-Fi network. If you have a Wikipedia account, please log in.
If you do not have any other way to edit Wikipedia, you will need to request an IP block exemption.

How to appeal if you are confident that your connection does not use a colocation provider's IP address:
If you are confident that you are not using a web host, you may appeal this block by adding the following text on your talk page: ((unblock|reason=Caught by a colocation web host block but this host or IP is not a web host. My IP address is _______. Place any further information here. ~~~~)). You must fill in the blank with your IP address for this block to be investigated. Your IP address can be determined here. Alternatively, if you wish to keep your IP address private you can use the unblock ticket request system. There are several reasons you might be editing using the IP address of a web host or colocation provider (such as if you are using VPN software or a business network); please use this method of appeal only if you think your IP address is in fact not a web host or colocation provider.

Administrators: The IP block exemption user right should only be applied to allow users to edit using web host in exceptional circumstances, and requests should usually be directed to the functionaries team via email. If you intend to give the IPBE user right, a CheckUser needs to take a look at the account. This can be requested most easily at SPI Quick Checkuser Requests. Unblocking an IP or IP range with this template is highly discouraged without at least contacting the blocking administrator.

This block will expire on 10:02, 6 June 2025. Your current IP address is 54.92.174.87.

Even when blocked, you will usually still be able to edit your user talk page, as well as email administrators and other editors.

For information on how to proceed, please read the FAQ for blocked users and the guideline on block appeals. The guide to appealing blocks may also be helpful.

Other useful links: Blocking policy · Help:I have been blocked
This IP address range has been globally blocked.
This does not affect your ability to read Wikipedia pages.
Most people who see this message have done nothing wrong. Some kinds of blocks restrict editing from specific service providers or telecom companies in response to recent abuse or vandalism, and can sometimes affect other users who are unrelated to that abuse. Review the information below for assistance if you do not believe that you have done anything wrong.

This block affects editing on all Wikimedia wikis.
The IP address or range 54.92.0.0/16 has been globally blocked by ‪Jon Kolbert‬ for the following reason(s):

Open proxy/Webhost: See the help page if you are affected

This block will expire on 18:27, 12 November 2028. Your current IP address is 54.92.174.87.

Even while globally blocked, you will usually still be able to edit pages on Meta-Wiki.

If you believe you were blocked by mistake, you can find additional information and instructions in the No open proxies global policy. Otherwise, to discuss the block please post a request for review on Meta-Wiki. You could also send an email to the stewards VRT queue at stewards@wikimedia.org including all above details.

Other useful links: Global blocks · Help:I have been blocked

You can view and copy the source of this page:

((multiple issues|
((COI|date=October 2013))
((notability|Products|date=October 2013))
((primary sources|date=October 2013))
))
((Infobox software
| name                   = ZPAQ
| screenshot             =
| caption                =
| developer              = [[Matt Mahoney (computer scientist)|Matt Mahoney]]
| programming language   = [[C++]]
| platform               = [[IA-32]], [[x86-64]]
| genre                  = [[File archiver]]
| license                = [[MIT License|MIT]], [[Public domain]]
| latest release version = ((wikidata|property|preferred|references|edit|Q8063260|P348|P548=Q2804309))
| latest release date    = ((wikidata|qualifier|preferred|single|Q8063260|P348|P548=Q2804309|P577))
| latest preview version = ((wikidata|property|preferred|references|edit|Q8063260|P348|P548=Q51930650))
| latest preview date    = ((wikidata|qualifier|preferred|single|Q8063260|P348|P548=Q51930650|P577))
| operating_system       = [[Microsoft Windows]], [[Linux]]
))

'''ZPAQ''' is an [[Open-source software|open source]] [[command line]] [[archiver]] for [[Microsoft Windows|Windows]] and [[Linux]]. It uses a journaling or append-only format which can be rolled back to an earlier state to retrieve older versions of files and directories. It supports fast incremental update by adding only files whose last-modified date has changed since the previous update. It compresses using [[Data deduplication|deduplication]] and several algorithms ([[LZ77]], [[Burrows–Wheeler transform|BWT]], and [[context mixing]]) depending on the data type and the selected compression level. To preserve forward and backward compatibility between versions as the compression algorithm is improved, it stores the decompression algorithm in the archive. The ZPAQ source code includes a [[public domain]] [[API]], '''libzpaq''', which provides compression and decompression services to [[C++]] applications. The format is believed to be unencumbered by [[patent]]s.

== Archive format ==

Files are saved in the ZPAQ level 2 journaling format.<ref>((cite web|last=Mahoney|first=Matt|url=https://mattmahoney.net/dc/zpaq202.pdf|title=The ZPAQ Open Standard for Highly Compressed Data - Level 2|date=3 June 2013|access-date=28 May 2023))</ref> The standard defines two formats - streaming and journaling. Only the journaling format supports deduplication, directory attributes, and multiple dated file versions.

The streaming archive format is designed to be extracted in a single pass. An archive is divided into a sequence of blocks that can be decompressed independently in parallel. Blocks are divided into segments that must be decompressed sequentially in order. Each block header contains a description of the decompression algorithm. Each segment has a header containing an optional file name and an optional comment for meta-data such as size, date, and attributes, and an optional trailing [[SHA-1]] checksum of the original data for integrity checking. If the file name is omitted, it is assumed to be a continuation of the last named file, which may be in the previous block. Thus, inserting, removing, or reordering the blocks in a streaming archive has the effect of performing the same operations on the data that the blocks represent.

The journaling format consists of a sequence of transactions, or updates. An update contains 4 types of blocks: a transaction header block, a sequence of data blocks, a corresponding sequence of fragment tables, and a sequence of index blocks. A transaction header block contains the transaction date and a pointer skipping over the data blocks to allow the archive index to be read quickly. The data blocks contain a sequence of file fragments compressed together. The fragment tables give the size and SHA-1 hash of each fragment. The index blocks contain a list of edits to the global archive index. An edit is either a file update or a file deletion. An update includes a file name, last modified date, attributes, and a list of fragment pointers into the current and previous transactions. Fragments may be shared by more than one file. A deletion does not remove any data from the archive, but rather indicates that the file is not to be extracted unless the archive is rolled back to an earlier date.

The ZPAQ standard does not specify a compression algorithm. Rather, it specifies a format for representing the decompression algorithm in the block headers. Decompression algorithms are written in a language called ZPAQL and stored as a byte code which can either be interpreted or converted directly to 32 or 64 bit x86 code and executed. A ZPAQL program has 3 parts.

* COMP - An optional chain of context modeling components.
* HCOMP - Machine code for computing contexts for the COMP components.
* PCOMP - Optional machine code for post-processing the decoded data.

The COMP models are based on [[PAQ]], which compresses one bit at a time using [[arithmetic coding]]. There are 9 types of components. Each component takes a context and possibly the predictions of earlier components, and outputs a prediction or probability that the next bit will be a 1. The output of the last component is arithmetic coded. The component types are:

* CONST - A fixed prediction.
* CM - Context model. The context is used to look up a prediction in a table. On update, the selected entry is adjusted to reduce the prediction error.
* ICM - Indirect context model. The context is used to look up an 8 bit state representing a recent bit history. The history selects a prediction as with a CM.
* MIX - A group of predictions are combined by weighted averaging in the logistic domain, or log(p/(1-p)). The weights are selected by a context. On update, the weights are adjusted to favor the more accurate inputs.
* MIX2 - A 2 input MIX with weights constrained to add to 1.
* AVG - A MIX2 with fixed weights.
* SSE - Secondary symbol estimator. Looks up a prediction from an interpolated table given a context and quantized prediction from another component.
* ISSE - Indirect secondary symbol estimator. The context selects a bit history as with an ICM, and then the bit history selects a pair of weights to mix the input with a constant 1.
* MATCH - Searches for the previous occurrence of the context and predicts whatever bit followed, with a strength depending on the length of the match.

The HCOMP section computes the contexts for the components in the COMP section. It is a virtual machine whose state is 4 32-bit registers (A, B, C, D), a 16 bit program counter, a condition flag bit, and two memory arrays, one of bytes (M) and one of 32 bit words (H). The beginning of H forms the array of contexts. An [[assembly language]]-like program is called once for each coded or decoded byte with that byte as input in A. The final context seen by the COMP section is the computed context combined with the previously seen bits of the current byte.

The optional PCOMP section is used for post-processing the decoded data. It runs in a separate virtual machine like that of HCOMP. However, unlike the COMP and HCOMP sections which are used for both compression and decompression, the PCOMP section is run only during decompression. The compressor is responsible for performing the inverse operation on the input data prior to coding.

=== ZPAQL Example ===

ZPAQL source code uses a textual syntax, with each space-delimited word assembling to one byte in most cases, and comments in parentheses. The following example is the ''mid'' configuration, similar to level 5 compression. It describes an ICM-ISSE chain of components taking hashed contexts of orders 0 through 5, a MATCH taking an order 7 context, and as a final step, averaging these bit predictions using a MIX. There is no post-processing.

 <code style="border:none">comp 3 3 0 0 8 (hh hm ph pm n)
    0 icm 5      (order 0...5 chain)
    1 isse 13 0
    2 isse 17 1
    3 isse 18 2
    4 isse 18 3
    5 isse 19 4
    6 match 22 24  (order 7)
    7 mix 16 0 7 24 255  (order 1)
  hcomp
    c++ *c=a b=c a=0 (save in rotating buffer M)
    d= 1 hash *d=a   (orders 1...5 for isse)
    b-- d++ hash *d=a
    b-- d++ hash *d=a
    b-- d++ hash *d=a
    b-- d++ hash *d=a
    b-- d++ hash b-- hash *d=a (order 7 for match)
    d++ a=*c a<<= 8 *d=a       (order 1 for mix)
    halt
  end</code>

The COMP parameters describe the log base 2 sizes of the word and byte arrays (hh, hm), 8 bytes each in the HCOMP section and not used in the PCOMP section. There are n = 8 numbered components. The components take parameters describing their table sizes and inputs. In particular, each ISSE takes its input from the previous component, and the MIX takes input from the 7 components starting at 0. The line "5 isse 19 4" says that the ISSE has a table size of 2<sup>19+6</sup> bit histories and takes its input from component 4.

In the HCOMP section, registers B and C point into the 8 byte rotating array M, and D points to the 8 word array H. M is used to store the last 8 bytes of input from the A register. C points to the head of this buffer. The HASH instruction computes:

  a = (a + *b + 512) * 773;

Thus, the code stores context hashes of various orders in H[0]...H[7].

=== Deduplication ===

On update, ZPAQ divides the input files into fragments, computes their SHA-1 hashes, and compares them to the hashes stored in the archive. If there is a match, then the fragments are assumed to be identical, and only a pointer to the previously compressed fragment is stored. Otherwise the fragment is packed into a block to be compressed. Block sizes can be up to 16 MiB to 64 MiB depending on the compression level.

Files are divided into fragments on content-dependent boundaries. Rather than a [[Rabin fingerprint]], ZPAQ uses a [[rolling hash]] that depends on the last 32 bytes that are not predicted by an order 1 context, plus any predicted bytes in between. If the leading 16 bits of the 32 bit hash are all 0, then a fragment boundary is marked. This gives an average fragment size of 64 KiB.

The rolling hash uses a 256 byte table containing the byte last seen in each possible order-1 context. The hash is updated by adding the next byte and then multiplying either by an odd constant if the byte was predicted or by an even number that is not a multiple of 4 if the byte was not predicted.

=== Compression ===

ZPAQ has 5 compression levels, from fastest to best. At all but the best level, it uses the statistics of the order-1 prediction table used for deduplication to test whether the input appears random. If so, it is stored without compression as a speed optimization.

ZPAQ will use an E8E9 transform (see: [[BCJ (algorithm)|BCJ]]) to improve the compression of x86 code typically found in .exe and .dll files. An E8E9 transform scans for CALL and JMP instructions (opcodes E8 and E9 hex) and replaces their relative addresses with absolute addresses. Then it inserts code into the PCOMP section to perform the inverse transform.

=== Error recovery ===

ZPAQ lacks error correction but has several features that limit damage if the archive is corrupted. On decompression, all SHA-1 hashes are checked. If the hash does not match or if some other error occurs, then a warning is printed and the block is ignored. Blocks begin with a 13 byte "locator tag" containing a randomly chosen but fixed string to allow start of the next block to be found by scanning. If a data fragment is lost, then all of the files referencing that fragment and the remaining fragments in the block are also lost. If a fragment table is lost, then it can be recovered from a redundant list of fragment sizes stored in the corresponding data block and by recomputing the hashes. In this case, a second hash of the whole data block is checked. If an index block is lost, then the corresponding files are lost. Index blocks are small (16 KiB) in order to limit damage.

Updates are transacted by appending a temporary transaction header and then updating the header as the last step. If an update is interrupted, then the temporary header signals ZPAQ that no useful data is found after it. The next update will overwrite this excess data.

== Basic usage ==

=== Creating an archive, and updating an archive ===
<code>zpaq add directory/archive.zpaq directory/source_directory -mX -key password</code>

The options <code>-mX</code> (with X being the compression level from 0 to 5) and <code>-key</code> (which performs [[Advanced Encryption Standard|AES-256]] encryption) can be omitted. The 0 compression level does not compress data, but still carries out data deduplication. The compression levels 4 and 5 can be very time-consuming. The default (1) uses simple LZ77 compression.

=== Listing archive contents ===
<code>zpaq list archive.zpaq</code> lists the files and directories of the most recent version. Adding <code>-all</code> will list all versions of all files and directories, in the format <code>version_number/directory/file_name</code>. The output can be further processed with [[grep]] and other tools.

=== Extracting files ===
<code>zpaq extract archive.zpaq</code> will un-pack the last version of the entire archive in the active directory. <code>zpaq extract backup.zpaq path</code> will only extract the specified directory (or file). Appending the <code>-until N</code> option selects the version, where negative numbers are allowed. -2 would extract the third most recent version of the archive. The optional <code>-to</code> tells ZPAQ where to save the extracted files.  

<code>zpaq extract backup.zpaq -all -only "*muppet*"</code> will extract all versions of all files and directories whose name contains "muppet". Different file versions will be placed in different directories (<code>0001/ 0002/ 0003/</code> et cetera). <code>-only</code> is optional.

== History ==

* Feb. 15, 2009 - zpaq 0.01 experimental release.
* Mar. 12, 2009 - zpaq 1.00 specification finalized guaranteeing backward compatibility.
* Sept. 29, 2009 - zpaq 1.06, specification updated to v1.01 add locator tags to support self extracting archives.
* Oct. 14, 2009 - zpaq 1.09 adds ZPAQL to C++ translator as a speed optimization.
* Sept. 27, 2010 - separate libzpaq 0.01 API.
* Jan. 21, 2011 - pzpaq 0.01, first multi-threaded version, later incorporated back into zpaq.
* Nov. 13, 2011 - zpaq 4.00, adds JIT compiler (ZPAQL to x86) eliminating need for external C++ compiler for optimization.
* Feb. 1, 2012 - zpaq 5.00, specification updated to v2.00 to allow empty COMP section (post-processing only).
* Sept. 28, 2012 - zpaq 6.00, specification updated to v2.01 adding journaling format.
* Jan. 23, 2013 - zpaq 6.19, splits development functions to a separate program, zpaqd.

== Related projects ==
* [https://quixdb.github.io/squash/ Squash], a compression abstraction layer supporting many [[codec]]s.
* [http://peazip.sourceforge.net/ PeaZip], an archiver supporting over 150 formats including ZPAQ streaming format extraction.
* [http://mattmahoney.net/dc/fastqz/ fastqz], a [[FASTQ format|FASTQ]] compressor built using libzpaq.<ref>Bonfield JK, Mahoney MV (2013) [http://www.plosone.org/article/info%3Adoi%2F10.1371%2Fjournal.pone.0059190 Compression of FASTQ and SAM Format Sequencing Data]. PLoS ONE 8(3): e59190. doi:10.1371/journal.pone.0059190</ref>
* [https://github.com/fcorbelli/zpaqfranz zpaqfranz], Swiss army knife for the serious backup and disaster recovery manager.
* [https://totalcmd.net/plugring/ZPAQ.html wcx_zpaq], a packer plug-in (wcx) for Total Commander.<ref>((cite web |title=[WCX] ZPAQ |url=https://www.ghisler.ch/board/viewtopic.php?t=42739 |website=Total Commander Forums |access-date=10 July 2021))</ref>

== References ==
((reflist))

== External links ==
* ((Official website))
* ((GitHub|zpaq/zpaq))
* [https://web.archive.org/web/20200222215450/http://www.modejong.com/blog/post15_zpaql_grayscale/index.html Intro to zpaql with grayscale images]

((Archive formats))
((Compression software implementations))
((FOSS))

((DEFAULTSORT:Zpaq))
[[Category:Archive formats]]
[[Category:Free data compression software]]
[[Category:Open formats]]
[[Category:Lossless compression algorithms]]

Pages transcluded onto the current version of this page (help):

ZPAQ (edit)
Template:Ambox (view source) (template editor protected)
Template:Archive formats (edit)
Template:COI (view source) (template editor protected)
Template:Cite web (view source) (protected)
Template:Compression software (edit)
Template:Compression software implementations (edit)
Template:EditAtWikidata (view source) (protected)
Template:FOSS (view source) (semi-protected)
Template:Find sources mainspace (view source) (template editor protected)
Template:GitHub (view source) (semi-protected)
Template:Hlist/styles.css (view source) (protected)
Template:Icon (view source) (template editor protected)
Template:If empty (view source) (protected)
Template:Infobox (view source) (template editor protected)
Template:Infobox software (view source) (template editor protected)
Template:Infobox software/simple (view source) (template editor protected)
Template:Main other (view source) (protected)
Template:Multiple issues (view source) (template editor protected)
Template:Multiple issues/styles.css (view source) (template editor protected)
Template:Navbox (view source) (template editor protected)
Template:Notability (view source) (template editor protected)
Template:Official website (view source) (template editor protected)
Template:Plainlist (view source) (protected)
Template:Plainlist/styles.css (view source) (protected)
Template:Primary sources (view source) (template editor protected)
Template:Reflist (view source) (protected)
Template:Reflist/styles.css (view source) (protected)
Template:Small (view source) (protected)
Template:Strong (view source) (template editor protected)
Template:Template other (view source) (protected)
Template:URL (view source) (template editor protected)
Template:Wikidata (view source) (template editor protected)
Template:Yesno (view source) (protected)
Template:Yesno-no (view source) (template editor protected)
Module:Arguments (view source) (protected)
Module:Category handler (view source) (protected)
Module:Category handler/blacklist (view source) (protected)
Module:Category handler/config (view source) (protected)
Module:Category handler/data (view source) (protected)
Module:Category handler/shared (view source) (protected)
Module:Check for unknown parameters (view source) (protected)
Module:Citation/CS1 (view source) (protected)
Module:Citation/CS1/COinS (view source) (protected)
Module:Citation/CS1/Configuration (view source) (protected)
Module:Citation/CS1/Date validation (view source) (protected)
Module:Citation/CS1/Identifiers (view source) (protected)
Module:Citation/CS1/Utilities (view source) (protected)
Module:Citation/CS1/Whitelist (view source) (protected)
Module:Citation/CS1/styles.css (view source) (protected)
Module:EditAtWikidata (view source) (protected)
Module:Find sources (view source) (template editor protected)
Module:Find sources/config (view source) (template editor protected)
Module:Find sources/links (view source) (template editor protected)
Module:Find sources/templates/Find sources mainspace (view source) (template editor protected)
Module:Icon (view source) (template editor protected)
Module:Icon/data (view source) (template editor protected)
Module:If empty (view source) (protected)
Module:Infobox (view source) (template editor protected)
Module:Infobox/styles.css (view source) (template editor protected)
Module:InfoboxImage (view source) (template editor protected)
Module:Message box (view source) (protected)
Module:Message box/ambox.css (view source) (protected)
Module:Message box/configuration (view source) (protected)
Module:Namespace detect/config (view source) (protected)
Module:Namespace detect/data (view source) (protected)
Module:Navbar (view source) (protected)
Module:Navbar/configuration (view source) (protected)
Module:Navbar/styles.css (view source) (protected)
Module:Navbox (view source) (template editor protected)
Module:Navbox/configuration (view source) (template editor protected)
Module:Navbox/styles.css (view source) (template editor protected)
Module:Official website (view source) (template editor protected)
Module:String (view source) (protected)
Module:URL (view source) (template editor protected)
Module:Unsubst (view source) (protected)
Module:Wd (view source) (template editor protected)
Module:Wd/i18n (view source) (template editor protected)
Module:WikidataIB (view source) (template editor protected)
Module:WikidataIB/nolinks (view source) (template editor protected)
Module:WikidataIB/titleformats (view source) (template editor protected)
Module:Yesno (view source) (protected)

Return to ZPAQ.

Retrieved from "https://en.wikipedia.org/wiki/ZPAQ"