ARJ
ARJ is a compressed archive format, and associated software. It was developed by Robert Jung.
Contents |
Discussion
ARJ was one of the leading compression tools during the 1990s. While it was a bit slower than PKZIP, it sported many more options, some of which were unique during that time (archives over multiple disks/volumes, fine-tuning of the compression algorithms used based on the data that was being compressed, recovery records to recover from simple transmission errors, etc.).
ARJ software expiration
If you wish to use or research the official ARJ software, be aware that some of the free/evaluation and beta versions of ARJ and ARJ32 have a hard-coded expiration date. They do not work, or do not work as well, if they think they are being used after that date. Expiration can manifest in several ways.
The 0.xx versions disable some features, based on the system clock, and the timestamp of the file being archived. Other versions only seem to care about the system clock.
Some beta versions inexplicably print "CRC error!" if they're expired.
At least one version prints a message about a time delay, but then seems to hang forever.
Newer versions have a time delay whose duration depends on how long ago they expired. The delay can be several minutes.
The newest versions that don't expire are apparently ARJ 2.76 and ARJ32 3.09.
Suggest running expired DOS versions in DOSBox-X, which lets you set the clock.
Format details
Compression methods
ID | Name | Description |
---|---|---|
0 | stored | No compression |
1 | compressed most | LZ77+Huffman. These are all essentially the same as LHA's "lh6" method, but with the history window artificially limited to 26K, instead of 32K.
It is possible for ARJ files to exist that violate the 26K limit, e.g. if they were created by ARJZ, or by Open-Source ARJ with the |
2 | ... | |
3 | ... | |
4 | compressed fastest | An ARJ-specific implementation of LZ77/LZSS.
Note: This compression method was slightly different in ARJ v0.13–0.14. |
8 | no data, no CRC | |
9 | no data |
Format version number
The main header and each local file header have two 1-byte fields, named "archiver version" and "min archiver version to extract". The main "archiver version" field in particular could be useful to help classify the file. It normally appears at file offset 5.
The table below maps format versions to software versions and some relevant format changes. There is a similar table in the Open-Source ARJ FAQ (in resource/en/readme.txt), which includes some additional software.
Format ver. | Used by | Remarks |
---|---|---|
1 | ARJ 0.13-0.14 | |
2 | ARJ 0.15-1.00 | Changed -m4 compression.
|
3 | ARJ 1.10-2.22 | Added support for directories. |
4 | ARJ 2.30 | |
5 | ARJ 2.39a-2.39b | |
6 | ARJ 2.39c-2.41a | Last-modified-time field added to main header. |
7 | ARJ 2.42a-2.50a | |
8 | ARJ 2.55-2.61 | Encryption version field added to main header. Support for "chapters" added. |
9 | ARJ 2.62-2.63[release 2 of 5?] ARJ32 3.00 |
Flags bit 0x02 redefined to mean "ANSIPAGE". Creation-time and access-time fields added. |
10 | ARJ 2.63[release 3 of 5?]-2.76 | "ARJ protection factor" field added to main header. |
11 | ARJ 2.81-(at least 2.86) | "Flags (second series)" field added to main header. |
50 | ARJZ | Used by ARJZ, maybe only in the "min version" field. |
51 | ARJZ | Used by ARJZ. |
100 | ARJ32 3.00a-3.01a | Feature-equivalent to 9. |
101 | ARJ32 3.02-3.09 | Feature-equivalent to 10. |
102 | ARJ32 3.10-(at least 3.20) | Feature-equivalent to 11. |
Self-extracting archives
ARJ can create self-extracting archives. This is usually done with the -je
or -je1
option. The -je1
option creates an "SFXJR" archive, which is smaller, but has limited features.
According to the documentation, the ASCII signature "aRJsfX
" appears somewhere in the first 1000 bytes of all such files made by ARJ v2.20+.
Files made by ARJ for DOS use EXE format. The different versions are summarized below.
Versions | Option | Remarks |
---|---|---|
<0.15 | No such feature. | |
0.15–0.15a | -x |
The executable part is pre-compressed with LZEXE. LZEXE's usual "LZ91 " signature appears at offset 28.
|
0.20 | -je |
LZEXE w/"LZ91 " signature.
|
1.00–2.00 | -je or -je1 |
Apparently uses DIET compression. |
2.10 | -je or -je1 |
LZEXE w/"LZ91 " signature.
|
2.20+ | -je or -je1 |
LZEXE w/"RJSX " signature.
|
Identification
An ARJ archive starts with signature bytes 0x60 0xea
. Byte 0x02
appears at offset 10.
The full identification algorithm used by the ARJ software is given in its technical documentation.
See also
Specifications
- The ARJ/UNARJ distributions include detailed format information, in a file named TECHNOTE.TXT or similar.
- A copy of the 2001-09 version (with an addendum of some sort)
- The ARJ Archive File Format, from fileformat.info and Corion.net
- Some extensions are documented in the Open-source ARJ project, especially in resource/en/readme.txt, "INFORMATION FOR DEVELOPERS" section.
Software
- ARJ - Official compression/decompression software for DOS
- ARJ32 - Official compression/decompression software for Windows console
- v3.04 (1999-12-19)
- v3.20 (2012-01-02)
- Various versions at old-dos.ru
- UNARJ - Official software, decompression only
- ARJCRYPT - An add-on required to enable the "new" encryption method
- 7-Zip
- Open-source ARJ
- Konvertor
See also the notes about software expiration, elsewhere on this page.
Sample files
- http://cd.textfiles.com/20mnn/HPTEXT/PHRACK/
- The Silver Collection CDs have many ARJ files
- http://cd.textfiles.com/goldmedal/