xz.git - XZ Utils

Age	Commit message (Collapse)	Author	Files	Lines
2024-02-23	xz: Fix Capsicum sandbox compile error.	Jia Tan	1	-2/+2
	user_abort_pipe[] was still being used instead of the parameters.
2024-02-23	Build: Fix ARM64 CRC32 instruction feature test.	Jia Tan	1	-0/+10
	Old versions of Clang reported the unsupported function attribute and __crc32d() function as warnings instead of errors, so the feature test passed when it shouldn't have, causing a compile error at build time. -Werror was added to this feature test to fix this. The change is not needed for CMake because check_c_source_compiles() also performs linking and the error is caught then. Thanks to Sebastian Andrzej Siewior for reporting this.
2024-02-22	CMake: Add LOCALEDIR to the windres workaround.	Lasse Collin	1	-5/+11
	LOCALEDIR may contain spaces like in "C:\Program Files".
2024-02-22	xz: Landlock: Fix error message if input file is a directory.	Lasse Collin	1	-1/+14
	If xz is given a directory, it should look like this: $ xz /usr/bin xz: /usr/bin: Is a directory, skipping The Landlock rules didn't allow opening directories for reading: $ xz /usr/bin xz: /usr/bin: Permission denied The simplest fix was to allow opening directories for reading. While it's a bit silly to allow it solely for the error message, it shouldn't make the sandbox significantly weaker. The single-file use case (like when called from GNU tar) is still as strict as possible: all Landlock restrictions are enabled before (de)compression starts.
2024-02-22	liblzma: Disable branchless C version in range decoder.	Lasse Collin	1	-3/+10
	Thanks to Sebastian Andrzej Siewior and Sam James for benchmarking on various systems.
2024-02-21	INSTALL: Clarify that --disable-assembler affects only 32-bit x86.	Lasse Collin	1	-9/+9

2024-02-21	Windows: build.bash: Include COPYING.0BSD in the package.	Lasse Collin	1	-1/+1

2024-02-21	Windows: build.bash: include liblzma-crt-mixing.txt in the package.	Lasse Collin	1	-2/+4

2024-02-21	Windows: Major update to Windows build instructions.	Lasse Collin	8	-184/+404

2024-02-21	Windows: Update windows/README-Windows.txt.	Lasse Collin	1	-63/+41
	It's for binary packages built with windows/build.bash.
2024-02-20	Windows: Update windows/build.bash.	Lasse Collin	1	-79/+112
	Support for the old MinGW was dropped. Only MinGW-w64 with GCC is supported now. The script now supports also cross-compilation from GNU/Linux (tests are not run). MSYS2 and also the old MSYS 1.0.11 work for building on Windows. The i686 and x86_64 toolchains must be in PATH to build both 32-bit and 64-bit versions. Parallel builds are done if "nproc" from GNU coreutils is available. MinGW-w64 runtime copyright information file was renamed from COPYING-Windows.txt to COPYING.MinGW-w64-runtime.txt which is the filename used by MinGW-w64 itself. Its existence is now mandatory, it's checked at the beginning of the script. The file TODO is no longer copied to the package.
2024-02-20	Translations: Update the Romanian man page translations.	Jia Tan	1	-840/+875

2024-02-20	Translations: Update the Korean man page translations.	Jia Tan	1	-3/+3

2024-02-20	Translations: Update the Spanish translation.	Jia Tan	1	-3/+3

2024-02-20	Translations: Update the Romanian translation.	Jia Tan	1	-243/+227

2024-02-20	Translations: Update the Croatian translation.	Jia Tan	1	-293/+355

2024-02-20	Translations: Update the German man page translations.	Jia Tan	1	-823/+873

2024-02-20	Translations: Update the German translation.	Jia Tan	1	-202/+225

2024-02-20	Translations: Update the Hungarian translation.	Jia Tan	1	-218/+338

2024-02-19	CMake: Fix building of lzmainfo when translations are enabled.	Lasse Collin	1	-0/+2

2024-02-19	CMake: Don't assume that -fvisibility=hidden is supported outside Windows.	Lasse Collin	1	-4/+22
	The original code was good enough for supporting GNU/Linux and a few others but it wasn't very portable. CMake doesn't support Solaris Studio's -xldscope=hidden. If it ever does, things should still work with this commit as Solaris Studio supports not only its own __global but also the GNU C __attribute__((visibility("default"))). Support for the attribute was added in 2007 to Sun Studio 12 compiler version 5.9.
2024-02-19	CMake: Revise the component splitting.	Lasse Collin	1	-26/+31

2024-02-19	CMake: Update the main comment and document CMAKE_BUILD_TYPE=Release.	Lasse Collin	1	-16/+63

2024-02-19	CMake: Use -O2 instead of -O3 in CMAKE_BUILD_TYPE=Release.	Lasse Collin	1	-0/+19
	-O3 doesn't seem useful for speed but it makes the code bigger. CMake makes is difficult for users to simply override the optimization level: CFLAGS / CMAKE_C_FLAGS aren't helpful because they go before CMAKE_C_FLAGS_RELEASE. Of course, users can override CMAKE_C_FLAGS_RELEASE directly but then they have to remember to add also -DNDEBUG to disable assertions. This commit changes -O3 to -O2 in CMAKE_C_FLAGS_RELEASE if and only if CMAKE_C_FLAGS_RELEASE cache variable doesn't already exist. So if a custom value is passed on the command line (or reconfiguring an already-configured build), the cache variable won't be modified.
2024-02-19	CMake: Handle symbol versioning on MicroBlaze specially.	Lasse Collin	1	-4/+19
	This is to match configure.ac.
2024-02-19	CMake: Keep build working even if lib/*.[ch] are removed.	Lasse Collin	1	-1/+6

2024-02-19	CMake: Install documentation.	Lasse Collin	1	-0/+32

2024-02-19	CMake: Bump maximum policy version to 3.28.	Lasse Collin	1	-1/+1
	CMP0154 doesn't affect us since we don't use FILE_SET.
2024-02-19	CMake: Build lzmainfo.	Lasse Collin	1	-0/+54

2024-02-19	CMake: Build lzmadec.	Lasse Collin	1	-34/+42

2024-02-19	CMake: Add test_scripts.sh to the tests.	Lasse Collin	2	-5/+22
	In contrast to Automake, skipping of this test when decoders are disabled is handled at CMake side instead of test_scripts.sh because CMake-build doesn't create config.h.
2024-02-19	CMake: Install scripts.	Lasse Collin	1	-1/+82
	Compared to the Autotools-based build, this has simpler handling for the shell (@POSIX_SHELL@) and extra PATH entry for the scripts (configure has --enable-path-for-scripts=PREFIX). The simpler metho should be enough for non-ancient systems and Solaris.
2024-02-19	Scripts: Use @PACKAGE_VERSION@ instead of @VERSION@.	Lasse Collin	4	-4/+4
	PACKAGE_VERSION was already used in liblzma.pc.in. This way only one version @foo@ is used.
2024-02-19	CMake: Simplify symlink creation and install translated man pages.	Lasse Collin	1	-97/+98
	It helps that cmake_install.cmake doesn't parallelize installation so symlinks can be created so that the target is always known to exist (a requirement on Windows in some cases). This bumps the minimum CMake version from 3.13 to 3.14 to use file(CREATE_LINK ...). It could be made to work on 3.13 by calling "cmake -E create_symlink" but it's uglier code and slower in "make install". 3.14 should be a reasonable version to require nowadays, especially since the Autotools build is still the primary build system for most OSes.
2024-02-19	CMake: Add support for building and installing xz with translations.	Lasse Collin	1	-2/+66
	If gettext tools are available, the .po files listed in po/LINGUAS are converted using msgfmt. This allows building with translations directly from xz.git without Autotools. If gettext tools aren't available, the Autotools-created .gmo files in the "po" directory will be used. This allows CMake-based build to use translations from Autotools-generated tarball. If translation support is found (Intl_FOUND) but both the gettext tools and the pre-generated .gmo files are missing, then "make" will fail.
2024-02-19	liblzma: Remove commented-out code.	Lasse Collin	1	-3/+0

2024-02-17	xz: Delete old commented-out code.	Lasse Collin	1	-19/+0

2024-02-17	xz: Use stricter pledge(2) and Landlock sandbox.	Lasse Collin	3	-13/+69
	This makes these sandboxing methods stricter when no files are created or deleted. That is, it's a middle ground between the initial sandbox and the strictest single-file-to-stdout sandbox: this allows opening files for reading but output has to go to stdout.
2024-02-17	xz: Support Landlock ABI version 4.	Lasse Collin	1	-5/+20
	Linux 6.7 added support for ABI version 4 which restricts TCP connections which xz won't need and thus those can be forbidden now. Since the ABI version is handled at runtime, supporting version 4 won't cause any compatibility issues. Note that new enough kernel headers are required to get version 4 support enabled at build time.
2024-02-17	xz: Move sandboxing code to sandbox.c and improve Landlock sandbox.	Lasse Collin	8	-213/+357
	Landlock is now always used just like pledge(2) is: first in more permissive mode and later (under certain common conditions) in a strict mode that doesn't allow opening more files. I put pledge(2) first in sandbox.c because it's the simplest API to use and still somewhat fine-grained for basic applications. So it's the simplest thing to understand for anyone reading sandbox.c.
2024-02-17	xz: Tweak comments.	Lasse Collin	1	-1/+3

2024-02-17	xz: Fix message_init() description.	Lasse Collin	3	-3/+7
	Also explicitly initialize progress_automatic to make it clear that it can be read before message_init() sets it. Static variable was initialized to false by default already so this is only for clarity.
2024-02-17	Build: Makefile.am: Sort EXTRA_DIST.	Lasse Collin	1	-7/+7
	Dirs first, then files in case-sensitive ASCII order.
2024-02-17	Build: Don't install TODO.	Lasse Collin	1	-1/+1

2024-02-18	Translations: Update the Korean man page translations.	Jia Tan	1	-836/+871

2024-02-18	Translations: Update the Korean translation.	Jia Tan	1	-200/+223

2024-02-17	Build: Install translated lzmainfo man pages.	Lasse Collin	1	-0/+26
	All other translated man pages were being installed but lzmainfo had been forgotten.
2024-02-17	liblzma: Avoid implementation-defined behavior in the RISC-V filter.	Lasse Collin	1	-8/+22
	GCC docs promise that it works and a few other compilers do too. Clang/LLVM is documented source code only but unsurprisingly it behaves the same as others on x86-64 at least. But the certainly-portable way is good enough here so use that.
2024-02-17	liblzma: Wrap a line exceeding 80 chars.	Lasse Collin	1	-1/+2

2024-02-17	liblzma/rangecoder: Exclude x32 from the x86-64 optimisation.	Sebastian Andrzej Siewior	1	-1/+1
	The x32 port has a x86-64 ABI in term of all registers but uses only 32bit pointer like x86-32. The assembly optimisation fails to compile on x32. Given the state of x32 I suggest to exclude it from the optimisation rather than trying to fix it. Signed-off-by: Sebastian Andrzej Siewior <sebastian@breakpoint.cc>
2024-02-17	Translations: Update the Spanish translation.	Jia Tan	1	-201/+226

2024-02-17	Translations: Update the Swedish translation.	Jia Tan	1	-204/+230

2024-02-17	Translations: Update the Polish translation.	Jia Tan	1	-200/+224

2024-02-17	Translations: Update the Ukrainian translation.	Jia Tan	1	-1/+1

2024-02-16	Translations: Use the same sentence in xz.pot-header that the TP uses.	Lasse Collin	1	-1/+1

2024-02-16	Fix typos discovered by codespell.	Jia Tan	3	-4/+4

2024-02-16	Translations: Update the Ukrainian man page translations.	Jia Tan	1	-837/+873

2024-02-16	Translations: Update the Ukrainian translation.	Jia Tan	1	-241/+225

2024-02-15	Translations: Omit the generic copyright line from man page headers.	Lasse Collin	1	-0/+1

2024-02-15	Update m4/.gitignore.	Jia Tan	1	-0/+1

2024-02-14	Tests: tuktest.h: Treat Clang separately from GCC.	Lasse Collin	1	-3/+3
	Don't assume that Clang defines __GNUC__ as the extensions are available in clang-cl as well (and possibly in some other Clang variants?).
2024-02-14	Tests: tuktest.h: Add a missing word to a comment.	Lasse Collin	1	-2/+2

2024-02-14	Tests: tuktest.h: Fix the comment about STest.	Lasse Collin	1	-1/+2

2024-02-15	Bump version for 5.5.2beta.larhzu/v5.5.2beta	Jia Tan	3	-4/+4

2024-02-14	liblzma: Fix validate_map.sh.	Lasse Collin	1	-1/+1
	Adding the SPDX license identifier changed the line numbers.
2024-02-14	Build: Start the generated ChangeLog from around 5.4.0 instead of 5.2.0.	Lasse Collin	1	-1/+1

2024-02-14	Fixed NEWS for 5.5.2beta.	Lasse Collin	1	-2/+6

2024-02-14	liblzma: Silence warnings in --enable-small build.	Lasse Collin	2	-0/+3

2024-02-14	Build: Install COPYING.0BSD as part of docs.	Lasse Collin	1	-0/+1

2024-02-14	Docs: List COPYING.0BSD in README.	Lasse Collin	1	-0/+1

2024-02-14	Docs: Include doc/examples/11_file_info.c in tarballs.	Lasse Collin	1	-0/+1
	It was added in 2017 in c2e29f06a7d1e3ba242ac2fafc69f5d6e92f62cd but it never got into any release tarballs because it was forgotten to be added to Makefile.am.
2024-02-14	liblzma: Silence a warning.	Lasse Collin	1	-1/+1

2024-02-14	Add NEWS for 5.5.2beta.	Lasse Collin	1	-0/+60

2024-02-14	xz: Mention lzmainfo if trying to use 'lzma --list'.	Lasse Collin	1	-2/+14
	This kind of fixes the problem reported here: https://bugs.launchpad.net/ubuntu/+source/xz-utils/+bug/1291020
2024-02-14	liblzma: Add comments.	Lasse Collin	2	-2/+18

2024-02-14	Scripts: Add lz4 support to xzgrep and xzdiff.	Lasse Collin	4	-10/+19

2024-02-14	liblzma: Choose the range decoder variants using a bitmask macro.	Lasse Collin	1	-11/+53

2024-02-14	xz: Fix outdated threading related info on the man page.	Lasse Collin	1	-8/+14

2024-02-14	liblzma: Range decoder: Add x86-64 inline assembly.	Lasse Collin	1	-0/+491
	It's compatible with GCC and Clang.
2024-02-14	liblzma: Range decoder: Add branchless C code.	Lasse Collin	1	-0/+76
	It's used only for basic bittrees and fixed-size reverse bittree because those showed a clear benefit on x86-64 with GCC and Clang. The other methods were more mixed and thus are commented out but they should be tested on other archs.
2024-02-14	liblzma: Clarify a comment.	Lasse Collin	1	-3/+6

2024-02-14	liblzma: LZMA decoder: Optimize loop comparison.	Lasse Collin	2	-4/+11
	But now it needs one more local variable.
2024-02-14	liblzma: Optimize literal_subcoder() macro slightly.	Lasse Collin	5	-22/+24

2024-02-14	liblzma: LZ decoder: Add unlikely().	Lasse Collin	1	-1/+1

2024-02-14	liblzma: LZ decoder: Remove a useless unlikely().	Lasse Collin	1	-1/+1

2024-02-14	liblzma: Optimize LZ decoder slightly.	Lasse Collin	3	-60/+88
	Now extra buffer space is reserved so that repeating bytes for any single match will never need to copy from two places (both the beginning and the end of the buffer). This simplifies dict_repeat() and helps a little with speed. This seems to reduce .lzma decompression time about 2 %, so with .xz and CRC it could be slightly less. The small things add up still.
2024-02-14	liblzma: LZMA decoder: Get rid of next_state[].	Lasse Collin	3	-24/+24
	It's not completely obvious if this is better in the decoder. It should be good if compiler can avoid creating a branch (like using CMOV on x86). This also makes lzma_encoder.c use the new macros.
2024-02-14	liblzma: LZMA decoder improvements.	Lasse Collin	3	-200/+210
	This adds macros for bittree decoding which prepares the code for alternative C versions and inline assembly.
2024-02-14	liblzma: Creates Non-resumable and Resumable modes for lzma_decoder.	Jia Tan	2	-213/+521
	The new decoder resumes the first decoder loop in the Resumable mode. Then, the code executes in Non-resumable mode until it detects that it cannot guarantee to have enough input/output to decode another symbol. The Resumable mode is how the decoder has always worked. Before decoding every input bit, it checks if there is enough space and will save its location to be resumed later. When the decoder has more input/output, it jumps back to the correct sequence in the Resumable mode code. When the input/output buffers are large, the Resumable mode is much slower than the Non-resumable because it has more branches and is harder for the compiler to optimize since it is in a large switch block. Early benchmarking shows significant time improvement (8-10% on gcc and clang x86) by using the Non-resumable code as much as possible.
2024-02-14	liblzma: Creates separate "safe" range decoder mode.	Jia Tan	2	-103/+82
	The new "safe" range decoder mode is the same as old range decoder, but now the default behavior of the range decoder will not check if there is enough input or output to complete the operation. When the buffers are close to fully consumed, the "safe" operations must be used instead. This will improve speed because it will reduce the number of branches needed for most of the range decoder operations.
2024-02-14	doxygen/footer.html: Add missing closing tags and don't open a new tab.	Lasse Collin	1	-2/+4
	The footer template from Doxygen has the closing </body> </html> as Doxygen doesn't add them otherwise. target="_blank" was omitted as it's not useful here but it can be slightly annoying as one cannot just go back in the browser history. Since the footer links to the license file in the same directory and not to CC website, the rel attributes can be omitted.
2024-02-14	Tweak the expressions in AUTHORS.	Lasse Collin	1	-8/+23

2024-02-14	Translations: Add the man page translators into man page header comment.	Lasse Collin	3	-7/+26
	It looked odd to only have the original English authors listed in the header comments of the translated files.
2024-02-14	Translations: Translate also messages of lzmainfo.	Lasse Collin	1	-0/+2
	lzmainfo has had translation support since 2009 at least but it was never added to po/POTFILES.in so the messages weren't translated. It's a very rarely needed tool so it's not too bad. This also adds src/xz/mytime.c to po/POTFILES.in although there are no translatable strings. It's simpler this way so that it won't be forgotten if strings were ever added to that file.
2024-02-14	Translations: Add custom .pot header with SPDX license identifier.	Lasse Collin	3	-0/+16
	The same is used for both po/xz.pot and po4a/xz-man.pot.
2024-02-14	Translations: po4a/update-po: Add copyright notice to xz-man.pot.	Lasse Collin	1	-1/+1
	All man pages are under 0BSD now so this is simple now.
2024-02-14	Update COPYING about the man pages of the scripts.	Lasse Collin	1	-3/+3

2024-02-14	xzdiff, xzgrep, and xzmore: Rewrite the man pages.	Lasse Collin	3	-116/+173
	The main reason is a kind of silly one: xz-man.pot contains strings from all man pages in XZ Utils. The man pages of xzdiff, xzgrep, and xzmore were under GPLv2 and the rest under 0BSD. Thus xz-man.pot contained strings under two licences. po4a creates the translated man pages from the combined 0BSD+GPLv2 xz-man.pot. I haven't liked this mixing in xz-man.pot but the Translation Project requires that all man pages must be in the same .pot file. So a separate xz-man-gpl.pot wasn't an option. Since these man pages are short, rewriting them was quick enough. Now xz-man.pot is entirely under 0BSD and marking the per-file licenses is simpler. As a bonus, some wording hopefully is now slightly better although it's perhaps a matter of taste. NOTE: In xzgrep.1, the EXIT STATUS section was written by me in the commit d796b6d7fdb8b7238b277056cf9146cce25db604 so that's why that section could be taken as is from the old xzgrep.1.
2024-02-14	xzless: Update man page slightly.	Lasse Collin	1	-4/+4
	The xz tool can decompress three file formats and xzless has always supported uncompressed files too.
2024-02-14	Translations: Change po/Makevars to add a copyright notice to po/xz.pot.	Lasse Collin	1	-2/+2

2024-02-14	Translations: Update po/Makevars to use the template from gettext 0.22.4.	Lasse Collin	1	-5/+46
	Also add SPDX license identifier now that there is a known license.
2024-02-14	liblzma: Include the SPDX license identifier 0BSD to generated files.	Lasse Collin	11	-26/+50
	Perhaps the generated files aren't even copyrightable but using the same license for them as for the rest of the liblzma keeps things more consistent for tools that look for license info.
2024-02-14	liblzma: Fix compilation of price_tablegen.c.	Lasse Collin	2	-1/+9
	It is built and run only manually so this didn't matter unless one wanted to regenerate the price_table.c.
2024-02-14	Add SPDX license identifiers to GPL, LGPL, and FSFULLR files.	Lasse Collin	22	-6/+37

2024-02-14	Add SPDX license identifier into 0BSD source code files.	Lasse Collin	290	-58/+588

2024-02-14	liblzma: Sync the AUTHORS fix about SHA-256 to lzma.h.	Lasse Collin	1	-6/+4

2024-02-14	Change most public domain parts to 0BSD.	Lasse Collin	288	-911/+100
	Translations and doc/xz-file-format.txt and doc/lzma-file-format.txt were not touched. COPYING.0BSD was added.
2024-02-14	Fix SHA-256 authors.	Lasse Collin	2	-14/+6
	The initial commit 5d018dc03549c1ee4958364712fb0c94e1bf2741 in 2007 had a comment in sha256.c that the code is based on Crypto++ Library 5.5.1. In 2009 the Authors list in sha256.c and the AUTHORS file was updated with information that the code had come from Crypto++ but via 7-Zip. I know I had viewed 7-Zip's SHA-256 code but back then the C code has been identical enough with Crypto++, so I don't why I thought the author info would need that extra step via 7-Zip for this single file. Another error is that I had mixed sha.* and shacal2.* files when checking for author info in Crypto++. The shacal2.* files aren't related to liblzma's sha256.c and thus Kevin Springle's code in Crypto++ isn't either.
2024-02-14	Remove macosx/build.sh.	Lasse Collin	2	-114/+0
	It was last updated in 2013.
2024-02-14	Doc: Remove doc/examples_old.	Lasse Collin	3	-255/+0
	It was good to keep these around in parallel with the newer examples but I think it's OK to remove the old ones at this point.
2024-02-13	Tests: Add RISC-V filter support in a few places.	Jia Tan	2	-0/+12

2024-02-13	liblzma: Fix build error if only RISC-V BCJ filter is enabled.	Jia Tan	1	-1/+3
	If any other BCJ filter was enabled for encoding or decoding, then this was not a problem.
2024-02-13	Translations: Update the Korean translation.	Jia Tan	1	-242/+284

2024-02-13	Translations: Update the Korean man page translations.	Jia Tan	1	-605/+770

2024-02-13	Translations: Update the Chinese (simplified) translation.	Jia Tan	1	-156/+268

2024-02-09	xzless: Use \|\|- in LESSOPEN with with "less" 451 and newer.	Lasse Collin	1	-1/+8

2024-02-09	xzless: Use --show-preproc-errors with "less" 632 and newer.	Lasse Collin	1	-2/+9
	This makes "less" show a warning if a decompression error occurred.
2024-02-09	liblzma: Fix typo discovered by codespell.	Jia Tan	1	-1/+1

2024-02-09	Translations: Update the Swedish translation.	Jia Tan	1	-166/+254

2024-02-08	Translations: Update the Spanish translation.	Jia Tan	1	-11/+11

2024-02-07	Translations: Update the Spanish translation.	Jia Tan	1	-166/+253

2024-02-07	Translations: Update the Polish translation.	Jia Tan	1	-162/+249

2024-02-07	Translations: Update the German translation.	Jia Tan	1	-154/+242

2024-02-07	Translations: Update the German man page translations.	Jia Tan	1	-601/+752

2024-02-06	Translations: Update the Romanian translation.	Jia Tan	1	-164/+252

2024-02-06	Translations: Update the Romanian man page translations.	Jia Tan	1	-793/+966

2024-02-07	Translations: Update the Ukrainian translation.	Jia Tan	1	-155/+242

2024-02-06	Translations: Update the Ukrainian man page translations.	Jia Tan	1	-599/+764

2024-02-02	Update AUTHORS.	Jia Tan	1	-1/+2

2024-02-02	liblzma: Update Authors list in crc32_arm64.h.	Jia Tan	1	-0/+1

2024-02-01	liblzma: Check HAVE_USABLE_CLMUL before omitting CRC32 table.	Jia Tan	1	-2/+2
	This was split from the prior commit so it could be easily applied to the 5.4 branch. Closes: https://github.com/tukaani-project/xz/pull/77
2024-02-01	liblzma: Check HAVE_USABLE_CLMUL before omitting CRC64 table.	Jia Tan	1	-2/+2
	If liblzma is configured with --disable-clmul-crc CFLAGS="-msse4.1 -mpclmul", then it will fail to compile because the generic version must be used but the CRC tables were not included.
2024-02-01	liblzma: Only use ifunc in crcXX_fast.c if its needed.	Jia Tan	2	-6/+6
	The code was using HAVE_FUNC_ATTRIBUTE_IFUNC instead of CRC_USE_IFUNC. With ARM64, ifunc is incompatible because it requires non-inline function calls for runtime detection.
2024-02-01	Docs: Add --disable-arm64-crc32 description to INSTALL.	Jia Tan	1	-1/+11

2024-02-01	liblzma: Omit CRC tables when not needed with ARM64 optimizations.	Jia Tan	3	-5/+25
	This is similar to the existing x86-64 CLMUL conditions to omit the tables. They were slightly refactored to improve readability.
2024-02-01	liblzma: Rename crc32_aarch64.h to crc32_arm64.h.	Jia Tan	6	-116/+122
	Even though the proper name for the architecture is aarch64, this project uses ARM64 throughout. So the rename is for consistency. Additionally, crc32_arm64.h was slightly refactored for the following changes: * Added MSVC, FreeBSD, and macOS support in is_arch_extension_supported(). * crc32_arch_optimized() now checks the size when aligning the buffer. * crc32_arch_optimized() loop conditions were slightly modified to avoid both decrementing the size and incrementing the buffer pointer. * Use the intrinsic wrappers defined in <arm_acle.h> because GCC and Clang name them differently. * Minor spacing and comment changes.
2024-02-01	liblzma: Refactor crc_common.h.	Jia Tan	3	-42/+82
	The CRC_GENERIC is now split into CRC32_GENERIC and CRC64_GENERIC, since the ARM64 optimizations will be different between CRC32 and CRC64. For the same reason, CRC_ARCH_OPTIMIZED is split into CRC32_ARCH_OPTIMIZED and CRC64_ARCH_OPTIMIZED. ifunc will only be used with x86-64 CLMUL because the runtime detection methods needed with ARM64 are not compatible with ifunc.
2024-02-01	CMake: Add support for ARM64 CRC32 instruction detection.	Jia Tan	1	-0/+50

2024-02-01	Build: Add support for ARM64 CRC32 instruction detection.	Jia Tan	1	-0/+52
	This adds --enable-arm64-crc32/--disable-arm64-crc32 (enabled by default) for using the ARM64 CRC32 instruction. This can be disabled if one knows the binary will never need to run on an ARM64 machine with this instruction extension.
2024-01-27	Speed up CRC32 calculation on ARM64	Chenxi Mao	6	-9/+130
	The CRC32 instructions in ARM64 can calculate the CRC32 result for 8 bytes in a single operation, making the use of ARM64 instructions much faster compared to the general CRC32 algorithm. Optimized CRC32 will be enabled if ARM64 has CRC extension running on Linux. Signed-off-by: Chenxi Mao <chenxi.mao2013@gmail.com>
2024-01-26	Bump version number for 5.5.1alpha.larhzu/v5.5.1alpha	Jia Tan	3	-3/+3

2024-01-26	Add NEWS for 5.5.1alpha	Jia Tan	1	-0/+80

2024-01-26	Add NEWS for 5.4.6.	Jia Tan	1	-0/+22

2024-01-24	Move doc/logo/xz-logo.png to "doc" and Doxygen footer to "doxygen".	Lasse Collin	4	-3/+3
	The footer isn't a complete HTML file so having it in the doxygen directory is a tiny bit clearer.
2024-01-25	README: Add COPYING.CC-BY-SA-4.0 entry to section 1.1.	Jia Tan	1	-18/+20
	The Overall documentation section (1.1) table spacing had to be adjusted since the filename was very long.
2024-01-25	Build: Add the logo and license to the release.	Jia Tan	1	-0/+2

2024-01-25	COPYING: Add the license for the XZ logo.	Jia Tan	2	-0/+432

2024-01-25	Doxygen: Added the XZ logo and copyright information.	Jia Tan	3	-3/+14
	The PROJECT_LOGO field is now used to include the XZ logo. The footer of each page now lists the copyright information instead of the default footer. The license is also copied to statisfy the copyright and so the link in the documentation can be local.
2024-01-23	xz: Use threaded mode by defaut (as if --threads=0 was used).	Lasse Collin	3	-3/+16
	This hopefully does more good than bad: + It's faster by default. + Only the threaded compressor creates files that can be decompressed in threaded mode. - Compression ratio is worse, usually not too much though. When it matters, -T1 must be used. - Memory usage increases. - Scripts that assume single-threaded mode but don't use -T1 will possibly use too much resources, for example, if they run multiple xz processes in parallel to compress multiple files. - Output from single-threaded and multi-threaded compressors differ but such changes could happen for other reasons too (they just haven't happened since 5.0.0).
2024-01-23	CI: Use RISC-V filter when building with BCJ support.	Jia Tan	1	-2/+2

2024-01-23	Tests: Use smaller dictionary size in RISC-V test files.	Jia Tan	2	-0/+0

2024-01-23	Tests: Skip RISC-V test files if decoder was not built.	Jia Tan	1	-0/+5

2024-01-23	xz: Man page: Add more examples of LZMA2 options with BCJ filters.	Lasse Collin	1	-7/+31

2024-01-23	liblzma: RISC-V filter: Use byte-by-byte access.	Lasse Collin	1	-30/+84
	Not all RISC-V processors support fast unaligned access so it's better to read only one byte in the main loop. This can be faster even on x86-64 when compared to reading 32 bits at a time as half the time the address is only 16-bit aligned. The downside is larger code size on archs that do support fast unaligned access.
2024-01-23	xz: Update xz -lvv for RISC-V filter.	Jia Tan	1	-0/+10
	Version 5.6.0 will be shown, even though upcoming alphas and betas will be able to support this filter. 5.6.0 looks nicer in the output and people shouldn't be encouraged to use an unstable version in production in any way.
2024-01-23	Tests: Add two RISC-V Filter test files.	Jia Tan	3	-0/+8
	These test files achieve 100% code coverage in src/liblzma/simple/riscv.c. They contain all of the instructions that should be filtered and a few cases that should not.
2024-01-23	xz: Update message in --long-help for RISC-V Filter.	Jia Tan	1	-0/+1

2024-01-23	xz: Update the man page for the RISC-V Filter.	Jia Tan	1	-1/+2
	A special note was added to suggest using four-byte alignment when the compressed instruction extension is not present in a RISC-V binary.
2024-01-23	Tests: Add RISC-V Filter test in test_compress.sh.	Jia Tan	1	-0/+1

2024-01-23	liblzma: Update string_conversion.c to support RISC-V Filter.	Jia Tan	1	-0/+5

2024-01-23	CMake: Support RISC-V BCJ Filter for encoding and decoding.	Jia Tan	1	-0/+1

2024-01-23	liblzma: Add RISC-V BCJ filter.	Jia Tan	9	-2/+742
	The new Filter ID is 0x0B. Thanks to Chien Wong <m@xv97.com> for the initial version of the Filter, the xz CLI updates, and the Autotools build system modifications. Thanks to Igor Pavlov for his many contributions to the design of the filter.
2024-01-19	Docs: Update .xz file format specification to 1.2.0.	Jia Tan	1	-12/+17
	The new RISC-V filter was added to the specification, in addition to updating the specification URL.
2024-01-19	xz: Update website URLs in the man pages.	Jia Tan	2	-5/+5

2024-01-19	liblzma: Update website URL.	Jia Tan	2	-4/+4

2024-01-19	Docs: Update website URLs.	Jia Tan	6	-15/+17

2024-01-19	Build: Update website URL.	Jia Tan	2	-2/+2

2024-01-11	liblzma: CRC: Add a comment to crc_x86_clmul.h about BUILDING_ macros.	Lasse Collin	1	-0/+6

2024-01-11	liblzma: CRC: Remove crc_always_inline, use lzma_always_inline instead.	Lasse Collin	2	-21/+1
	Now crc_simd_body() in crc_x86_clmul.h is only called once in a translation unit, we no longer need to be so cautious about ensuring the always-inline behavior.
2024-01-11	liblzma: CRC: Update CLMUL comments to more generic wording.	Lasse Collin	2	-13/+13

2024-01-11	liblzma: Rename arch-specific CRC functions and macros.	Lasse Collin	4	-25/+31
	CRC_CLMUL was split to CRC_ARCH_OPTIMIZED and CRC_X86_CLMUL. CRC_ARCH_OPTIMIZED is defined when an arch-optimized version is used. Currently the x86 CLMUL implementations are the only arch-optimized versions, and these also use the CRC_x86_CLMUL macro to tell when crc_x86_clmul.h needs to be included. is_clmul_supported() was renamed to is_arch_extension_supported(). crc32_clmul() and crc64_clmul() were renamed to crc32_arch_optimized() and crc64_arch_optimized(). This way the names make sense with arch-specific non-CLMUL implementations as well.
2024-01-11	liblzma: Fix a comment in crc_common.h.	Lasse Collin	1	-1/+2

2024-01-11	liblzma: Avoid extern lzma_crc32_clmul() and lzma_crc64_clmul().	Lasse Collin	7	-91/+91
	A CLMUL-only build will have the crcxx_clmul() inlined into lzma_crcxx(). Previously a jump to the extern lzma_crcxx_clmul() was needed. Notes about shared liblzma on ELF platforms: - On platforms that support ifunc and -fvisibility=hidden, this was silly because CLMUL-only build would have that single extra jump instruction of extra overhead. - On platforms that support neither -fvisibility=hidden nor linker version script (liblzma*.map), jumping to lzma_crcxx_clmul() would go via PLT so a few more instructions of overhead (still not a big issue but silly nevertheless). There was a downside with static liblzma too: if an application only needs lzma_crc64(), static linking would make the linker include the CLMUL code for both CRC32 and CRC64 from crc_x86_clmul.o even though the CRC32 code wouldn't be needed, thus increasing code size of the executable (assuming that -ffunction-sections isn't used). Also, now compilers are likely to inline crc_simd_body() even if they don't support the always_inline attribute (or MSVC's __forceinline). Quite possibly all compilers that build the code do support such an attribute. But now it likely isn't a problem even if the attribute wasn't supported. Now all x86-specific stuff is in crc_x86_clmul.h. If other archs The other archs can then have their own headers with their own is_clmul_supported() and crcxx_clmul(). Another bonus is that the build system doesn't need to care if crc_clmul.c is needed. is_clmul_supported() stays as inline function as it's not needed when doing a CLMUL-only build (avoids a warning about unused function).
2024-01-11	liblzma: crc_clmul.c: Add crc_attr_target macro.	Lasse Collin	1	-14/+16
	This reduces the number of the complex #if directives.
2024-01-11	liblzma: Simplify existing cases with lzma_attr_no_sanitize_address.	Lasse Collin	1	-9/+3

2024-01-11	liblzma: #define crc_attr_no_sanitize_address in crc_common.h.	Lasse Collin	1	-0/+10

2024-01-10	liblzma: CRC: Add empty lines.	Lasse Collin	3	-1/+5
	And remove one too.
2024-01-10	liblzma: crc_clmul.c: Tidy up the location of MSVC pragma.	Lasse Collin	1	-2/+2
	It makes no difference in practice.
2023-12-28	Update THANKS.	Lasse Collin	1	-0/+1

2023-12-28	liblzma: Use 8-byte method in memcmplen.h on ARM64.	Lasse Collin	1	-8/+10
	It requires fast unaligned access to 64-bit integers and a fast instruction to count leading zeros in a 64-bit integer (__builtin_ctzll()). This perhaps should be enabled on some other archs too. Thanks to Chenxi Mao for the original patch: https://github.com/tukaani-project/xz/pull/75 (the first commit) According to the numbers there, this may improve encoding speed by about 3-5 %. This enables the 8-byte method on MSVC ARM64 too which should work but wasn't tested.
2023-12-28	liblzma: Check also for __clang__ in memcmplen.h.	Lasse Collin	1	-1/+2
	This change hopefully makes no practical difference as Clang likely was detected via __GNUC__ or _MSC_VER already.
2023-12-21	Translations: Update the French translation.	Jia Tan	1	-262/+370

2023-12-21	xz: Add a comment to Capsicum sandbox setup.	Jia Tan	1	-0/+1
	This comment is repeated in xzdec.c to help remind us why all the capabilities are removed from stdin in certain situations.
2023-12-21	Docs: Update --enable-sandbox option in INSTALL.	Jia Tan	1	-7/+10
	xzdec now also uses the sandbox when its configured.
2023-12-21	CMake: Move sandbox detection outside of xz section.	Jia Tan	1	-80/+98
	The sandbox is now enabled for xzdec as well, so it no longer belongs in just the xz section. xz and xzdec are always built, except for older MSVC versions, so there isn't a need to conditionally show the sandbox configuration. CMake will do a little unecessary work on older MSVC versions that can't build xz or xzdec, but this is a very small downside.
2023-12-20	Build: Allow sandbox to be configured for just xzdec.	Jia Tan	1	-5/+5
	If xz is disabled, then xzdec can still use the sandbox.
2023-12-19	xzdec: Add sandbox support for Pledge, Capsicum, and Landlock.	Jia Tan	1	-7/+139
	A very strict sandbox is used when the last file is decompressed. The likely most common use case of xzdec is to decompress a single file. The Pledge sandbox is applied to the entire process with slightly more relaxed promises, until the last file is processed. Thanks to Christian Weisgerber for the initial patch adding Pledge sandboxing.
2023-12-20	liblzma: Initialize lzma_lz_encoder pointers with NULL.	Jia Tan	1	-1/+5
	This fixes the recent change to lzma_lz_encoder that used memzero instead of the NULL constant. On some compilers the NULL constant (always 0) may not equal the NULL pointer (this only needs to guarentee to not point to valid memory address). Later code compares the pointers to the NULL pointer so we must initialize them with the NULL pointer instead of 0 to guarentee code correctness.
2023-12-16	liblzma: Set all values in lzma_lz_encoder to NULL after allocation.	Jia Tan	1	-3/+1
	The first member of lzma_lz_encoder doesn't necessarily need to be set to NULL since it will always be set before anything tries to use it. However the function pointer members must be set to NULL since other functions rely on this NULL value to determine if this behavior is supported or not. This fixes a somewhat serious bug, where the options_update() and set_out_limit() function pointers are not set to NULL. This seems to have been forgotten since these function pointers were added many years after the original two (code() and end()). The problem is that by not setting this to NULL we are relying on the memory allocation to zero things out if lzma_filters_update() is called on a LZMA1 encoder. The function pointer for set_out_limit() is less serious because there is not an API function that could call this in an incorrect way. set_out_limit() is only called by the MicroLZMA encoder, which must use LZMA1 where set_out_limit() is always set. Its currently not possible to call set_out_limit() on an LZMA2 encoder at this time. So calling lzma_filters_update() on an LZMA1 encoder had undefined behavior since its possible that memory could be manipulated so the options_update member pointed to a different instruction sequence. This is unlikely to be a bug in an existing application since it relies on calling lzma_filters_update() on an LZMA1 encoder in the first place. For instance, it does not affect xz because lzma_filters_update() can only be used when encoding to the .xz format. This is fixed by using memzero() to set all members of lzma_lz_encoder to NULL after it is allocated. This ensures this mistake will not occur here in the future if any additional function pointers are added.
2023-12-16	liblzma: Tweak a comment.	Jia Tan	1	-1/+1

2023-12-16	liblzma: Make parameter names in function definition match declaration.	Jia Tan	1	-4/+4
	lzma_raw_encoder() and lzma_raw_encoder_init() used "options" as the parameter name instead of "filters" (used by the declaration). "filters" is more clear since the parameter represents the list of filters passed to the raw encoder, each of which contains filter options.
2023-12-16	liblzma: Improve lzma encoder init function consistency.	Jia Tan	1	-0/+3
	lzma_encoder_init() did not check for NULL options, but lzma2_encoder_init() did. This is more of a code style improvement than anything else to help make lzma_encoder_init() and lzma2_encoder_init() more similar.
2023-12-16	Docs: Update repository URL in Changelog.	Jia Tan	1	-1/+1

2023-12-15	CI: Update Upload Artifact Action.	Jia Tan	2	-2/+2

2023-12-07	Tests: Silence -Wsign-conversion warning on GCC version < 10.	Jia Tan	1	-1/+1
	Since GCC version 10, GCC no longer complains about simple implicit integer conversions with Arithmetic operators. For instance: uint8_t a = 5; uint32_t b = a + 5; Give a warning on GCC 9 and earlier but this: uint8_t a = 5; uint32_t b = (a + 5) * 2; Gives a warning with GCC 10+.
2023-12-07	Update THANKS.	Jia Tan	1	-0/+1

2023-12-07	Tests: Minor cleanups to OSS-Fuzz files.	Jia Tan	7	-56/+66
	Most of these fixes are small typos and tweaks. A few were caused by bad advice from me. Here is the summary of what is changed: - Author line edits - Small comment changes/additions - Using the return value in the error messages in the fuzz targets' coder initialization code - Removed fuzz_encode_stream.options. This set a max length, which may prevent some worthwhile code paths from being properly exercised. - Removed the max_len option from fuzz_decode_stream.options for the same reason as fuzz_encode_stream. The alone decoder fuzz target still has this restriction. - Altered the dictionary contents for fuzz_lzma.dict. Instead of keeping the properties static and varying the dictionary size, the properties are varied and the dictionary size is kept small. The dictionary size doesn't have much impact on the code paths but the properties do. Closes: https://github.com/tukaani-project/xz/pull/73
2023-12-07	Tests: Add fuzz_encode_stream ossfuzz target.	Maksym Vatsyk	2	-0/+81
	This fuzz target handles .xz stream encoding. The first byte of input is used to dynamically set the preset level in order to increase the fuzz coverage of complex critical code paths.
2023-12-07	Tests: Add fuzz_decode_alone OSS-Fuzz target	Maksym Vatsyk	3	-0/+66
	This fuzz target that handles LZMA alone decoding. A new fuzz dictionary .dict was also created with common LZMA header values to help speed up the discovery of valid headers.
2023-12-07	Tests: Update OSS-Fuzz Makefile.	Maksym Vatsyk	1	-4/+9
	All .c files can be built as separate fuzz targets. This simplifies the Makefile by allowing us to use wildcards instead of having a Makefile target for each fuzz target.