xz.git - XZ Utils

Age	Commit message (Collapse)	Author	Files	Lines
2024-02-14	Add SPDX license identifier into 0BSD source code files.	Lasse Collin	1	-0/+2

2024-02-14	Change most public domain parts to 0BSD.	Lasse Collin	1	-3/+0
	Translations and doc/xz-file-format.txt and doc/lzma-file-format.txt were not touched. COPYING.0BSD was added.
2023-12-20	liblzma: Initialize lzma_lz_encoder pointers with NULL.	Jia Tan	1	-1/+5
	This fixes the recent change to lzma_lz_encoder that used memzero instead of the NULL constant. On some compilers the NULL constant (always 0) may not equal the NULL pointer (this only needs to guarentee to not point to valid memory address). Later code compares the pointers to the NULL pointer so we must initialize them with the NULL pointer instead of 0 to guarentee code correctness.
2023-12-16	liblzma: Set all values in lzma_lz_encoder to NULL after allocation.	Jia Tan	1	-3/+1
	The first member of lzma_lz_encoder doesn't necessarily need to be set to NULL since it will always be set before anything tries to use it. However the function pointer members must be set to NULL since other functions rely on this NULL value to determine if this behavior is supported or not. This fixes a somewhat serious bug, where the options_update() and set_out_limit() function pointers are not set to NULL. This seems to have been forgotten since these function pointers were added many years after the original two (code() and end()). The problem is that by not setting this to NULL we are relying on the memory allocation to zero things out if lzma_filters_update() is called on a LZMA1 encoder. The function pointer for set_out_limit() is less serious because there is not an API function that could call this in an incorrect way. set_out_limit() is only called by the MicroLZMA encoder, which must use LZMA1 where set_out_limit() is always set. Its currently not possible to call set_out_limit() on an LZMA2 encoder at this time. So calling lzma_filters_update() on an LZMA1 encoder had undefined behavior since its possible that memory could be manipulated so the options_update member pointed to a different instruction sequence. This is unlikely to be a bug in an existing application since it relies on calling lzma_filters_update() on an LZMA1 encoder in the first place. For instance, it does not affect xz because lzma_filters_update() can only be used when encoding to the .xz format. This is fixed by using memzero() to set all members of lzma_lz_encoder to NULL after it is allocated. This ensures this mistake will not occur here in the future if any additional function pointers are added.
2023-12-16	liblzma: Tweak a comment.	Jia Tan	1	-1/+1

2023-05-11	liblzma: Creates IS_ENC_DICT_SIZE_VALID() macro.	Jia Tan	1	-3/+1
	This creates an internal liblzma macro to test if the dictionary size is valid for encoding.
2022-11-27	liblzma: Pass the Filter ID to LZ encoder and decoder.	Lasse Collin	1	-2/+3
	This allows using two Filter IDs with the same initialization function and data structures.
2022-11-24	liblzma: Allow nice_len 2 and 3 even if match finder requires 3 or 4.	Lasse Collin	1	-5/+9
	That is, if the specified nice_len is smaller than the minimum of the match finder, silently use the match finder's minimum value instead of reporting an error. The old behavior is annoying to users and it complicates xz options handling too.
2022-11-14	liblzma: Use __attribute__((__constructor__)) if available.	Lasse Collin	1	-1/+1
	This uses it for CRC table initializations when using --disable-small. It avoids mythread_once() overhead. It also means that then --disable-small --disable-threads is thread-safe if this attribute is supported.
2022-07-25	liblzma: Refactor lzma_mf_is_supported() to use a switch-statement.	Jia Tan	1	-18/+14

2021-01-14	liblzma: Add rough support for output-size-limited encoding in LZMA1.	Lasse Collin	1	-0/+16
	With this it is possible to encode LZMA1 data without EOPM so that the encoder will encode as much input as it can without exceeding the specified output size limit. The resulting LZMA1 stream will be a normal LZMA1 stream without EOPM. The actual uncompressed size will be available to the caller via the uncomp_size pointer. One missing thing is that the LZMA layer doesn't inform the LZ layer when the encoding is finished and thus the LZ may read more input when it won't be used. However, this doesn't matter if encoding is done with a single call (which is the planned use case for now). For proper multi-call encoding this should be improved. This commit only adds the functionality for internal use. Nothing uses it yet.
2016-11-21	liblzma: Avoid multiple definitions of lzma_coder structures.	Lasse Collin	1	-25/+32
	Only one definition was visible in a translation unit. It avoided a few casts and temp variables but seems that this hack doesn't work with link-time optimizations in compilers as it's not C99/C11 compliant. Fixes: http://www.mail-archive.com/xz-devel@tukaani.org/msg00279.html
2015-11-04	liblzma: Make Valgrind happier with optimized (gcc -O2) liblzma.	Lasse Collin	1	-0/+4
	When optimizing, GCC can reorder code so that an uninitialized value gets used in a comparison, which makes Valgrind unhappy. It doesn't happen when compiled with -O0, which I tend to use when running Valgrind. Thanks to Rich Prohaska. I remember this being mentioned long ago by someone else but nothing was done back then.
2015-03-07	liblzma: Silence more uint32_t vs. size_t warnings.	Lasse Collin	1	-1/+1

2015-01-26	liblzma: Silence harmless Valgrind errors.	Lasse Collin	1	-0/+6
	Thanks to Torsten Rupp for reporting this. I had forgotten to run Valgrind before the 5.2.0 release.
2014-07-25	liblzma: Use lzma_memcmplen() in the match finders.	Lasse Collin	1	-1/+12
	This doesn't change the match finder output.
2014-05-25	liblzma: Use lzma_alloc_zero() in LZ encoder initialization.	Lasse Collin	1	-40/+44
	This avoids a memzero() call for a newly-allocated memory, which can be expensive when encoding small streams with an over-sized dictionary. To avoid using lzma_alloc_zero() for memory that doesn't need to be zeroed, lzma_mf.son is now allocated separately, which requires handling it separately in normalize() too. Thanks to Vincenzo Innocente for reporting the problem.
2012-07-17	liblzma: Make the use of lzma_allocator const-correct.	Lasse Collin	1	-9/+10
	There is a tiny risk of causing breakage: If an application assigns lzma_stream.allocator to a non-const pointer, such code won't compile anymore. I don't know why anyone would do such a thing though, so in practice this shouldn't cause trouble. Thanks to Jan Kratochvil for the patch.
2011-05-17	Add underscores to attributes (__attribute((__foo__))).	Lasse Collin	1	-1/+1

2010-09-03	liblzma: Adjust default depth calculation for HC3 and HC4.	Lasse Collin	1	-3/+4
	It was 8 + nice_len / 4, now it is 4 + nice_len / 4. This allows faster settings at lower nice_len values, even though it seems that I won't use automatic depth calcuation with HC3 and HC4 in the presets.
2010-06-02	Silence a bogus Valgrind warning.	Lasse Collin	1	-1/+5
	When using -O2 with GCC, it liked to swap two comparisons in one "if" statement. It's otherwise fine except that the latter part, which is seemingly never executed, got executed (nothing wrong with that) and then triggered warning in Valgrind about conditional jump depending on uninitialized variable. A few people find this annoying so do things a bit differently to avoid the warning.
2010-05-26	Rename MIN() and MAX() to my_min() and my_max().	Lasse Collin	1	-1/+1
	This should avoid some minor portability issues.
2010-02-12	Collection of language fixes to comments and docs.	Lasse Collin	1	-1/+1
	Thanks to Jonathan Nieder.
2009-11-14	Fix a design error in liblzma API.	Lasse Collin	1	-0/+17
	Originally the idea was that using LZMA_FULL_FLUSH with Stream encoder would read the filter chain from the same array that was used to intialize the Stream encoder. Since most apps wouldn't use LZMA_FULL_FLUSH, most apps wouldn't need to keep the filter chain available after initializing the Stream encoder. However, due to my mistake, it actually required keeping the array always available. Since setting the new filter chain via the array used at initialization time is not a nice way to do it for a couple of reasons, this commit ditches it and introduces lzma_filters_update(). This new function replaces also the "persistent" flag used by LZMA2 (and to-be-designed Subblock filter), which was also an ugly thing to do. Thanks to Alexey Tourbin for reminding me about the problem that Stream encoder used to require keeping the filter chain allocated.
2009-10-02	Make liblzma produce the same output on both endiannesses.	Lasse Collin	1	-1/+6
	Seems that it is a problem in some cases if the same version of XZ Utils produces different output on different endiannesses, so this commit fixes that problem. The output will still vary between different XZ Utils versions, but I cannot avoid that for now. This commit bloatens the code on big endian systems by 1 KiB, which should be OK since liblzma is bloated already. ;-)
2009-09-11	Fix a couple of warnings.	Lasse Collin	1	-4/+1

2009-04-13	Put the interesting parts of XZ Utils into the public domain.	Lasse Collin	1	-12/+5
	Some minor documentation cleanups were made at the same time.
2009-02-08	Add a separate internal function to initialize the CRC32	Lasse Collin	1	-2/+2
	table, which is used also by LZ encoder. This was needed because calling lzma_crc32() and ignoring the result is a no-op due to lzma_attr_pure.
2009-02-02	Modify LZMA_API macro so that it works on Windows with	Lasse Collin	1	-1/+1
	other compilers than MinGW. This may hurt readability of the API headers slightly, but I don't know any better way to do this.
2009-01-27	Added initial support for preset dictionary for raw LZMA1	Lasse Collin	1	-2/+16
	and LZMA2. It is not supported by the .xz format or the xz command line tool yet.
2008-12-31	Remove lzma_init() and other init functions from liblzma API.	Lasse Collin	1	-0/+6
	Half of developers were already forgetting to use these functions, which could have caused total breakage in some future liblzma version or even now if --enable-small was used. Now liblzma uses pthread_once() to do the initializations unless it has been built with --disable-threads which make these initializations thread-unsafe. When --enable-small isn't used, liblzma currently gets needlessly linked against libpthread (on systems that have it). While it is stupid for now, liblzma will need threads in future anyway, so this stupidity will be temporary only. When --enable-small is used, different code CRC32 and CRC64 is now used than without --enable-small. This made the resulting binary slightly smaller, but the main reason was to clean it up and to handle the lack of lzma_init_check(). The pkg-config file lzma.pc was renamed to liblzma.pc. I'm not sure if it works correctly and portably for static linking (Libs.private includes -pthread or other operating system specific flags). Hopefully someone complains if it is bad. lzma_rc_prices[] is now included as a precomputed array even with --enable-small. It's just 128 bytes now that it uses uint8_t instead of uint32_t. Smaller array seemed to be at least as fast as the more bloated uint32_t array on x86; hopefully it's not bad on other architectures.
2008-09-27	Some API changes, bug fixes, cleanups etc.	Lasse Collin	1	-16/+14

2008-09-17	Miscellaneous LZ and LZMA encoder cleanups	Lasse Collin	1	-2/+6

2008-09-13	Renamed constants:	Lasse Collin	1	-1/+1
	- LZMA_VLI_VALUE_MAX -> LZMA_VLI_MAX - LZMA_VLI_VALUE_UNKNOWN -> LZMA_VLI_UNKNOWN - LZMA_HEADER_ERRRO -> LZMA_OPTIONS_ERROR
2008-09-06	Comments	Lasse Collin	1	-2/+1

2008-09-02	Some fixes to LZ encoder.	Lasse Collin	1	-10/+46

2008-08-28	Sort of garbage collection commit. :-\| Many things are still	Lasse Collin	1	-392/+388
	broken. API has changed a lot and it will still change a little more here and there. The command line tool doesn't have all the required changes to reflect the API changes, so it's easy to get "internal error" or trigger assertions.
2008-06-01	Fix a buffer overflow in the LZMA encoder. It was due to my	Lasse Collin	1	-108/+5
	misunderstanding of the code. There's no tiny fix for this problem, so I also cleaned up the code in general. This reduces the speed of the encoder 2-5 % in the fastest compression mode ("lzma -1"). High compression modes should have no noticeable performance difference. This commit breaks things (especially LZMA_SYNC_FLUSH) but I will fix them once the new format and LZMA2 has been roughly implemented. Plain LZMA won't support LZMA_SYNC_FLUSH at all and won't be supported in the new .lzma format. This may change still but this is what it looks like now. Support for known uncompressed size (that is, LZMA or LZMA2 without EOPM) is likely to go away. This means there will be API changes.
2008-04-25	Prevent LZ encoder from hanging with known uncompressedlarhzu/v4.999.3alpha	Lasse Collin	1	-2/+7
	size. The "fix" breaks LZMA_SYNC_FLUSH at end of stream with known uncompressed size, but since it currently seems likely that support for encoding with known uncompressed size will go away anyway, I'm not fixing this problem now.
2008-04-24	Fix wrong return type (uint32_t -> bool).	Lasse Collin	1	-1/+1

2008-04-24	Fix data corruption in LZ encoder with LZMA_SYNC_FLUSH.	Lasse Collin	1	-0/+16

2008-01-18	Fix LZMA_SYNC_FLUSH handling in LZ and LZMA encoders.	Lasse Collin	1	-8/+26
	That code is now almost completely in LZ coder, where it can be shared with other LZ77-based algorithms in future.
2008-01-14	Major changes to LZ encoder, LZMA encoder, and range encoder.	Lasse Collin	1	-20/+118
	These changes implement support for LZMA_SYNC_FLUSH in LZMA encoder, and move the temporary buffer needed by range encoder from lzma_range_encoder structure to lzma_lz_encoder.
2008-01-10	Eliminate lzma_lz_encoder.must_move_pos. It's needed	Lasse Collin	1	-4/+2
	only in one place which isn't performance criticial.
2007-12-09	Imported to git.	Lasse Collin	1	-0/+481