Forum: >>> Magnum BBS <<<

Basics of LZ77 Algorithm

From yasar11732@gmail.com@21:1/5 to All on Tue Jun 11 08:11:11 2019

Hello,

I am trying to understand how the LZ77 algorithm works. From what I have read in various sources, I have come to the following conclusion:

According to the LZ77 compression algorithm, if I encode the jump (offset) and the length using 4 bits each, and the character using 8 bits, I will use 16 bits for each token. If the text I am compressing doesn't have any repetition, every token will carry just one character, so I will actually double the size of my input.

I was wondering if I had arrived at the correct conclusion, because it doesn't sound right.
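
To make the arithmetic concrete, here is a minimal sketch of a toy LZ77 coder using the token sizes described above (4-bit jump, 4-bit length, 8-bit character). The function names and the exact bit packing (jump in the high nibble, length in the low nibble) are assumptions made for illustration; real LZ77-family formats differ in many details.

```python
def lz77_encode(data: bytes) -> bytes:
    """Toy LZ77: each token is 2 bytes = (4-bit jump, 4-bit length) + literal."""
    out = bytearray()
    pos = 0
    while pos < len(data):
        best_off, best_len = 0, 0
        # Always leave one byte for the literal that ends the token.
        max_len = min(15, len(data) - pos - 1)
        # Search the last 15 bytes for the longest match.
        for off in range(1, min(15, pos) + 1):
            length = 0
            while length < max_len and data[pos - off + length] == data[pos + length]:
                length += 1
            if length > best_len:
                best_off, best_len = off, length
        out.append((best_off << 4) | best_len)  # assumed nibble packing
        out.append(data[pos + best_len])        # literal that follows the match
        pos += best_len + 1
    return bytes(out)


def lz77_decode(tokens: bytes) -> bytes:
    """Inverse of lz77_encode; copies byte by byte so overlapping matches work."""
    out = bytearray()
    for i in range(0, len(tokens), 2):
        off, length = tokens[i] >> 4, tokens[i] & 0x0F
        for _ in range(length):
            out.append(out[-off])
        out.append(tokens[i + 1])
    return bytes(out)


unique = bytes(range(32))   # 32 distinct bytes, no repetition at all
repeated = b"a" * 32        # maximally repetitive input

print(len(unique), "->", len(lz77_encode(unique)))      # 32 -> 64
print(len(repeated), "->", len(lz77_encode(repeated)))  # 32 -> 6
```

With no repetition every token carries exactly one character, so the output is twice the input. With heavy repetition a single 2-byte token can cover up to 16 bytes.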

Thanks in advance,

Yaşar Arabacı

--- SoupGate-Win32 v1.05
* Origin: fsxNet Usenet Gateway (21:1/5)

From Scott@21:1/5 to yasar11732@gmail.com on Tue Jun 11 22:05:38 2019

On Tue, 11 Jun 2019 08:11:11 -0700 (PDT), yasar11732@gmail.com wrote:

> I am trying to understand how the LZ77 algorithm works. From what I have read in various sources, I have come to the following conclusion:

> According to the LZ77 compression algorithm, if I encode the jump (offset) and the length using 4 bits each, and the character using 8 bits, I will use 16 bits for each token. If the text I am compressing doesn't have any repetition, I will actually double the size of my input.

> I was wondering if I had arrived at the correct conclusion, because it doesn't sound right.

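It does sound odd at first, but that's more or less right. With that
token format, text with no repetition costs 16 bits for every 8-bit
character. More generally, it's a fundamental fact of information
theory and applies to all (lossless) compression methods. There are
2^N possible messages of N bits but only 2^N - 1 bit strings shorter
than N bits, so no method can shrink every message. Basically, over
the set of all possible messages of length N, the average length of
the corresponding compressed messages is at least N for any uniquely
decodable encoding.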
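
One way to see why no lossless scheme can shrink every input is to count bit strings. The short sketch below (the function names are made up for illustration) compares the number of N-bit inputs with the number of strictly shorter outputs available to receive them.

```python
def inputs_of_length(n_bits: int) -> int:
    """Number of distinct messages that are exactly n_bits long."""
    return 2 ** n_bits


def strictly_shorter_outputs(n_bits: int) -> int:
    """Number of distinct bit strings of length 0 .. n_bits - 1."""
    return sum(2 ** k for k in range(n_bits))


for n in (1, 4, 8, 16):
    # There is always exactly one more input than there are shorter outputs.
    print(n, inputs_of_length(n), strictly_shorter_outputs(n))
```

Since a lossless compressor must map different inputs to different outputs, at least one N-bit input has to map to an output of N bits or more.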

So yes, every method will have certain inputs that produce larger
outputs. The trick to practical compression is to find algorithms that
work well with patterns that you find in certain useful inputs, which
AIUI is how LZ was designed. Then you check as you go, and if you have
a chunk that is anti-compressible, you just store it without
compression.
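
The "store it without compression" fallback can be sketched as follows. This is only an illustration: it uses Python's zlib (DEFLATE, which builds on LZ77) as a stand-in compressor, and the one-byte RAW/DEFLATED tags and the function names are hypothetical, not part of any real format.

```python
import random
import zlib

# Hypothetical one-byte tags marking how a chunk was stored.
RAW, DEFLATED = b"\x00", b"\x01"


def pack_chunk(chunk: bytes) -> bytes:
    """Compress a chunk, but fall back to storing it raw if that is smaller."""
    compressed = zlib.compress(chunk)
    if len(compressed) < len(chunk):
        return DEFLATED + compressed
    return RAW + chunk


def unpack_chunk(packed: bytes) -> bytes:
    """Inverse of pack_chunk."""
    tag, body = packed[:1], packed[1:]
    if tag == DEFLATED:
        return zlib.decompress(body)
    if tag == RAW:
        return body
    raise ValueError("unknown chunk tag")


rng = random.Random(0)
noisy = bytes(rng.getrandbits(8) for _ in range(1000))  # pseudo-random bytes
repetitive = b"abc" * 333 + b"a"                        # 1000 bytes, very regular

for name, chunk in (("noisy", noisy), ("repetitive", repetitive)):
    packed = pack_chunk(chunk)
    assert unpack_chunk(packed) == chunk
    print(name, len(chunk), "->", len(packed))
```

With this layout the worst case costs one extra byte per chunk rather than doubling the input.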


From flemingsarah015@gmail.com@21:1/5 to yasar...@gmail.com on Mon Jul 1 10:49:13 2019

On Tuesday, June 11, 2019 at 4:11:12 PM UTC+1, yasar...@gmail.com wrote:

