Papers, Documentation, and web pages that are mostly expository in nature. Note that a high percentage of academic papers are stored in either PostScript or compressed PostScript format.
VCEG is responsible for standardization of the "H.26x" line of video coding standards and related technologies. The FTP has a lot of materials produced by VCEG, including information on prospective H.265.
Created: 16/04/2008
by Maxim SmirnovMore...
Paper on "In-place calculation of minimum-redundancy codes" by Alistair Moffat ,& Jyrki Katajainen. Tells how to deal with Huffman codes in the efficient way.
Created: 16/05/2007
More...
Avisynth's homepage: download, documentation, FAQ, plugins (including a list of external plugins), community.
Created: 18/01/2008
by Maxim SmirnovMore...
Contains the code and tells about BWTS which does not need the index. It's what BWT should have been to be length preserving.
Created: 10/01/2008
More...
A new incremental algorithm for data compression is presented. For a sequence of input symbols algorithm incrementally constructs a p-adic integer number as an output. Decoding process starts with less significant part of a p-adic integer and incrementally reconstructs a sequence of input symbols. Algorithm is based on certain features of p-adic numbers and p-adic norm. p-adic coding algorithm may be considered as of generalization a popular compression technique - arithmetic coding algorithms. It is shown that for p = 2 the algorithm works as integer variant of arithmetic coding; for a special class of models it gives exactly the same codes as Huffman's algorithm, for another special model and a specific alphabet it gives Golomb-Rice codes.
Created: 12/04/2007
More...
Library of technical articles along with code samples written and supported by Andrew Polar. Contains articles and source codes on Huffman and range coder. Readers may found another topics of interest not related to data compression, e.g. a simple Web server in sources.
Created: 11/02/2007
by Maxim SmirnovMore...
Nice article by Andrew Polar on arithmetic and range coders. It can be read even by those not very closely acquainted with data compression. The article contains source code attached. The additional merit of this text is a discussion on several patent-relating issues.
Created: 11/02/2007
by Maxim SmirnovMore...
Nelle immagini compresse con la quantizzazione dei coefficienti DCT si possono individuare senza alcuna difficoltà i bordi dei blocchi usati per la compressione. Questo fenomeno è spiacevole da vedere e quindi si deve tentare la sua riduzione, in questo documento si trattano alcune tecniche per farlo.
Created: 22/01/2007
More...
Paper and source code of an efficient binary adaptive encoder of iid sequences.
Outperforms several existing arithmetic-codes-based implementations (JBIG's Q-coder, H.264's CABAC) on short sequences.
Created: 22/01/2007
More...
Boris Ryabko is a well-known Russian scientist with main scientific interests in Information Theory, Prediction, Complexity of Algorithms,
Cryptography and Mathematical Biology. Probably, he is most known as an inventor of "bookstack" or "move-to-front" coding. There is a number of selected publications and reports regarding Information Theory and compression issues on his homepage. The primary language is English, some papers have Russian version.
Created: 14/10/2006
by Maxim SmirnovMore...
This article by Gary Sullivan and Stephen Estrop describes the 8-bit YUV formats that are recommended for video rendering in the Microsoft Windows operating system. This article presents techniques for converting between YUV and RGB formats, and also provides techniques for upsampling YUV formats. This article is intended for anyone working with YUV video decoding or rendering in Windows.
Created: 28/03/2006
by Maxim SmirnovMore...
The European Broadcasting Union (EBU) is the largest professional association of national broadcasters in the world. Cooperation in the technical sphere is one of the EBU's major activities. The Union is in the forefront of research and development of new broadcast media, and has led or contributed to the development of many new radio and TV systems: radio data system (RDS), digital audio broadcasting (DAB), digital television (DVB), high- definition TV (HDTV). There are a number of open publications on video and audio streaming and broadcasting, relevant marketing and production issues (see EBU Technical Review).
Created: 01/03/2006
by Maxim SmirnovMore...
Quantized Indexing is a new practical form of combinatorial (enumerative) coding. Under proper conditions it can be faster and tighter than arithmetic coding. There is a demonstrative source code, but no off-the-shelf working compressor though. There were quite a lot of talks on QI in comp.compression newsgroup recently. See this thread as an example.
Created: 16/02/2006
by Ratko TomicMore...
Tilman Liebchen from Technical University of Berlin did a lot of research in the field of lossless audio compression. He is the author of LPAC codec and he was actively involved in the development of MPEG-4 Audio Lossless Coding (ALS) standard. There are a number of useful papers on his page, including articles on MPEG-4 ALS. Some papers are in German.
Created: 11/02/2006
by Maxim SmirnovMore...
An overview of the Coding Technologies' aacPlus audio codec. This article describes the principles of traditional audio coders and their limitations when used for low bit-rate applications. The second part describes the basic idea of SBR technology and demonstrates the improvements achieved through the combination of SBR technology with traditional audio coders such as AAC and MP3.
Created: 31/01/2006
by Maxim SmirnovMore...
National Software Testing Labs (NSTL), employed by Microsoft, ran a proctored double-blind test that compared 64 kilobits per second (Kbps) WMA Pro (stereo) to the High Efficiency AAC Profile (HE AAC). The 12 segments of audio were presented by a proctor in a double-blind test to 300 individual listeners of both sexes and over a wide variety of ages. The paper states that 37% of the listeners preferred WMA Pro, 34% expressed identical preferences, and 29% preferred HE AAC.
Created: 31/01/2006
by Maxim SmirnovMore...
The text has some charts on G.711, G.723.1, G.729A performance with increasing packet loss and increasing latency.
Created: 30/01/2006
by Maxim SmirnovMore...
Pizza&Chili Corpus: Paolo Ferragina of the University of Pisa and Gonzalo Navarro of the University of Chile have a web site dedicated to the exploration of compressed indices. Paolo and Gonzalo have posted links to quite a few papers on full text compressed indices, which expound the notion that you can pick and choose exactly what you want to decompress. The site has collections of texts, links to people and papers, and a proposed API for testing work in the future.
Created: 22/01/2006
by Sachin GargMore...
Two papers in which it is show how to combine the BWT with the suffix array
data structure, in order to build a sort of compressed suffix array.
In the first paper it is proven that
the space occupancy of the compressed suffix array can be bounded in
terms of the entropy of the input string. In the second paper it is
proposed and tested a practical implementation of this data structure.\
The first paper will appear in the Proc. of 41st IEEE Symposium on
Foundations of Computer Science; the second one in the Proc. of the 12th
SIAM-ACM Symposium on Discrete Algorithms.
Created: 23/09/2000
by Mark NelsonMore...
Links to the articles from this journal dating back to 1953!
(As former Trans. of the IRE professional group on comm. systems.)
Quite a number of compression-related articles were published
in this title, although it aims at channel coding problems. The bad news is that you are only allowed to see the abstracts if you're not an IEEE member.
Created: 10/10/2005
by Maxim SmirnovMore...
A site of Iain Richardson, author of "Video Codec Design" and "H.264 and MPEG-4 Video Compression" books. The site has a very nice set of H.264 white papers, some useful links and other resources.
Created: 18/11/2001
by Mark NelsonMore...
The aim of this paper is to improve the performances of the JPEG2000. In fact, the approach named PJPEG2000 consists in inserting a phase of pretreatment before initiating the JPEG2000 process. This investigation allows better quality compression. When compressing images, noise is fatal to compression performance, it can be both annoying for the observer and make transmission of the imagery consume excessive amounts of bandwidth. The pretreatment based on the filtering reduces noise and thus improve both visual impression and transmission properties. The comparison between JPEG2000 and PJPEG2000 shows that the latter is favourable in both PSNR and WPSNR and in visual inspection evaluation.
Created: 09/09/2005
by Sachin GargMore...
By Jesper Larsson, Published in Proceedings of the IEEE Data Compression Conference 1996. Hey Jesper, how about putting a PDF version online?
Created: 26/04/2003
by Mark NelsonMore...
Home page of MNG: A PNG-like Image Format Supporting
Multiple Images, Animation and Transparent JPEG. Was released on 31 January 2001.
MNG includes a number of interesting features:
* object or sprite-based approach to animation, with commands to move, copy and paste images (rather than replicate them as in GIF)
* nested loops for complex animations
* way better compression than GIF animations
* support for difference (or ``delta'') images for still better compression
* integration of both PNG and JPEG-based (``JNG'') images
* support for transparent JPEG images
* low-complexity and very low-complexity subsets for simpler implementation
Created: 14/11/1999
by Mark NelsonMore...
An introduction to Shannon's notion of probabilistic entropy, from the viewpoint of ergodic theory and dynamical systems theory. Includes a complete proof a reasonably general (and useful!) version of the Asymptotic Equipartition Principle (AEP). Includes figures.
Created: 12/10/2002
by Mark NelsonMore...
"The Context Trees of Block Sorting Compression" - paper By Jesper Larsson. Published in Proceedings of the IEEE Data Compression Conference 1998, PDF version of the paper.
Created: 26/04/2003
by Mark NelsonMore...
"Universal Data Compression Based on the Burrows and Wheeler Transformation: Theory and Practice" by Bernhard Balkenhol, Stefan Kurtz, 1998. Yet another analysis of the algorithm. Any exciting conclusions?
Created: 01/12/1999
by Mark NelsonMore...
by Balkenhol, Bernhard; Kurtz, Stefan; Shtarkov, Yuri M. The authors discuss some statistical qualities of BWT output and improve on the algorithm's performance. This paper was presented at the 1999 Data Compression Conference.
Created: 05/08/2000
by Mark NelsonMore...
Those folks at AT&T have developed a compressor that can be used to squeeze individual data items in XML documents. AT&T says this is "essentially free" software. Read the license on-line to determine exactly what that means.
Also available on http://sourceforge.net/projects/xmill
Created: 19/12/1999
by Mark NelsonMore...
A grey-scale wavelet compressor written in C. Includes pointers to source code and the paper presented on this work.
Created: 20/08/2000
by Mark NelsonMore...
A big batch of image compression papers are on line here. I guess the entire contents of all the papers is online thanks to the contributions of the US government. For image compression, this is a fantastic resource.
Created: 26/04/2000
by Mark NelsonMore...
Bernhard have written quite a few papers on data compression. A few of them are in German, a few in English. All in ps.gz format. Bernhard has links to published papers and preprints on this page, be sure to check them all.
Created: 02/04/2000
by Mark NelsonMore...
This page describes Flexible Parsing, a proposed extension for dictionary based LZ compression schemes. Yossi Matias, Nasir Rajpoot, and Cenk Sahinalp have a summary of their work on this page, along with links to three PS format papers that go into detail on the results. The authors summarize this improved technique as "looking one step ahead for the longest phrase in the dictionary instead of trying to find the longest possible phrase at hand."
Created: 14/12/1999
by Mark NelsonMore...
by Balkenhol, Kurtz, and Shtarkov. A paper that was published at DCC 1999. Some thoughts about BWT and the context tree model, alphabet modification, modification of MTF, and Grouping of symbols.
NOTE:The file type would indicate that this is a compressed file, but it actually appears to be unencoded PS format.
Created: 01/12/1999
by Mark NelsonMore...
A paper by Brent Chapin and Steve Tate describing some improvements in BWT compression. The abstract says that compression can be improved by alphabet ordering and reflecting the sorted binary strings. This version is in PDF format, a PS format is available as well.
Created: 27/11/1999
by Mark NelsonMore...
The home of Wim Swelden's monthly Wavelet Digest. The home page includes some spiffy search capabilities, back issues, forum and so on.
Created: 19/11/1999
by Mark NelsonMore...
David Scott occupies a unique niche in the world of data compression. He is very interested in how one can adapt existing compression technologies to be bijective, a term you can find defined on his web page. The stated motivation for this is to increase the difficulty of breaking an encrypted version of the file. On this page, David makes a first pass at modifying BWT to be bijective. He doesn't quite get there, but feels that he has taken some good steps towards his goal.
Created: 21/09/2000
by Mark NelsonMore...
Mark Nelson's magazine articles, including articles on LZW coding, arithmetic coding, BWT, Zlib, JavaZip and some other topics.
Created: 23/11/1999
by Mark NelsonMore...
This site contains the comp.speech FAQ, and also has links to their ftp site, which contains software for speech codecs.
Created: 21/11/1999
by Mark NelsonMore...
The documentation page for zlib, the free software that implement's PKWare's deflate algorithm. Contains links to the zlib specification, and the gzip specification. Links to versions all over the world, in HTML, ASCII, or PostScript.
Created: 23/08/2001
by Mark NelsonMore...
Home of Portable Network Graphics (PNG image format), Multiple-image Network Graphics (MNG and JNG image formats) and, of course, libpng (the free reference library for reading and writing PNGs). The source for libpng and zlib. Can't do PNG without them.
Created: 14/11/1999
by Mark NelsonMore...
The page contains prepublished versions of H.263 standard, H.264 / AVC standard, H.264 conformance specification, H.264 reference software specification. Other pages of the site--it's the homepage of the video trace research group at Arizona State University--have some papers and software.
There are also several video testing sequences. The sequences are in the common YUV 4:2:0 format (files are zipped) and in CIF (Bridge close, Bridge far, Highway, Mobile, Paris) and QCIF resolutions (Bridge close, Bridge far, Carphone, Claire, Container, Foreman, Grandma, Highway Mother and Daughter, News, Salesman, Silent).
Created: 11/07/2005
by Maxim SmirnovMore...
Not a bad site on wavelets in common, JPEG2000 and digital signal processing using wavelets. The main content is books, papers, thesises, sources. Partially in Russian, but there are a lot of English papers. There is a steganography page as well.
Created: 11/07/2005
by More...
This ftp site contains the PKZIP and InfoZip appnotes in various incarnations, plus RFCs 1950, 1951, and 1952. It is referenced in the zlib docs as if it were the official location for the gzip specification. If you are attempting to understand the format of PKZIP archives you need this document Copies of RFCs 1950, 1951, and 1952 are here as will.
Created: 07/11/1999
by Mark NelsonMore...
From the site: Computer images are extremely data intensive and hence require large amounts of memory for storage. As a result, the transmission of an image from one machine to another can be very time consuming. By using data compression techniques, it is possible to remove some of the redundant information contained in images, requiring less storage space and less time to transmit. Neural nets can be used for the purpose of image compression, as shown in the following demonstration.
Created: 02/11/1999
by Mark NelsonMore...
A page which links to about 70 online papers & books on the topics of enumerative (combinatorial) coding, arithmetic coding, universal codes. Truly a gem.
There are here some hard to find papers, Ph.D. theses (#23,#36,#42), books (#3 Abramson 1963 classic on Inf. Theory and coding, #37, #38 combinatorics textbook). Among the online papers: #27 from Rissanen (who invented arithmetic coding in 1976) on math of arithmetic coding (dual radix representation). Also interesting and hard to find #28,#33, #11 (Schalkwijk on lattice walks on Pascal triangle for enum. coding), #61-#62 on potentially fast combinatorial codes, #60 fast unranking … etc.
Created: 25/06/2005
by Sachin GargMore...
Charles is the author of PPMZ and has a really great collection of software he has written on this page, much of it is indexed here in the appropriate pages. His home page also contains links to some interesting papers and descriptions on various algorithms, as well as some archived posts that explain some compression topics. It's well worth nosing around this site a bit for more information.
Created: 25/12/1998
by Mark NelsonMore...
Byte pair encoding (BPE) is a simple universal text compression scheme with fast decompression which requires small work space which allows decompression of an arbitrary part of the original text. Compression is rather slow with ratios less than Lempel-Ziv type compression. This paper, brings out a potential advantage of BPE compression showing that it is very suitable from a practical view point of compressed pattern matching, where the goal is to find a pattern directly in
compressed text without decompressing it explicitly.
Created: 12/06/2005
by Sachin GargMore...
Quite a number of information theory classic papers. The parent node http://cg.ensmp.fr/~vert/proj/bibli/ has additional related articles.
Created: 13/06/2005
by Maxim SmirnovMore...
The Waterloo Fractal Compression Project is part of a general research programme dedicated to the study of fractal analysis and Iterated Function Systems/Fractal Transforms from both theoretical as well as practical perspectives.
Created: 14/11/1999
by Mark NelsonMore...
Burrows, M. and Wheeler, D.J., A Block-sorting Lossless Data Compression Algorithm, Digital Systems Research Center Research Report 124, May 1994. The original paper on BWT compression. This very effective compression technique was first described in this paper.
Michael Schindler reviewed it thus:Even through it is the first paper about blocksort it is still excellent literature for introduction to this method and its general ideas..
Note: after many years, the DEC web site hosting this paper finally disappeared, not surprisingly. The new link is via the CiteSeer search engine, which I hope is a permanent solution.
Created: 09/06/2002
by Mark NelsonMore...
An impressive set of links to papers, schedules, articles, and contacts for MPEG. Clearly the place to go for information relating to the standardization process.
Created: 15/05/2004
by Mark NelsonMore...
by A.Ratushnyak. A short discussion related to the compression of audio and pictures. A Library readher had this to say: Worth reading 2 or 3 times.
Created: 18/03/2001
by Mark NelsonMore...
A big batch of pointers to various MPEG documents. Includes press releases and docs on MPEG-2, MPEG-4, MPEG-7, and AAC.
Created: 14/11/1999
by Mark NelsonMore...
The range encoder is a fast multisymbol entropy coder (similar to arithmetic coding) with GNU general public license (other licenses on request). Its compression is within 0.01% of arithmetic coding. It is based on an article dated 1979, so it is believed to be patent free. This page includes a PS format paper by G.N.N Martin describing the range encoder.
Created: 28/10/1999
by Mark NelsonMore...
Alistair Moffat has put together all the links to his source code and articles on Arithmetic Coding in one tidy place.
Created: 19/11/1999
by Mark NelsonMore...
Steven Pigeon's Ph. D Thesis from the University of Montreal. Proposes a new set of universal codes, which he dubs taboo codes, as well as new optimization algorithms for (Start, Step, Stop) codes. Plus lossy variations on LZW.
Created: 17/03/2003
by Mark NelsonMore...
D. T. Hoang, P. M. Long and J. S. Vitter. ``Dictionary Selection using Partial Matching,'' Information Sciences, 119(1-2), 57-72, 1999. This paper describes an attempt to squeeze improved compression out of existing dictionary-based schemes by using multiple context-based dictionaries for encoding.
Created: 08/04/2002
by Mark NelsonMore...
by P. G. Howard and J. S. Vitter. This paper shows how images can be encoded and decoded using parallel processing. Both Huffman and arithmetic coding are examined.
Created: 28/07/2002
by Mark NelsonMore...
This page contains a paper "Improved Huffman coding using recursive splitting" that describes a program that attempts to improve on Huffman compression by manipulation of the data stream.
Created: 09/12/1999
by Mark NelsonMore...
Daniel Ricardo has a new algorithm for generation of Huffman codes. He claims superior performance and small memory footprint. Algorithm description here, but no code.
Created: 22/09/2001
by Mark NelsonMore...
It's a page on Huffman and Shannon-Fano methods on the Russian compression.ru site. It contains a number of papers and sources, and the major part is in English. It's unlikely that you will have any problems with downloading, since the titles are in English.
Created: 21/03/2005
by Maxim SmirnovMore...
In this paper we present a method that uses Genetic Algorithms (GAs) to find a Local Iterated Function System (LIFS) that encodes a single image. By doing this, the time needed to achieve this LIFS is reduced by about compared with Barnsley's method if similar image quality is desired. If less quality is acceptable, using a GA we can vary the time the encoding will take by changing parameters such as population size and number of generations allowed.
Created: 12/09/2004
by Mark NelsonMore...
A dissertation by Keith Howell which evaluates the suitability of Fractal Compression for spacecraft images. Keith says he is willing to supply source code upon request.
Created: 29/07/2004
by Mark NelsonMore...
I'm pleased to announce with my co-authors availability of a preprint on our new algorithm to estimate the Shannon entropy rate (bits/symbol) or (bits/sec) of an observed sequence of low-alphabet symbols. It uses the Context-Tree-Weighting universal compression method, but doesnot use the compression ratio directly as an entropy estimator but as a scaffold for a Bayesian estimate. The result is significantly lower bias.
Created: 10/07/2004
by Mark NelsonMore...
An updated and translated version of our German paper "Proseminar Datenkompression - Arithmetische Kodierung" from 2001. To the best of our knowledge, it is the first comprehensive paper that describes the whole way from the basic principles of AC up to a simple implementation, fully documented with C++ source code.
Created: 06/06/2004
by Mark NelsonMore...
by Ziv and Lempel. The seminal LZ78 paper which spawned LZW, GIF, and an entire academic industry.
Update 2004: Document is now packed in RAR format.
Created: 16/05/2004
by Mark NelsonMore...
The 1977 paper describing an algorithm for compression using pointers to previously seen text. This algorithm, later known as LZ77, is still one of the most widely used techniques for lossless data compression in use today.
Update 2004: Document is now packed in RAR format.
Created: 16/05/2004
by Mark NelsonMore...
This group at Loughborough University in the UK would like to use sophisticated compression techniques in high speed networks. To make it all happen, they need to do it in hardware, and do it in parallel. This page has information about their efforts, along with links to papers and other information.
Created: 15/05/2004
by Mark NelsonMore...
From Microsoft: This document describes Binary Delta Compression (BDC) technology and its use in software update deployment. This implementation of BDC technology developed by Microsoft reduces the download size of software update packages by downloading only the differences between old files and new files.
Created: 04/04/2004
by Mark NelsonMore...
A paper by Diego Santa Cruz, Touradj Ebrahimi, Mathias Larsson, Joel Askelof and Charilaos Cristopoulos. According to the abstract, this paper discusses ways to decode regions of an image on the fly without needing to do a complete decode of the entire image.
Created: 07/03/2004
by Mark NelsonMore...
by Maryline Charrier, Diego Santa Cruz and Mathias Larsson. This short paper gives an overview of the JPEG2000 standard, along with a java decoder.
Created: 07/03/2004
by Mark NelsonMore...
by Diego Santa-Cruz and Touradj Ebrahimi. A paper to be published in Proc. of the IEEE International Conference on Image Processing (ICIP), September, 2000.
Created: 07/03/2004
by Mark NelsonMore...
A paper by Timothy McLaughlin that gives an overview of HTTP Compression and tries to decide whether it's a good thing for not.
Created: 07/03/2004
by Mark NelsonMore...
A paper by Eduardo Enrique Gonzalez Rodriguez that describes a proposed new method of entropy encoding. Eduardo overcomes some of the problems faced by Huffman coding in certain circumstances, such as a very small alphabet.
Created: 05/10/2003
by Mark NelsonMore...
A survey paper that covers the current roster of popular encoding algorithms, such as Base64, Base32, Hex, etc.
Created: 01/10/2003
by Mark NelsonMore...
by Jurgen Abel and William Teahan. This paper looks at a few different techniques for preprocessing data before performing text compression, and compares the gains achieved when combining the preprocessors with PPM, BWT, and LZ algorithms.
Created: 19/07/2003
by Mark NelsonMore...
Robert Estes (UCDavis - image compression research). Robert has pointers to many of his papers on this page. He has info on Wavelet-based image compression, image compression, plus some peripherally related topics such as image quality.
Created: 26/05/2003
by Mark NelsonMore...
Current research projects in our lab include research into vector quantization (VQ), wavelets, image compression, edge detection using VQ, VQ for image browsing, VQ design for noisy channels, halftoning, and color palette management.
Created: 04/05/2003
by Mark NelsonMore...
By Jesper Larsson. Published in Proceedings of the IEEE Data Compression Conference 1998, a PDF version of the paper.
Created: 26/04/2003
by Mark NelsonMore...
by Jesper Larsson. Technical report. LU-CS-TR:98-204, LUNFD6/(NFCS-3134)/1-6/(1998). A PDF version of the paper.
Created: 26/04/2003
by Mark NelsonMore...
By Jesper Larsson, Arne Andersson and Kurt Swanson. Technical report. LU-CS-TR:95-158, LUNFD6/(NFCS-3107)/1-14/(1995). Early version of a paper published in Proceedings of the 7th Annual Symposium on Combinatorial Pattern Matching 1996; to appear in Algotrithmica
Created: 26/04/2003
by Mark NelsonMore...
Jesper Larsson spent a fair amount of his years in academia
studying Suffix Trees. His home page has links to his thesis and a few other papers on suffix trees and other string matching/Data Compression topics.
Created: 23/04/2003
by Mark NelsonMore...
This is another German preprint of Jurgen Abel describing the principles of the Burrows-Wheeler Compression Algorithm. An
implementation of a BWT based compressor with a compression rate of 2.25 bps on the Calgary Corpus is presented. The paper will be
published in the German journal "Informatik - Forschung und Entwicklung", Springer-Verlag Heidelberg, Association for Informatics
(Gesellschaft fur Informatik eV.)
Created: 14/04/2003
by Mark NelsonMore...
This preprint of Jurgen Abel gives a short introduction into the BWCA field and proposes several improvements for the WFC stage and the IF stage.
It further introduces a new RLE scheme for bypassing the run length
information around the WFC stage. The paper is the basis of the
BWCA program ABC and the revealed approach achieves a
compression rate of 2.238 bps, which is the best result for a pure
BWCA without any preprocessing before the BWT to date (March 2003).
Created: 22/03/2003
by Mark NelsonMore...
The whole contents of the journal, dating back to 1953. If only you were a member of the society, you could read all these great papers.
Created: 21/03/2003
by Mark NelsonMore...
Context Tree Weighting (CTW) has been a technique with great promise, but it hasn't ever been able to reach the critical mass needed to become more than a curiousity. Jurgen Abel is doing his best to overcome that problem. He's created a nice reference page for CTW on his data-compression.info web site. He has references to a few papers, a few people, and one piece of source code.
Created: 21/03/2003
by Mark NelsonMore...
This is a preprint of a paper by Jurgen Abel describing the functionality
of a basic but quite fast Burrows-Wheeler compressor. Jurgen reports that this will be published in PIK, a German-language journal on Communications and Information Theory.
Created: 21/03/2003
by Mark NelsonMore...
Jurgen Abel has done an enormous amount of research on the Burrows-Wheeler Transform, and has published the results on his web site. On this page you will find:
A summary of this compression technique.
Links to over 70 online papers.
Links to at least that many people involved in BWT research or development.
Extensive links to BWT source code.
This web page may now be the definitive source of information for this field.
Created: 18/03/2003
by Mark NelsonMore...
Filip is interested in wavelets and image restoration. In addition to a nice page of links (navigate from the main page) he has some software, publications, test results, and more.
Created: 17/03/2003
by Mark NelsonMore...
John Robinson's page on BTPC, which includes documentation, samples, links, and source. BTPC is designed to do both lossy and lossless compression of images.
John updated this package to version 5 in March, 2003.
Created: 13/03/2003
by Mark NelsonMore...
Anand Jain discusses his new technique for arithmetic coding. An executable file is included, but no source code.
Created: 22/02/2003
by Mark NelsonMore...
The JPEG 2000 committee publishes links to some of their documents here. (Notably absent: the standards themselves.) This includes links to standard drafts, press releases, and requirements.
Created: 19/01/2003
by Mark NelsonMore...
The papers pointed to here provide useful background information on the standard. There are seminar papers, tutorials, and other background documents.
Created: 19/01/2003
by Mark NelsonMore...
A short paper with some links that describe the compression algorithm used with Microsoft Portrait, which is a demonstration project for video telephony.
Created: 19/01/2003
by Mark NelsonMore...
1997, Robert Giegerich, Stefan Kurtz. We review the linear time suffix tree constructions by Weiner, McCreight, and Ukkonen. We use the terminology of the most recent algorithm, Ukkonen's online construction, to explain its historic predecessors. The submitter of this paper indicates that it has user-friendly terminology, always welcome in Journal papers.
Created: 16/11/2002
by Mark NelsonMore...
SEQUITUR is a method for inferring compositional hierarchies from strings. It detects repetition and factors it out of the string by forming rules in a grammar. The rules can be composed of non-terminals, giving rise to a hierarchy. It is useful for recognizing lexical structure in strings, and excels at very long sequences.
Created: 09/11/2002
by Mark NelsonMore...
The abstract for a paper on calculation of Huffman codes. The paper isn't here, but the source code is. Alistair says that if you sort your array of counts, you can create the Canonical Huffman tree in memory.
Created: 31/10/2002
by Mark NelsonMore...
This paper describes a method which creates minimal acyclic deterministic FSAs. These can be used to create an efficient representation of a dictionary of words by merging identical subtrees.
Created: 30/10/2002
by Mark NelsonMore...
A bibliography containing links to numerous papers on video coding, with links to authors as well. This is a chapter from the massive Annotated Computer Vision Bibliography.
Created: 29/09/2002
by Mark NelsonMore...
A giant bibliography on Fractal Image Compression. Links to online versions of some, but far from all, papers.
Created: 29/09/2002
by Mark NelsonMore...
by Gerry Melnikov, Aggelos K. Katsaggelos. In this paper a hybrid fractal and Discrete Cosine Transform (DCT) coder is developed.
Created: 28/09/2002
by Mark NelsonMore...
Fractal image compression is a promising new technology that may successfully provide a codec for PC-to-PC video communications. Unfortunately, the large amount of computation needed for the compression stage is a major obstacle that needs to be overcome. This paper introduces the Fast Fractal Image Compression algorithm, a new approach to breaking the "speed problem" that has plagued previous efforts.
Created: 28/09/2002
by Mark NelsonMore...
by John D. Villasenor, Benjamin Belzer, and Judy Liao. A paper discussing the choice of filter bank for wavelet compression. PS format only.
Created: 15/09/2002
by Mark NelsonMore...
Links to the articles from this journal dating back to 1988. If you're not an IEEE member, you only get to see the abstracts! Membership in the IEEE plus a few extra bucks lets you download here to your heart's content.
Update: Site has moved due to redesign.
Created: 03/09/2002
by Mark NelsonMore...
This page has a number of papers related to the DjVu compression format. Papers are all posted in ps.gz and DjVu formats. Note that you'll need the DjVu plugin to read them in that format.
Created: 03/09/2002
by Mark NelsonMore...
This page contains links to a wide variety of Ogg Vorbis documents, including specifications, programming guides, and technical discussions.
Created: 07/08/2002
by Mark NelsonMore...
A nice set of papers on audio coding. Includes goodies such as the ISO standards on MPEG-2 part 3 audio coding.
Created: 07/08/2002
by Mark NelsonMore...
by P.G> Howard and J.S. Vitter. This paper from DCC '92 describes an improved method for error modeling in hierarchical lossless image compression.
Created: 28/07/2002
by Mark NelsonMore...
by D. T. Hoang, P. M. Long, and J. S. Vitter. This paper from DCC '94 looks at different methods for selecting motion vectors in low bit-rate video encoders.
Created: 28/07/2002
by Mark NelsonMore...
Graham Fyffe proposes an alternative type of variable length coding that he claims will offer improved efficiency. Full description, no implementation.
Created: 18/07/2002
by Mark NelsonMore...
by Kunihiko Sadakane, Hiroshi Imai. This paper proposes two new methods of performing fast string matching in LZ77 compression. One method uses a new hashing algorithm, the other uses suffix sorting.
Created: 16/07/2002
by Mark NelsonMore...
This paper by Aleks Jakulin discusses the artifacts that show up in images compressed using the JPEG and JPEG2000 algorithms. He includes some nice sample images to illustrate what he is talking about.
Created: 01/07/2002
by Mark NelsonMore...
Daniel Lemire is a researcher at the National Research Council of Canada. Some of his recent research interest is on wavelet-based prefix sum methods for On-Line Analytic Processing (OLAP). Some of his publications are relevant to compression techniques including a white paper on image compression by wavelets.
Created: 05/06/2002
by Mark NelsonMore...
by Subhasis Saha. A typical overview article, devoted to image compression, concentrates on JPEG and Wavelet-based methods. I don't know where this article appeared, but it looks to be peer-reviewed quality.
Created: 01/05/2002
by Mark NelsonMore...
First sentence of the abstract says it all: We describe a new image coding approach in which a 4-ary arithmetic coder is used to represent significant coefficient values and the lengths of zero runs between coefficients
Created: 01/05/2002
by Mark NelsonMore...
Leon has a very nice set of publications online here, generally in both DjVu and PS format. A total of 45 papers with titles such as "Managing drift in DCT-based scalable video coding" and "DjVu : Analyzing and Compressing Scanned Documents for Internet Distribution.". Bless those who make their work available to the world on the Internet!
Created: 01/05/2002
by Mark NelsonMore...
An academic journal focusing on imaging and compression. It appears that all the papers published in this journal are available on line in PDF format!
Created: 19/04/2002
by Mark NelsonMore...
An article in Wired that talks about the use of compression for patten recognition. Tools as simple as Zip are able to accurately determine the language of a given text. More complex algorithms are being used to determine matches of DNA sequences.
Created: 11/04/2002
by Mark NelsonMore...
P. G. Howard and J. S. Vitter. ``Fast and Efficient Lossless Image Compression,'' Proceedings of the 1993 IEEE Data Compression Conference (DCC '93), Snowbird, UT, April 1993. The abstract describes this as a lossless compression algorithm that provides results as good as JPEG in lossless mode with five times the speed.
Created: 08/04/2002
by Mark NelsonMore...
P. G. Howard and J. S. Vitter. ``Fast Progressive Lossless Image Compression,'' Proceedings of the 1994 IST/SPIE International Symposium on Electronic Imaging Science and Technology, San Jose, CA, February 1994. This paper describes a technique that combines the author's previously published FELICS algorithm with the progressivity of the MLP method.
Created: 08/04/2002
by Mark NelsonMore...
by P. G. Howard and J. S. Vitter. ``Arithmetic Coding for Data Compression,'' Proceedings of the IEEE, 82(6), June 1994, 857-865. This paper describes arithmetic coding, and introduces a technique that uses table lookups to make the process more efficient.
Created: 08/04/2002
by Mark NelsonMore...
R. Grossi and J. S. Vitter. ``Compressed Suffix Arrays and Suffix Trees, with Applications to Text Indexing and String Matching,'' Proceedings of the 32nd Annual ACM Symposium on Theory of Computing (STOC '00), Portland, OR, May 2000, 397-406.
Created: 08/04/2002
by Mark NelsonMore...
J. S. Vitter. ``Design and Analysis of Dynamic Huffman Codes,'' Journal of the ACM, 34(4), October 1987, 825-845. Full paper in PDF and Postscript format.
Created: 07/04/2002
by Mark NelsonMore...
This paper by Paul Howard and Jeff Vitter goes over some of the basics of Arithmetic Coding, then outlines a coder that has increased efficiency by virtue of substituting table lookups for expensive arithmetic operations.
Created: 07/04/2002
by Mark NelsonMore...
This paper by Jyh-Han Lin and Jeff Vitter outlines new VQ algorithms based on linear programming. According to the abstract, the algorithm is the first known polynomial-time codebook design algorithm.
Created: 07/04/2002
by Mark NelsonMore...
by Paul Howard and Jeff Vitter. Here's what they have to say about this paper from the abstract: Our algorithm, related to the PPM method, simplifies the modeling phase by eliminating the escape mechanism, and speeds up coding by using a combination of quasi-arithmetic coding and Rice coding. We provide details of the use of quasi-arithmetic code tables, and analyze their compression performance. Our Fast PPM method is shown experimentally to be almost twice as fast as the PPMC method, while giving comparable compression..
Created: 07/04/2002
by Mark NelsonMore...
A nice white paper of H.26L by the folks at UBVideo. The H.26L standard is a work in progress by both the ITU ISO, and is expected to provide outstanding performance in video coding.
Created: 19/03/2002
by Mark NelsonMore...
A white paper discussing the history and various permutations of the H.263 video encoding standard from the folks at UBVideo.
Created: 19/03/2002
by Mark NelsonMore...
A novel algorithm by Stefano Lonardi. It recursively replaces words in text with pointers, and boasts of good results. Source and papers regarding Off-Line can all be found here.
Created: 04/03/2002
by Mark NelsonMore...
UCSB research activities, including speech coding, audio compression, video coding. Lots of links to demos and publications.
Created: 22/02/2002
by Mark NelsonMore...
A paper by Stefan Kuhr describing a technique that he developed. It improves performance when decoing using the IJG library.
Created: 17/02/2002
by Mark NelsonMore...
by Dmitry Shkarin. Dmitry is the author of PPMd, a well know compressor. PPM is a powerful compression technology which has a reputation for being resource intensive. In this paper Dmitry adds bells and whistles to basic PPM and gets fantastic results.
Reader jcp said: Big problem: algorithm very slow, it's impractical to search anything inside of compressed files (compared with gzip), .. But it's good to send/receive letters of e-mail, well compressed!!!
Created: 09/02/2002
by Mark NelsonMore...
A paper describing a way to brute-force ones way through PKZip's encryption - requires a few hundred bytes of known plaintext.
Created: 03/02/2002
by Mark NelsonMore...
A paper by Michael W. Marcellin, Michael J. Gormish, Ali Bilgin, Martin P. Boliek that was published in the 2000 Proceedings of the DCC. Good information of only moderate complexity in an academic format.
Created: 30/01/2002
by Mark NelsonMore...
The wavelet group has links to books, publications, and people doing wavelet things. Link updated to new location January 2002
Created: 25/01/2002
by Mark NelsonMore...
by Peter Fenwick, Department of Computer Science, The University of Auckland.
In his work on the information content of English text in 1951, Shannon described a method of recoding the input text, a technique which has apparently lain dormant for the ensuing 45 years. Whereas traditional compressors exploit symbol frequencies and symbol contexts, Shannon's method adds the concept of "symbol ranking", as in `the next symbol is the one third most likely in the present context'.
Created: 19/01/2002
by Mark NelsonMore...
This site contains a collection of comments, impressions, comparisons, experiments and ideas regarding the PNG (Portable Network Graphics) file format, and related technologies: zlib (Data Compression Library), and MNG (Multiple-image Network Graphics). It is focused especially on the compression algorithms used in PNG.
Created: 08/01/2002
by Mark NelsonMore...
An article by Giovanni Motta and friends that appears in the Encyclopedia of Computer Science. Packs a lot of information into a short 8 or so pages.
Reader sreenu says: excellent article.
Created: 02/01/2002
by Mark NelsonMore...
CiteSeer tracks references between academic papers, and has links to online versions of a great many papers. The compression links at the top level look like they are a bit sloppy, but there are subcategories for Audio, Text, and Video, which look as though they may be better. Anyway, one can find useful papers using cross-linking and external searching.
Created: 02/01/2002
by Mark NelsonMore...
This page has links to online versions of Hirosuke Yamamoto's papers on data compression. Papers here on block sorting, coding, and more. The papers are all published in English.
Created: 01/01/2002
by Mark NelsonMore...
M. Purat, T. Liebchen, P. Noll: Lossless Transform Coding of Audio Signals. 102nd AES Convention, Munich, 1997. A PostScript file.
Created: 01/01/2002
by Mark NelsonMore...
A paper by Howard and Vitter, available in both PDF and PS formats. The paper provides an analysis of the effect that models and various implementations of arithmetic coding have on compression.
Created: 30/12/2001
by Mark NelsonMore...
Many publications spanning a wide variety of image processing topics. Dr. Po has papers here on both wavelet and fractical compression, motion estimation, etc.
Created: 30/12/2001
by Mark NelsonMore...
by Frans M.J. Willems, Yuri M. Shtarkov, and Tjalling J. Tjalkens, IEEE Trans. Inform. Theory, Sept. 1996. A postscript version of this overview paper.
Created: 17/12/2001
by Mark NelsonMore...
One page poster for Frans M.J. Willems, Yuri M. Shtarkov, and Tjalling J. Tjalkens, "The Context Tree Weighting Method: Basic Properties," IEEE Trans. Inform. Theory, May 1995. (Postscript)
Created: 17/12/2001
by Mark NelsonMore...
Atlanta Signal Processor, Inc., is nice enough to host his paper on their site. It gives a brief overview of the MELP Vocoder algorithm.
Created: 15/12/2001
by Mark NelsonMore...
The very long title of this PDF format paper pretty much obviates the need for any more description on my part.
Created: 15/12/2001
by Mark NelsonMore...
Geoff Davis's home page. Geoff has published several of his wavelet papers on the web, including an interesting analysys of Fractal compression in terms of wavelets. At one time Geoff Davis was responsible for a Wavelet toolkit, but it isn't on his new site. Please forward pointers is you find it.
Created: 07/12/2001
by Mark NelsonMore...
Technical report CUED/F-INFENG/TR.156 , Cambridge University Engineering Department, by Tony Robinson. A somewhat terse technical report describing some of the theory behind the design of the Shorten program.
Created: 07/12/2001
by Mark NelsonMore...
Les Atlas at the University of Washington has developed a new method for coding speech and music for transmission over slow lines. The technique, called fine-grain scalable audio encoding, prioritizes the most important part of the signal, allowing reconstruction at the remote end even if the face of lost packets.
Created: 05/12/2001
by Mark NelsonMore...
Latest compression papers here. When this site was first crawled, the papers covered a wide variety of topics in both lossless and lossy compression.
Created: 29/11/2001
by Mark NelsonMore...
A paper by Alistair Moffat describing an improvement on Peter Fenwick's method for maintaining cumulative probability tables. The pointer on this page leads to some source implementing said table.
Created: 12/11/2001
by Mark NelsonMore...
A set of handouts for a survey course on Data Compression, inlcuding papers on various types of coding, information theory, and more. An academic approach to the topics but quite accessible.
Created: 05/11/2001
by Mark NelsonMore...
A paper by Mark D. Schroeder at the University of North Dakota. This paper talks a bit about implementation issues for JPEG.
Created: 07/10/2001
by Mark NelsonMore...
Apple has published a tech note describing the differences between these two file formats, which on the face of it ought to be identical.
Created: 14/09/2001
by Mark NelsonMore...
A giant bibliography with pointer to on-line papers, as well as some basic information on the state of the art. Not for the timid or the beginner.
Created: 27/08/2001
by Mark NelsonMore...
This paper discusses the design and development of a sofware MPEG decoder, which presumably is the one that can be found in other links to the Berkely MPEG group. Paper available in both Postcript and PDF.
Created: 25/08/2001
by Mark NelsonMore...
According to the abstract, this is the first paper to describe realtime decompression of an MPEG stream at 30 fps. It's timestamped 9/98, but I believe the paper was published in 1994.
Created: 25/08/2001
by Mark NelsonMore...
The deflate technical specification in HTML, by L. Peter Deutsch. This document is the basis for RFC 1951. There are many links to this document stored in various places, but this database will only have links to the versions at www.cdrom.com and the IETF pages.
Created: 23/08/2001
by Mark NelsonMore...
This issue of the DDJ newsletter talks about bijective coding, a coding technique that is fearlessly promoted by David Scott, a regular presence in comp.compression.
Created: 11/08/2001
by Mark NelsonMore...
This issue of the newsletter talks about randomness, complexity, and a long-time favorite, recursive compression.
Created: 11/08/2001
by Mark NelsonMore...
Another paper by Michael Maniscalo discussing a scheme for Run Length Encoding of data that's been put through the blocksort transform. His first paper uses fixed length codes, this uses variable lenghts.
Created: 21/06/2001
by Mark NelsonMore...
This page gives short descriptions of MPEG and H.261 in German. Different systems of coding color information are described as well. Brief, but useful if you don't want to get bogged down in details.
Created: 20/06/2001
by Mark NelsonMore...
Andrey Fomin has posted a long Russian paper in word format on his web site. You'll need a Microsoft Passport (free) to get here, but it's worth the trip if you can read it.
Andrey has this to say about his paper/book:
A survey of different basic Data Compression techniques. Uses some mathematical and statistical materials, but doesn't require a
special scientific degree. Though written in Russian, this book contains VERY useful References, Examples and Source Codes!
Created: 11/01/2001
by Mark NelsonMore...
Links to the papers of Tilo Strutz, concentrating on wavelet based compression. Some papers published in German. Note that Tilo Strutz has added a textbook on image compression to his list of publications.
Created: 04/01/2001
by Mark NelsonMore...
A paper by Sebastian Deorowicz. Implementing the BWT transform is nice and simple, but what you do with the transformed data is where all the action is. Traditionally, we use Move To Front followed by an entropy encoder. Sebastian talks about some alternatives that help compression.
Created: 29/11/2000
by Mark NelsonMore...
Lots of information on this site concerning compression of still pictures, moving pictures, and audio. Much of the site is devoted to comparisons of a few different algorithms. A report section draws a few conclusions about the whole thing.
Created: 25/10/2000
by Mark NelsonMore...
Complexity International is a refereed journal for scientific papers dealing with any area of complex systems research.
Created: 22/10/2000
by Mark NelsonMore...
Michael Schindler's page describing a sorting algorithm that was presented in a poster session at DCC '97. Links to his source code, plus links to the paper and poster in postscript format. Update: Michael has made additional source code available.
Created: 28/09/2000
by Mark NelsonMore...
This bibliography is the best attempt at a complete record of all the written works of Claude Shannon, father of Information Theory.
Created: 23/09/2000
by Mark NelsonMore...
by Andrei Khodakovsky, Peter Schroder, Wim Sweldens. Compressing 3-D object representations is a good thing. Even better is allowing progressive display of these objects. The techniques in this paper are to be applied to geometric meshes.
Created: 08/09/2000
by Mark NelsonMore...
A short paper discussing various preprocessing techniques that can be used in BWT-based compression programs. RTF format.
Created: 06/09/2000
by Mark NelsonMore...
A very gentle introduction to Shannon's discrete entropy, written for moleculary biologists.
Reader Mike said To the point explanation of the "why" of some of the math. This allows you to look a symbols and understand the _meaning_ not just the math..
Created: 27/08/2000
by Mark NelsonMore...
A collection of free TIFF software. This may help you decode files in TIFF format. Includes some documentation files.
Created: 20/08/2000
by Mark NelsonMore...
Pierre Moulin has authored quite a few papers on video and image compression. Unfortunately, the full text is available for only a few papers. Perhaps you could send Pierre an email asking him to put more stuff online.
Created: 20/08/2000
by Mark NelsonMore...
Descriptions of various speech codecs, include G.711, G.721, GSM, and CELP. Each codec gets a brief description plus pointers to additional material and source code.
Created: 20/08/2000
by Mark NelsonMore...
Juergen writes a lot of papers. Lots and lots of papers. Looks like lots of stuff on neural nets and the like.
Created: 01/08/2000
by Mark NelsonMore...
The title says A website devoted to the principles and practices of data compression. Can't ask for anything better than that. Appears to be run by Nam Phamdo from SUNY Stony Brook. Links to papers, explanations, and software.
User comment:Gives technical explanations and animated examples of a small number of compression methods. The explanations are good, but the amount of coverage of the field overall is small.
Created: 29/07/2000
by Mark NelsonMore...
by Claude Shannon. This paper is generally credited with being the opening kickoff of the science of Information Theory.
Created: 29/07/2000
by Mark NelsonMore...
According to the publisher, this contains all of Shannon's published works, plus papers that were previously unpublished for various reasons, including wartime secrecy.
Created: 29/07/2000
by Mark NelsonMore...
A couple of programs using neural networks for compression, along with a couple of papers by the author. This area of data compression is definitely underserved, check out what's here and see if neural networks deserve more attention than they are getting.
Update: This page appears to now have some links to general lossless benchmarking info.
Created: 17/07/2000
by Mark NelsonMore...
Douglas W. Jones, University of Iowa Department of Computer Science. This page contains some C source for a Splay Tree algorithm.
Created: 15/07/2000
by Mark NelsonMore...
The Signal Processing and Coding Laboratory (SPACL) at The University of Arizona has some papers on line, plus some information on their current projects. They seem to be interested in wavelets, quantization, and signal coding.
A DCL reader complained: Very little useful information on the website. On topic, but not helpful at all.
Created: 15/07/2000
by Mark NelsonMore...
A project at Cambridge University that led to a low bit rate speech coder that was used in the HP 620LX Palmtop PC.
Created: 15/07/2000
by Mark NelsonMore...
A paper by Sebastian Deorowicz appears here, which claims to make improvements in efficiency to BWT compression, giving the best ratios of any BWT program on the Calgary Corpus.
Created: 09/07/2000
by Mark NelsonMore...
PhD thesis by Franck Davoine, 20 December 1995. The key to a good fractal compression algorithm is the method by which it breaks down segments of an image into smaller pieces, called partitioning. Davoine introduces a partitioning method based on Delaunay triangulation.
Created: 04/07/2000
by Mark NelsonMore...
Home of the WaveStat algorithm, which appears to be a fairly conventional wavelet-based image compressor. Links on this page to a paper describing the technique, but no code as far as I can tell.
Created: 04/07/2000
by Mark NelsonMore...
Wil Osberger has published a bunch of papers dealing the perceptual image assessment. You can see most of them here. While this isn't strictly a compression topic, it is obviously of critical importance to lossy compression techniques.
Created: 04/07/2000
by Mark NelsonMore...
A big selection of the published papers of Peter Cherriman, concentrating on H.26x coding and other low bit rate video techniques.
Created: 04/07/2000
by Mark NelsonMore...
A bibliography of Pamela C. Cosman's papers, with links to many that are available on line. Many papers on wavelet-based and VQ image compression, along with a few miscellaneous others.
Created: 02/07/2000
by Mark NelsonMore...
This ftp site has copies of a number of papers by Christopher M. Brislawn. They include various ocuments on the FBI fingerprint compression standard, and a copy of the standard itself.
Created: 02/07/2000
by Mark NelsonMore...
Martin Cohn's home page contains links to several of his compression papers, which seem to be concentrated on lossless compression issues.
Created: 02/07/2000
by Mark NelsonMore...
Adaptive VQ describes a way to get asymptotically optimal results from VQ while adapting to changes in input statistics.
Created: 02/07/2000
by Mark NelsonMore...
Suzanne Bunton's PhD thesis. A recent post to comp.compression said that Bunton had used floating point math for arithmetic coding. I haven't verified this, but it sounds worth a look.
Created: 24/06/2000
by Mark NelsonMore...
Vladimir has pointers to a batch of his papers here, which seem to concentrate on image compression, including VQ and Wavelet based compression. Plus many links to image compression, fractal, and wavelet pages.
Created: 03/06/2000
by Mark NelsonMore...
This IBM research paper looks into the performance of compressors in relation to the compressibility of the the objects they are compressing. At least I think that's what it is talkinga bout.
Created: 03/06/2000
by Mark NelsonMore...
IBM's implementation of arithmetic coding known as the Q-coder is a well-known piece of work. (Patented, unfortunately.) This paper discusses an implementation of the Q coder in hardware.
Created: 03/06/2000
by Mark NelsonMore...
All of the papers from the 1999 Data Compression Conference are available here on line. Yes, there is a catch - you will have to pay if you want to do more than read the abstract. (Looks like a strong incentive to become a member of the IEEE Computer Society.)
Created: 03/06/2000
by Mark NelsonMore...
A paper from SID Digest by Ahumada, A. J., Jr.and Rensheng Horng. Getting rid of those annoying DCT artifacts is obviously a big deal for those of you using JPEG and MPEG compression.
Created: 04/05/2000
by Mark NelsonMore...
A paper by Andrew Watson from NASA Ames Research Center. This paper describes a procedure called DCTune which is claimed to create DCT quantization matrices superior to the default ones used in the JPEG algorithm.
Created: 04/05/2000
by Mark NelsonMore...
This paper from the Mathematica Journal gives some introductory material on using Mathematica to do a bit of image compression.
Created: 04/05/2000
by Mark NelsonMore...
Woontack Woo has published quite a few papers dealing with coding and compression of stereo images. This is one of the few places you can find information on this somewhat esoteric subject.
Created: 09/04/2000
by Mark NelsonMore...
by D. Baron and Y. Bresler. Postscript paper describing a technique that uses BWT to preprocess a sequence of symbols, then mung.
Created: 28/03/2000
by Mark NelsonMore...
Crossroads is the student publication of the ACM. This issue concentrates on data compression, and includes several very worthwhile articles.
Created: 27/02/2000
by Mark NelsonMore...
An article from the ACM Crossroads student publication. This article shows how to use some of the built-in classes in the JDK to create and access compressed files and streams.
Created: 27/02/2000
by Mark NelsonMore...
by David Salomon. This article discusses an approach to the unique problem of compression sparse binary strings.
Created: 27/02/2000
by Mark NelsonMore...
Laurent Balmelli from the neutral country of Switzerland has some C++ code to do some interesting quadtree processing. He would like you register if you are going to use the code, but this is not mandatory. This code is being used by over 100 registered users, and has at least been accessed by as many as 2500 others.
Created: 24/02/2000
by Mark NelsonMore...
A new DCT based moving pictures compression algorithm. The authors claim results far superior to that of MPEG.
Note: this paper will cost you $35.00. Abstract only is free.
Created: 15/02/2000
by Mark NelsonMore...
Edward J. Delp describes his wavelet-based coding system for color images using a luminance/chromanance color space.
Created: 23/01/2000
by Mark NelsonMore...
Edward J. Delp and pals have created a rate-scalable video codec that uses a wavelet based codec with motion compensation.
Created: 23/01/2000
by Mark NelsonMore...
Edward J. Delp et.al. have tackled the problem of perfomring video compression using parallel processing techniques.
Created: 23/01/2000
by Mark NelsonMore...
Edward J. Delp has a big batch of papers on line. This includes a big batch that deal with Image and Video compression.
Created: 23/01/2000
by Mark NelsonMore...
Antonio Ortega at USC has quite a few compression related papers on line. They include papers on wavelets and quantization.
Created: 23/01/2000
by Mark NelsonMore...
NASA says that this data compression algorithm is the Government Invention of the year. It is apparently an algorithm that can be used by spacecraft with limited transmission bandwidith.
Created: 16/01/2000
by Mark NelsonMore...
These folks at UCSB are encoding speech at 2.8 Kbps. It sounds very good considering the bit rate. Links here to a presentation and an abstract, as well as some samples.
Created: 13/01/2000
by Mark NelsonMore...
This paper appears to discuss the concept of compressing images by breaking them down into small pieces and doing some sort of optimal compression on each piece. I assume the trick here is to locate the point of diminishing returns.
Created: 13/01/2000
by Mark NelsonMore...
EPIC is a lossy image compression program. It uses subband decomposition followed by an entropy encoder to get its job done. This page has links to the code, papers, references, and more.
Created: 01/01/2000
by Mark NelsonMore...
CREW is a compression algorithm developed by RICOH that has been offered up to the JPEG working group as a potential standard. This page contains a description of the algorithms, along with samples and related documents.
Created: 01/01/2000
by Mark NelsonMore...
A nice tidy proof that the Huffman tree is an optimal prefix code for a given message. Note that a library reader finds that there is an error in the fixed-length codes listed in Table 1, a and b have duplicate values.
Created: 16/12/1999
by Mark NelsonMore...
An Optimizing Hybrid LZ77 RLE Data Compression Program, aka Improving Compression Ratio for Low-Resource Decompression by Pasi Ojala.
Presents a new literal tagging system, a fast exhaustive string
match algorithm, an optimal parsing algorithm, and results on
Calgary Corpus and Canterbury Corpus.
Created: 13/12/1999
by Mark NelsonMore...
by Xiaolin Wu, University of Western Ontario. A paper on the CALIC algorithm, first presented at DCC 96. This file is in PS format.
Created: 03/12/1999
by Mark NelsonMore...
by Xiaolin Wu, University of Western Ontario. A paper on high performance lossless image compression. I'm not sure if this paper was a predecessor to CALIC/JPEG-LS or not.
Created: 03/12/1999
by Mark NelsonMore...
by Paul Glor Howard, Brown University, 1993. This thesis looks at statistical methods for performing data compression.
Created: 03/12/1999
by Mark NelsonMore...
Home Page for Steve Tate, University of North Texas. State Tate is an Associate Professor at the University of North Texas, and has links to a few data compression papers on his site.
Created: 27/11/1999
by Mark NelsonMore...
Ioannis Kontoyiannis is an Assistant Professor at Purdue. He has many of his papers available on line, in PS or PDF format, often both. Sample title: Pointwise redundancy in lossy data compression and universal lossy data compression..
Created: 27/11/1999
by Mark NelsonMore...
This master's thesis is available in PS format here. The abstract entry is in English, the Postscript document is in Spanish.
Created: 21/11/1999
by Mark NelsonMore...
Links to lots of info regarding the audio compression portions of the MPEG standards. This includes an overview, the MPEG Audio FAQ, pointers to resources, some free software, and test bitstreams.
Created: 19/11/1999
by Mark NelsonMore...
As it says, this is the home site of the committees. You need to be a member to access some areas, but there is plenty of information here for the common man as well.
Created: 19/11/1999
by Mark NelsonMore...
by P.C. Cosman, R.M. Gray, and M. Vetterli, IEEE Transactions on Image Processing, February 1996. They discuss using VQ in combination with subband and wavelet decompositions.
Created: 14/11/1999
by Mark NelsonMore...
A journal that publishes papers dealing with randomness, pseudorandomness, and Kolmogorov complexity, among other things. Their home page features an on line listing of every paper published in this journal since 1957, with a search tool.
Created: 14/11/1999
by Mark NelsonMore...
Short for Lempel-Zif-Welsh, a popular data compression technique developed in 1977 by J. Ziv and A Lempel, and later refined by T. Welsh. It is the compression algorithm used in the GIF graphics file format, which is one of the standard graphic formats used by CompuServe and the World Wide Web.
Created: 14/11/1999
by Mark NelsonMore...
What's going on with PNG? Inquiring minds want to know! Note that this page includes links to the history pages for previous years, currently 1995-1998
Created: 14/11/1999
by Mark NelsonMore...
Tony Robinson - Technical report CUED/F-INFENG/TR.156. A report on an audio compression algorithm that relies on compression of the waveform with Huffman compression of the residuals.
Created: 14/11/1999
by Mark NelsonMore...
by Keren O. Perlmutter, Sharon M. Perlmutter, Robert M. Gray, Richard A. Olshen, and Karen L. Oehler IEEE Transactions on Image Processing, February 1996). We investigate several VQ-based algorithms that seek to minimize both the distortion of compressed images and errors in classifying their pixel blocks.
Created: 13/11/1999
by Mark NelsonMore...
This page has references to a huge number of Wavelet-related research papers. I don't know how many of them are online, but it is quite a treasure trove for the curious.
Created: 11/11/1999
by Mark NelsonMore...
The dissertation itself, in PS format, along with code used in the dissertation. The code implements k-ary arithmetic compression.
Created: 07/11/1999
by Mark NelsonMore...
Some folks at Loughborough U. are working on attempts to integrate data compression more closely with computer hardware. This page has reports on their progress, as well as links to some of their papers.
Created: 07/11/1999
by Mark NelsonMore...
All papers are in postscript format and are gzipped to save space. The filenames correspond to the naming convention established in my extensive Fractal Bibliography. It probably makes most sense to examine the bibliography first (it is updated regularly), before going on a paper downloading spree.
Created: 07/11/1999
by Mark NelsonMore...
Cognition as Compression, by Gerry Wolff, discussion of the author's research.
From as long ago as the last century it has been recognised that information processing in brains and nervous systems (perception, learning, thinking, neuro-muscular control etc) may be governed by principles of economy or compression.
Created: 07/11/1999
by Mark NelsonMore...
Computing as Compression: The SP Theory and the SP System, by Gerry Wolff, discussion of research in progress.
Created: 07/11/1999
by Mark NelsonMore...
This page is intended to provide an explanation of some of the features of the PNG format for non-technical users
Created: 07/11/1999
by Mark NelsonMore...
Our thesis is that high compression efficiency for text and images can be obtained by using sophisticated statistical compression techniques, and that greatly increased speed can be achieved at only a small cost in compression efficiency.
Complete paper is included in PS format.
Created: 07/11/1999
by Mark NelsonMore...
Notes on suffix tree construction, some course notes for COMP 612: Graduate Seminar in Compiler Construction, includes some pointers to important papers.
DataCompression.info user David D. had this complaint: None of the links work on this page, all there is is a short paragraph on suffix trees. I have to agree, it's a rather strange page.
Created: 07/11/1999
by Mark NelsonMore...
A set of links to Cormack's publications. Papers germane to this page include one on DMC and arithmetic compression. Pointers to many other data compression articles which are unfortunately not linked to this page.
Created: 05/11/1999
by Mark NelsonMore...
This is the home page for the Calgary Corpus. This set of files has long been the standard used for comparison of various lossless compression techniques.
Created: 21/09/1999
by Mark NelsonMore...
Links from the Yale Computational Mathematics Group. Contains links to software and papers. Most of the links don't show up in this database as they are not necessarily related to using wavelets for data compression.The papers that appear off this page are all compressed postscript and have not been added to the database.
Created: 18/02/1999
by Mark NelsonMore...
This appears to be an overview of various data compression methods applied to image compression, both lossy and lossless. The page is entirely in what appears to be Finnish.
Created: 17/02/1999
by Mark NelsonMore...
This article originally appeared in the electronic Linux Gazette in January 1997 and was subsequently (re)printed in the April 1997 issue of Linux Journal. The main text is current as of early January 1997, with updates appearing at the very end as Author's Notes.
Created: 15/02/1999
by Mark NelsonMore...
by John C. Kieffer (Information Theory Research Group, Electrical Engineering, University of Minnesota). A survey of techniques for designing codes with minimal redundancy. A Postscript copy of an academic paper.
Created: 04/01/1999
by Mark NelsonMore...
ACIS, the AXAF CCD Imaging Spectrometer, is an instrument being built by a team from the Massachusetts Institute of Technology's Center for Space Research and the Pennsylvania State University for the Chandra X-ray Observatory (formerly, AXAF), scheduled for launch in 1999.
This is a good paper discussing practical implementation of data compression in a real world project with interesting parameters.
Created: 04/01/1999
by Mark NelsonMore...
By Robert W Buccigrossi and Eero P Simoncelli, GRASP Laboratory Technical Report #414, University of Pennsylvania, May 30, 1997. This page contains the abstract, with links to the full paper, as well as software associated with it.
Created: 24/12/1998
by Mark NelsonMore...
Leonid A. Broukhis puts his money where his mouth is by offering a cash prize for good, reproducible compression. He has paid out at least one modest prize.
Created: 23/12/1998
by Mark NelsonMore...
R.M. Gray (Ed.), M. Cohn, L.W. Craver, A. Gersho, T. Lookabaugh, F. Pollara, and M. Vetterli, November 1993. A Foreign Applied Sciences Assessment Center (FASAC) report prepared for Science Applications International Corporation (SAIC) under U.S. Government sponsorship. This is the draft version in Poscript format submitted to SAIC.
Created: 18/12/1998
by Mark NelsonMore...
A paper discussing BWT text compression in Proceedings of the 19th Australasian Computer Science Conference, Melbourne, Australia. Jan 31 - Feb 2, 1996.
Created: 08/12/1998
by Mark NelsonMore...
This file is based on the PKWare APPNOTE.TXT file dated 15 February 1996. This is what is normally used as the original documentation specifying the Zip file format. This file is from the documentation site for the InfoZip project. InfoZip is the source for the popular cross-platform Zip and UnZip programs.
Created: 07/12/1998
by Mark NelsonMore...
A reprint of an important paper. This site has links to the paper in PDF and Postscript formats. Claude E. Shannon is widely acknowledged to be the father of Information Theory. The publication of this paper established that reputation and gave birth to this area of scientific endeavor.
Created: 04/12/1998
by Mark NelsonMore...
Contains his implementation of the BWT algorithm, in the program bred. Along with this are some notes and papers on his implementation of BWT
Current list of files:
bexp.c
bexp3.c
bred.c
bred.ps
bred.ps.Z
bred2
bred3.c
bred3.ps
exp.c
huff.ps
mintext.ps
mintext.tex
red.c
tea.ps
tub.ps
wake.ps
xtea.ps
xxtea.ps
Created: 30/11/1998
by Mark NelsonMore...
by Diego Santa-Cruz, Touradj Ebrahimi, Joel Askelof, Mathias Larsson and Charilaos Christopoulos. This to-be-published paper looks at both lossy and lossless modes of the new standard.
Created: 01/01/1970
by Mark NelsonMore...
gzip (GNU zip) is a compression utility designed to be a replacement for compress. Its main advantages over compress are much better compression and freedom from patented algorithms. It has been adopted by the GNU project and is now relatively popular on the Internet. gzip was written by Jean-loup Gailly (jloup@gzip.org), and Mark Adler for the decompression code.
By virtue of links, this seems to be the official home page.
Created: 01/01/1970
by Mark NelsonMore...