Abstract:
We present a new image compression technique called "DjVu " that is specifically geared towards the compression of high-resolution, high-quality images of scanned documents in color. This enables fast transmission of document images over low-speed connections, while faithfully reproducing the visual aspect of the document, including color, fonts, pictures, and paper texture. The DjVu compressor separates the text and drawings, which needs a high spatial resolution, from the pictures and backgrounds, which are smoother and can be coded at a lower spatial resolution. Then, several novel techniques are used to maximize the compression ratio: the bi-level foreground image is encoded with AT&T's proposal to the new JBIG2 fax standard, and a new wavelet-based compression method is used for the backgrounds and pictures. Both techniques use a new adaptive binary arithmetic coder called the Z-coder. A typical magazine page in color at 300dpi can be compressed down to between 40 to 60 KB, approx...
Citations
|
943
|
Embedded image coding using zerotrees of wavelet coefficients
– SHAPIRO
- 1993
|
|
845
|
Some Methods for Classification and Analysis of Multivariate Observations
– MacQueen
- 1967
|
|
637
|
A new, fast, and efficient image codec based on set partitioning in hierarchical trees
– Said, Pearlman
- 1996
|
|
593
|
TC: Managing Gigabytes: compressing and indexing documents and images
– IA, Moffat, et al.
- 1999
|
|
517
|
Arithmetic coding for data compression
– Witten, Neal, et al.
- 1987
|
|
300
|
Lifting Scheme: A Custom-Design Construction of Biorthogonal Wavelets
– Sweldens
- 1996
|
|
166
|
Run-length encodings
– Golomb
- 1966
|
|
84
|
Image Restoration by the Method of Convex Projections: Part 1 – Theory
– YOULA, WEBB
- 1982
|
|
78
|
An Overview of the Basic Principles of the Q-Coder Adaptive Binary Arithmetic
– Pennebaker, Mitchell
- 1988
|
|
76
|
Comparison of learning algorithms for handwritten digit recognition
– LeCun, Jackel, et al.
- 1995
|
|
67
|
Orthogonal pyramid transforms for image coding
– Adelson, Simoncelli, et al.
- 1987
|
|
54
|
Finding Text In Images
– Wu, Manmatha, et al.
- 1997
|
|
52
|
Convergence properties of the k-means algorithms
– Bottou, Bengio
- 1995
|
|
49
|
A means for achieving a high degree of compaction on scan-digitized printed text
– Ascher, Nagy
- 1974
|
|
44
|
Arithmetic coding for data compression
– Howard, Vitter
- 1994
|
|
44
|
Practical Digital Libraries: Books, Bytes, and Bucks
– Lesk
- 1997
|
|
37
|
A New Fast and E cient Image Codec Based on Set Partitioning in Hierarchical Trees
– Said, Pearlman
- 1996
|
|
35
|
Towards Active, Extensible, Networked Documents: Multivalent Architecture and Applications
– Phelps, Wilensky
- 1996
|
|
29
|
Some methods for classi cation and analysis of multivariate observations
– MacQueen
- 1967
|
|
27
|
Text image compression using soft pattern matching
– Howard
- 1997
|
|
25
|
Image restoration by the method of convex projections: Part 2 -- applications and numerical results
– Sezan, Stark
- 1982
|
|
25
|
The Rightpages image-based electronic library for alerting and browsing
– Story, O'Gorman, et al.
- 1992
|
|
23
|
Lossless binary image compression based on pattern matching
– Mohiuddin, Rissanen, et al.
- 1984
|
|
19
|
The Z-coder adaptive binary coder
– Bottou, Howard, et al.
- 1998
|
|
11
|
Progressive bi-level image compression
– JBIG
- 1993
|
|
9
|
Xydeas. Recent developments in image data compression for digital facsimile
– Holt, S
- 1986
|
|
6
|
Textual image compression
– Witten, Bell, et al.
- 1992
|
|
5
|
Reading checks with multilayer graph transformer networks
– LeCun, Bottou
- 1997
|
|
2
|
A rapid entropy-coding algorithm
– Withers
- 1996
|
|
1
|
Mixed rater content (MRC) mode
– MRC
- 1997
|