AU616262B2

AU616262B2 – Compressing color video data
– Google Patents

AU616262B2 – Compressing color video data
– Google Patents
Compressing color video data

Download PDF
Info

Publication number
AU616262B2

AU616262B2
AU33561/89A
AU3356189A
AU616262B2
AU 616262 B2
AU616262 B2
AU 616262B2
AU 33561/89 A
AU33561/89 A
AU 33561/89A
AU 3356189 A
AU3356189 A
AU 3356189A
AU 616262 B2
AU616262 B2
AU 616262B2
Authority
AU
Australia
Prior art keywords
pixel
pixels
decision
determining
color
Prior art date
1988-04-27
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)

Ceased

Application number
AU33561/89A
Other versions

AU3356189A
(en

Inventor
John Music
Gordon H. Smith
James L. Thomas
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)

BIL (FAR EAST HOLDINGS) Ltd

Original Assignee
UVC Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
1988-04-27
Filing date
1989-03-23
Publication date
1991-10-24

1988-04-27
Priority claimed from US07/186,569
external-priority
patent/US4816901A/en

1988-04-27
Priority claimed from US07/186,573
external-priority
patent/US4849807A/en

1989-03-23
Application filed by UVC Corp
filed
Critical
UVC Corp

1989-11-24
Publication of AU3356189A
publication
Critical
patent/AU3356189A/en

1991-10-24
Application granted
granted
Critical

1991-10-24
Publication of AU616262B2
publication
Critical
patent/AU616262B2/en

1994-08-25
Assigned to BIL (FAR EAST HOLDINGS) LIMITED
reassignment
BIL (FAR EAST HOLDINGS) LIMITED
Alteration of Name(s) in Register under S187
Assignors: UVC CORP.

2009-03-23
Anticipated expiration
legal-status
Critical

Status
Ceased
legal-status
Critical
Current

Links

Espacenet

Global Dossier

Discuss

Classifications

H—ELECTRICITY

H04—ELECTRIC COMMUNICATION TECHNIQUE

H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION

H04N11/00—Colour television systems

H04N11/04—Colour television systems using pulse code modulation

H—ELECTRICITY

H04—ELECTRIC COMMUNICATION TECHNIQUE

H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION

H04N11/00—Colour television systems

H04N11/04—Colour television systems using pulse code modulation

H04N11/042—Codec means

G—PHYSICS

G06—COMPUTING; CALCULATING OR COUNTING

G06T—IMAGE DATA PROCESSING OR GENERATION, IN GENERAL

G06T9/00—Image coding

G06T9/005—Statistical coding, e.g. Huffman, run length coding

Abstract

The method and system for compressing color video data in a video communication system utilizes a three color digital signal, and involves the determination of a luminance function for each pixel in a series of video picture frames. One or more decision parameters based upon differences of the luminance function between pixels are compared with corresponding adaptive thresholds to determine decision points in the scan lines, and the digital word size of the three digital color components is reduced before encoding of run lengths between decision points of the color values.

Description

r 0- 616262 OPI DATE 24/11/89 AOJP DATE 21/12/89 APPLN. ID 33561 89 PCT NUMBER PCT/US89/01198
PC
WUKLU IN I LLLL I UAL M’UMIKI Y UXUAN lA I VIN International Bureau INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) (51) International Patent Classification 4 (11) International Publication Number: WO 89/10675 H04N 11/04 Al (43) International Publication Date: 2 November 1989 (02.11.89) (21) International Application Number: PCT/US89/01198 (81) Designated States: AU, BR, FI, JP, KR.
(22) International Filing Date: 23 March i989 (23.03.89) Published With international search report.
Priority data: 186,569 27 April 1988 (27.04.88) US 186,573 27 April 1988 (27.04.88) US (71) Applicant: UVC CORP. [US/US]; 16800 Aston Street, Irvine, CA 92714 (US).
(72)Inventors: MUSIC, John 14711 Mulberry, Irvine, CA 92714 SMITH, Gordon, H. 13072 Wheeler Place, Santa Ana, CA 92705 THOMAS, James, L. 1520 Shasta Way, Placentia, CA 92670 (US).
(74) Agents: PAUL, James, W. et al.; Fulwider Patton Lee Utecht, 3435 Wilshire Boulevard, Suite 2400, Los Angeles, CA 90010 (US).
(54) Title: METHOD AND SYSTEM FOR COMPRESSING COLOR VIDEO DATA (57) Abstract The method and system for compressing color video data in a vdieo communication sytem utilizes a three color digital signal and involves the determination of a luminance function (18) for each pixel in a series of video picture frames. One or more decision parameters (26) based upon differences of the luminance function between pixels are compared with corresponding adaptive thresholds to determine decision points in the scan lines, and the digital word size of the three digital color components is reduced before encoding of run lengths (28) between decision points of the color values.
a i; ii i;.
t i COLOR VIDEO DATA BACKGROUND OF THE INVENTION Field of the Invention: This invention relates generally to information signal processing, and in particular to the field of processing time sequential information signals, such as video signals, for the purpose of compressing the amount of information to be transferred from an encoding site to a decoding site. A particular use of the invention is in the communication of color video data over telephone lines.
Prior Art: Encoding of digital television signals ordinarily requires a transmission rate of approximately 200 Mbits/s. Recent developments in coding systems have permitted the transmission rate to be cut to less than 2 Mbits/s. Coding systems using block oriented analysis of video picture frames and processing by a conventional hybrid discrete cosine transform (DCT) coefficient permit transmission at rates of between 64 Kbits/s and 384 Kbits/s. Such a system is described in Gerken and Schiller, «A Low Bit-Rate Image Sequence Coder Combining A Progressive DPCM On Interleaved Rasters With A Hybrid DCT Technique», IEEE Journal on Selected Areas in Communications, Vol. SAC-5, No. 7, August, 1987. Adaptive coding techniques applied to such DCT processing have allowed video data transmission at rates as low as I- 13 data is to.determine the points of chance baspri 1~=e i%- »i WO 89/10675 PCT/US89/01198 2 one to two bits per pixel, as is described in Chen and Smith, «Adaptive Coding of Monochrome and Color Images», IEEE Transactions on Communications, Vol. COM-25, No. 11, November 19, 1977. However, information transmitted at such low data rates seriously affects the ability to reconstruct a sufficient number of frames per second so that a real time picture is acceptable to a viewer. High capacity telephone lines are available which will carry transmission at a rate of up to 1.544 Mbits/s, but such lines are extremely expensive at a dedicated use rate, and are still quite expensive at a scheduled use rate.
Lower capacity telephone lines are available which permit transmission at rates of up to 56 Kbits/s and 64 Kbits/s.
Relatively expensive video digital and coding devices are commercially available which will transmit a video signal at 56,000 bits per second, so that it is necessary to utilize a combination of a device of this nature with the high capacity 1.544 Mbits/s telephone line to allow a framing speed much faster than about one frame per second. The current transmission rate limit of ordinary telephone lines approaches 18,000 bits per second, so that transmission of real time sequencing of video pictures over ordinary telephone lines has been viewed in the prior art as not being feasible.
Various schemes for reducing the amount of redundancy of informatios to be transmitted in a digital video signal have been used. One technique is to utilize a slow scan camera; and another technique is to transmit every nth scanning line for each frame. Another tech- 1 nique involves the sending of only those parts of a ‘4 WO 89/10675 PCT/US89/01198 14 which identifies a decision point, and will stay in the WO 89/10675 PCT/US89/01198 -3picture frame which are deemed to be important or to have changed in some significant manner, by dividing the picture frame into a number of segments or blocks which are typically 3X3 or 4X4 groups of pixels, and analyzing the content of the blocks. These techniques tend to also reduce the resolution of the video picture.
Another technique in the reduction of transmission time which does not decrease the resolution of a picture transmitted is run length encoding. In run length encoding, the scan lines of a picture frame are encoded as a value of the color content of a series of pixels and the length of the sequence of pixels having that value or range of values. The values may be a measure of the amplitude of a video signal, or other properties of such video signals, such as luminance or chrominance. An example of a system which utilizes run length coding of amplitude of video signals is U. S.
Patent No. 3,609,244 (Mounts). In that system, a frame memory also determines frame to frame differences, so that only those differences from one frame to the next are to be transmitted. Another example of a method for transmitting video signals as compressed run length values which also utilizes statistical coding of frequent values to reduce the number of bits required to represent data is U. S. Patent No. 4,420,771 (Pirsch).
Ideally compression of color video information to allow real time sequencing of picture frames at a rate of up to 15 frames per second, and at bit rates as low as 11,500 bits per second would be desirable, to allow the communication of color video data over ordinary telephone lines. A video data compression system able to achieve equivalent data transmission rates as systems using higher quality telephone lines with more efficient and less costly equipment than is currently available would also be desirable.
WO 89/10675 PCT/US89/01198 15 The threshold for TRIGVAL is usually set in the 4 SUMMARY OF THE INVENTION The present invention provides, in general terms, for a method and system for compressing color video data in a video communication system, in which a luminance function is utilized to determine differences between the luminance of pixels in the scan lines of the picture, to determine the absolute of change about certain decision points in each scan line, and in which the digital word size of the color values is reduced. Thereafter pixels in the scan 1 0 lines of the picture are coded as a series of run lengths of the digitally compressed color values.
e Briefly, and in general terms, the method for compressing color video data according to the present invention is for use in a video communication system 15 having means for producing a color video signal comprising three digital color component signals of first, second and Sthird digital word sizes, and includes determining a luminance function for each pixel based upon the digital color signals; determining at least one decision parameter 20 based upon differences in a said luminance function between pixels a given distance from one another; datermining which of the pixels represent decision points based upon comparison of at least one decision parameter with threshold values; reducing the word size of digital S 25 color signals to provide reduced digital color signals of fourth, fifth and sixth digital word sizes; and coding the pixels in scan lines as combinations of run lengths and the digitally reduced color signals.
The present invention provides a method for compressing color video data in a video communication system having means for producing a color video signal for a plurality of video picture frames, with each picture frame comprising a plurality of scan lines composed of a plurality of pixels, each scan line having a starting pixzi and an ending pixel, and each pixel in said picture WO 89/10675 PCT/US89/1i98′ 16 for the run length between decision points when the 4% 4a frame comprising three digital color component signals of first, second and third digital word sizes, respectively, said method comprising the steps of: a) determining a luminance function for each pixel based upon at least one of said three digital color component signals; b) determining at least one decision parameter for at least a substantial portion of the pixels in the scan lines of said picture frame based upon the difference of 10 said luminance function between pixels at least one predetermined distance from at least one other pixel on each scan line; c) determining which of said pixels represent decision poinus based upon at least one decision parameter S 15 and at least one adaptive threshold; d) reducing the word size of at least one of said three reduced digital color components signals to provide one to three digital color component signals of fourth, fifth, and sixth digital word sizes, respectively, for 20 each said pixel; e) coding said plurality of pixels in each scan line as a plurality of combinations of pixel run lengths oo e and said three respective reduced color component signals for each said run length, said run lengths being S 25 determined between said starting pixel for each scan line, «intermediate points selected from the group of said decision points and points intermediate said decision points, and said ending pixel, and each run length being of a seventh digital word size.
The invention also provides generally for a system for compressing color video means for use in a video communications system having >-~ans for producing a color video signal comprising three digital color signals of first, second and third digital word sizes, and having means for determining a luminance function for each pixel -C_71 f, 1~A WO 89/10675 pCr/US89/O1198 17 code corresponding to a particular color combination.
based upon the digital color signals; the data compression system comprising means for determining at least one decision parameter based upon differences in said luminance function between pixels a given distance from one another; means for determining which of the pixels represent decision points based upon comparison of the decision parameters with one or more corresponding threshold values; means for reducing the word size of the digital color signals to give reduced digital color 10 signals of fourth, fifth, and sixth digital word sizes; and means for coding the pixels in scan lines as :combinations of run lengths of the digitally reduced color signals. The invention also provides for a camera which e. •includes the data compression system.
15 The present invention further provides a system for compressing color video data in a video communication system having a camera for producing a color video signal for a plurality of video picture frames, with each frame comprising a plurality of scan lines composed of a 20 plurality of pixels, each scan line having a starting pixel and an ending pixel, and each pixel in said frame comprising three digital color component signals of first, second and third digital word sizes, respectively, said system comprising: 25 a) means for determining a luminance function for each pixel based upon at least one of said three digital color component signals; b) means for determining at least ov’e decision parameter for at least a substantial portion of the pixels in the scan lines of said picture frame based upon the difference of _aid at least one luminance function between pixels at least one predetermined distance from at least one other pixel on each scan line; «i’j’ 7 $w, WO 89/10675 PCT/US89/01198 18 the capture buffer memory is then transmitted tn -h= 1 4 c) means for determining which of said pixels represent decision points based upon at least one decision parameter and at least one adaptive threshold; d) means for reducing the word size of at least one of said three reduced digital color components signals to provide one to three digital color component signals of fourth, fifth, and sixth digital word sizes, respectively, for each said pixel; e) means for coding said plurality of pixels in 10 each scan line as a plurality of combinations of pixels run lengths and said three respective reduced color component signals for each said run length, said run lengths being determined between said starting pixel for O ,c each scan line, intermediate points selected from the group of said decision points and points intermediate said decision points, and said ending pixel, and each run length being of a seventh digital word size.
In one preferred embodiment, the absolute value of the decision parameters is utilized; in an alternative preferred embodiment, the rate of change of the decision )parameters is utilized. In i currently preferred mode of the invention, the digital color component signals are •RGB, and the color component word sizes are equal. The digital word size of the digital color components is 25 preferably initially six bits per each component color, *a and the luminance function is determined with an accuracy based upon the six bit digital color values. Thereafter the word size of the digital color components is reduced to four bits each, and the run length and color components are coded together as a bit stream of combined run length and color information in sixteen bit digital words.
Other aspects and advantages of the invention will become apparent from the following detailed description and the accompanying drawings illustrating by way of example the feature of the invention.
T
M
-,7 WO 89/10675 PCT/US89/01198 19 in telecommunication would be about 15 frames ner second Ia SWO 89/10675 PCT/US89/0119 8 6 BRIEF DESCRIPTION OF THE DRAWINGS FIGURE 1 is a schematic diagram of the system and method for compressing color video data in a video communication system; FIG. 2 is a luminance plot across one scan line in a video picture; FIG. 3 shows a run length representation of features in a video scan line; and FIG. 4 shows a run length representation of transitions about slope decision points of a video scan line.
DETAILED DESCRIPTION OF THE INVENTION As is shown in the drawings for purposes of illustration, the invention is embodied in a method and system for compressing color video data in a video coamunication system having means for producing a color video signal for a plurality of picture frames, with each picture frame comprising a plurality of scan lines composed of a plurality of pixels, each scan line having a starting pixel and an ending pixel, and each pixel in each frame comprising three digital color component signals of first, second and third digital word sizes.
For each pixel a luminance function is determined, based upon at least one of the three digital color component signals for at least a substantial portion of the pixels in the scan lines of the picture frame, and one or more decision parameters based upon the difference of the luminance function between pixels at least one predetermined distance from another pixel on the scan line is determined for at least a substantial portion of the pixels in the scan lines of the picture frame. The absolute value of change of at least one of the decision parameters for each of the pixels is determined, and the W89/10675 PCT/US89/01198 WO 89/10675 7 amounts of change are compared with a corresponding threshold value to determine which of the pixels in the scan lines are loci for significant decision points in the decision parameter from pixel to pixel.
The word size of at least one of the three digital color component signals is reduced to provide three digitally reduced digital color component signals of fourth, fifth, and sixth digital word sizes, respectively, for each pixel, and these reduced digital color component signals are coded for each scan line as a plurality of combinations of pixel run lengths and the reduced color component signals for each run length, with the run lengths being determined between a starting point for each scan line, intermediate points which are decision points or points intermediate the decision points, and an ending pixel, with each run length being of a seventh digital word size.
The conversion of the color video information to run lengths representing transitions of the color video information from one locus of change to the next allows for shading and other gradual transitions of color information which would otherwise require a pixel by pixel coding to avoid reduced resolution. The coding of the digital color components reduced in their overall digital word size reduces the amount of information which is to be transmitted, without any significantly perceptible reduction in the color information received. The implementation of the invention permits the compression of color video data to levels which permit real time telecommunication and other applications of color video data compression without significant losses of perceptible information.
In accordance with the present invention, there is thus provided a method for compressing color video data in a video communication system having means for producing a color video signal for a plurality of video
I
WO 89/10675 PCT/US89/O1i98 20 tpcr/US89/01’198′ WO 89/10675 -8picture frames, with each picture frame comprising a plurality of scan lines composed of a plurality of pixels, each scan line having a starting pixel and an ending pixel, and each pixel in said picture frame comprising three digital color component signals of first, second and third digital word sizes, respectively, said method comprising the steps of determining a luminance function for each pixel based upon at least one of said three digital color component signals; determining at least one decision parameter for at least a substantial portion of the pixels in the scan lines of said picture frame based upon the difference of said at least one luminance function between pixels at least one predetermined distance from at least one other pixel on each scan line;! determining which of the pixels represent decision points based upon at least one decision parameter and at least one adaptive threshold; 1 reducing the word size of at least one of said three reduced digital color components signals to provide three digital color component signals of fourth, fifth, and sixth digital word sizes, respectively, for each said pixel; coding said plurality of pixels in each scan line as a plurality of combinations of pixel run lengths and said three reduced color component signals for each said run length, said run lengths being determined between said starting pixel for each scan line, intermediate points selected from the group of said decision points and points intermediate said decision points, and said ending pixel, and each run length being of a seventh digital word size. The present invention further provides for a system for compressing color video data in a video communication system having a camera for producing a color video signal for a plurality of video picture frames, with each picture frame comprising a plurality of scan lines composed of a plurality of pixels, each scan line having a starting pixel and an ending pixel and each WO 89/10675 PCT/US89/01198 21 parameter and at least one adaptive threshold; d) reducing the word size of at least one of WO 89/10675 PC/US89/01198 -9pixel in said frame comprising three digital color component signals of first, second and third digital word sizes, respectively, said system comprising means for determining a luminance function for each pixel based upon at least one of said three digital color component signals; means for determining at least one decision parameter for at least a substantial portion of the pixels in the scan lines of said picture frame based upon the difference of said luminance function between pixels at least one predetermined distance from at least on.
other pixel on each scan line; means for determining which of the pixels represent decision points based upon at least one decision parameter and at least one adaptive threshold to determine which of said pixels represent decision points; means for reducing the word size of at least one of said three reduced digital color components signals to provide three digital color component signals of fourth, fifth, and sixth digital word sizes, respectively, for each said pixel; means for coding said plurality of pixels in each scan line as a plurality of combinations of pixels run lengths and said three respective reduced color component signals for each said run length, said run lengths being determined between said starting pixel for each scan line, intermediate points selected fror, the group of said decision points and points intermediate said decision points, and said ending pixel, and each run length being of a seventh digital word size. The present invention additionally I provides a camera which includes the data compression system, for use in a video communication syst-em.
As is illustrated in the drawings, _n a preferrad implementation of the invention, the video communication system is capable of producing a color video picture using an RGB video camera, generating an analog RGB signal at the normal 60 fields per second, with each field representing half of the picture in an ‘i 1 WO 89/10675 PCT/US9/198 10 interlaced mode. The signal for the video picture frames generated by the camera 10 is received by an analog to digital converter 12, which converts the red, green and blue (RGB) analog components into digital RGB components, which are each digitized as six bit digital words, forming packets of bits for the RGB components for each pixel of the color video picture of eighteen bits.
Phe type of the device used to generate the source color video picture is not crucial to the invention, as a camera generating a standard NTSC composite signal which is converted to an RGB digital output would also be suitable as would a field rate differing from the standard 60 fields per second. The output of the camera also does not need to be strictly RGB, since other three color component groups may be used to create and transmit color video pictures. For example, the three digital color component signals may be cyan, magenta, and yellow; hue, saturation, and intensity; or even two distinct colors and a third parameter based upon the entire video signal, such as hue, saturation or intensity of an original analog video signal, so that there would be some automatic weighting of the color information generated by the camera. :i It is also not essential that the three color components be represented by the same number of bits, since it is known in the television industry that certain ranges of colors are not as easily perceived by the human eye. Such a weighting of information could involve a reduction in the number of bits used for the rrd component in an RGB scheme, for example, thus permitting transmission of more gradations of other color informa- i tion that is actually perceptible.
In addition, the source of the color video pictures to be compressed may be a storage means, such as a video disk, a computer file storage media, a video tape, or the like from which the color video information WO 89/10675 PCI?/US89/01198 11 can be processed for introduction into the color video data compression system of the invention.
The digitized RGB signal is received by the transition engine portion 14 of the image capture engine 16, which preferably includes integrated circuit means and associated memory means. The first major part of the image capture engine is the transition engine which includes circuitry for determining a luminance function based upon the three color component video signal for each picture element, or pixel, of each scan line in the sequence of video picture frames generated by the analog front end of the system. In the preferred mode, the luminance converter 18 sums the bits from each of the three digital color components for each pixel in the scan lines of the video picture frame to get a luminance (or intensity) value and performs further processing of the data obtained. In the system of the present invention each scan line preferably contains 480 pixels, which matches the resolution of the camera and which provides for better resol.ution than is typically available in the prior art, in which generally only 256 pixels are utilized per scan line. The luminance of the three color components may be weighted to give greater significance to one color or two colors to provide the luminance function, and may also be based in part upon an original source analog video signal. However, the luminance function is preferably based in part at least upon the sum of the three digital color components. The luminance function derived from the sum of the three six bit color components therefore has a digital word size of eight bits. This luminance function for each pixel is utilized in the input capture engine for evaluating one or more decision parameters based upon the luminance function for determination of those pixels which operate as decision points about which the one or more of the decision WO 89/10675 PCT/US89/Oi 12 parameters are found to vary from a prestored set of threshold values.
The luminance function is an excellent indicator of color changes in the picture, or movements of objects in the picture. In the image capture engine the one or more decision parameters based upon the luminance function may also be used as the basis for determination of differences from line to line, and of distinctive sequences of pixels which define edges of objects which can be determined to be moving from frame to frame.
Generally, the luminance, or other combination of color components which comprise the luminance function, undergoes significant changes where there are changes in the characteristics of the picture.
The camera also introduces anomalies or artifacts into the video picture due to noise in the color sampling resolution which ideally should be eliminated to reduce the amount of data to be transmitted since they contribute nothing beneficial to the picture. When the picture is displayed with a new field every 60th of a second, the effect of such anomalies is averaged out by the human eye. Areas having a smooth appearance and little actual detail upon close observation seem to «crawl». This appearance is also known as the «mosquito effect». When a picture is frozen so that only one field or picture frame is being examined, the picture takes on a grainy, speckled appearance. The impact of the noise on the luminance data is in the form of tiny variations in the computed luminance. When the picture is digitized, the digitizing process also converts all of these artifacts to digital representations, even though they do not actually represent picture detail. The processing of luminance in the image capture engine operates to eliminate such meaningless details.
One preferred method eliminating the non-essential details caused by noise in the luminance WO 89/10675 PCT/US89/01198 25 16. The system of Claim 15, wherein said means for determining said luminance function includes weinhi-nrr eNP -0 -a WO 89/10675 PCT/US89/01198 13 data is to determine the points of change based at least in part on the luminance function for pixels in the scan lines by comparing differences in one or more decision parameters with corresponding adaptive thresholds. This is termed feature encoding. The decision parameters are preferably comprised of differences of the luminance function between pixels, determined between proximate pixels (Diff-l) in a scan line, n plus one n plus two, or even a further distance away, where n represents the position on a scan line of the pixel being examined for changes in luminance; between adjacent first differences (Diff-2), and a cumulative parameter (Cum-diff) which is a sum of the individual difference functions Diff-l, and Diff-2. Each decision parameter has its own corresponding adaptive threshold, having a default value which is subject to modification by the system in response to operator settings. The adaptive threshold preferably has a default value which may be adjusted by the input capture engine responsive to operator or processor selections for resolution. The selecting of the threshold parameters for determining either the feature or transition decision points is quite subjective. The selection of the parameters determines the number of data points required to define the picture and it also determines the overall perceptual quality of the picture.
Typically for the feature run length determination, two thresholds are used. One is the cumulative change in luminance since the last decision point, Cumdiff. Cumdiff will trigger a decision point if it was greater than 6 and the number of pixels since the last decision point was greater than 5. Another decision parameter is the sum of two adjacent difference values, l Diff2 (this is the same as the difference between luminance values that are two pixels apart). If the Diff2 value is computed to be greater than typically 32, the logic wil signify that the line is entering an edge,
U
l WO 89/10675 PCT/US89/01198 14 which identifies a decision point, and will stay in the edge characteristic until the Diff2 value falls below When the edge mode is exited, the color of the next pixel is carried all the way back to the pixel where the starting edge determination was made. Also, if Diff2 changes sign, it signifies a new decision point. Changing the values far the cumdiff thresholds greatly affects the quality and data complexity of the picture.
In the slope determination of decision points (apexes), three general conditions are used. An initial slope is determined at the decision point and all measurements are base on that slope. The initial slope, INITS, is determined by computing the following function termed NDIFF2: NDIFF2 (luminance(.i+ 2 luminance(i))/2 INITS is the value of NDIFF2 immediately after the decision point.
CUMDIFF in the slope case is defined the following way: CUMDIFFi) CUMDIFF(i_-) NDIFF2(i) If the absolute value of the CUMDIFF is typically greater than 20 and the number of pixels in the run length is typically greater than 10, then a decision point will be triggered. Similarly, if the absolute value of NDIFF2 is less than or equal to typically 4 and the run length is typically greater than 5, a decision point will be triggered unless the last decision point was also triggered in this manner. The third decision parameter is also based upon NDIFF2: TRIGVAL(i) NDIFF2(i) INITS
IL
SPCT/US89/01198 ‘WO 89/10675 12- 1/3 WO 89/10675 PCT/US89/01198 15 The threshold for TRIGVAL is usually set in the range of 4 to 10 and will trigger a decision point any time the absolute value reaches or exceeds the set value and the run length is at least 2 pixels. Other techniques may be used but these seem to give good quality pictures with an acceptable number of data points.
A graphic representation of a typical plot of luminance across a line of a video picture is shown in Figure 2. The luminance function of the pixels intersected by the scan line 36 is graphically represented by line 38. As is shown in Figure 3, a graph of the decision points based upon comparison of one of the decision parameters with the corresponding adaptive difference threshold in a feature encoding technique, results in stepped line 40, a sequence of horizontal straight lines across the luminance pattern. Each horizontal line represents a separate length of a specific color.
A second approach which may be used to eliminate the non-essential details is a transition or slope encoding technique, which is illustrated in Figure 4. In this technique the rate of change of the differences in the decision parameter between pixels is determined, and the rates of change of these differences are compared with an adaptive, prestored difference rate of change threshold to determine decision points or apex points. These 4 change points or decision points are indicated as X’s on line 39. They indicate the location of the next apex.
«Run length» is defined as being the pixel distance between decision points, for both the feature encoding and slope encoding techniques. According to the transition or slope encoding technique, the luminance data results in a line 42 representing a series of apexes or S^ slope decision points, which may be used for controlling the color segments between decision points. A drawing engine can produce a smooth transition of color values PCT/US89/01 198 WO 2/9/10675 WO 89/10675 pCT/US89/0119 8 16 for the run length between decision points when the encoded information is to be retrieved. In this technique, for each scan line an initial color is transmitted, followed by as many sequences of run length and color values as are necessary to represent the picture frame content.
In the image capture engine of Fig. 1, the decision point detector 26 for determining decision points may alternatively be able to utilize either one of these methods for fixing the decision points in the color of the pixels in the picture, as each method has its respective advantages and disadvantages. The feature coding technique is typically more appropriate for pictures with a complexity of objects with distinctive edges or lines. On the other hand, the slope encoding technique is most suitable for encoding gradual transitions in shading or gradual color changes, but may require additional coding to represent complex pictures with images having many edges and lines. In the preferred implementation of the slope encoding technique, a sequence of thresholds will be compared with decision parameters, and the cumulative parameter (cum-diff) and an adaptive cumulative threshold will also be utilized in determining decision points, to account for those slow, gradual rates of change of luminance which would still result in an accumulated luminance change which is significant enough to merit identification of a decision point.
The three component color codes are also operated on in the run length processor 28 to drop the two least significant bits from the six bit values for the color components, reducing each of the color components in the preferred mode to four bit digital words.
Alternatively, in one preferred embodiment, the transi- I tion engine may also contain a predetermined color map representation of three-component colors, with an n-bit SPCT/US89/01198 W’ 89/10675 3/3 WO 89/10675 PCT/US89/01198 17 code corresponding to a particular color combination.
Here, the colors of the image are matched as closely as possible with the colors in the color map. As a further alternative, the color codes could also be rounded.
These truncated or reduced digital color components are then encoded with the run lengths between decision points in the run length processor 28. Although the preferred bit size for the reduced color components is four bits, just as the input digital word size for the color components from the analog front end can be of different sizes to vary the informational content, the reduced digital color components may also be of different sizes. A particular combination of digital word sizes for color components may include a reduced size for the red component, due to the recognition in the industry of the reduced perceptibility of this component.
These feature encoding and slope encoding techniques allow for a variable number of bits to be used to represent an initial picture frame and then changes in subsequent picture frames, in order to encode the minimum number of bits for each picture frame. This is significant a improvement over the prior art which typically analyzes a four by four or three by three block of pixels to compress the information in such a Llock, which always results in the same number of bits being utilized to represent the informational content in the picture, whether there have been changes outside the segment or not.
The second major portion of the image capture engine is the capture buffer memory (CBM) 29, which receives the encoded run lengths and reduced color components representing some 200 lines of data from the picture frame. Alternatively, if the data rate required becomes too high to send pictures at a desired speed, lesser numbers of scan lines can be stored, such as 150 or 100 lines. The run length and color component information in t.
r WO 89/10675 PCT/US89/01198 S89- 18 the capture buffer memory is then transmitted to the video data processor 30, which accesses the run length and color data in the capture buffer memory by an access control 35, and operates as an interface to transform and transmit the video information in a format suitable for transmission by the modem 32, connected to the telephone 34, and which may include means for further compressing the video data, at 33. The video data may also be compared with a previous picture frame stored in an old picture memory 31.
It is possible in a simplification processor 33 of the video data processor 30 to further analyze the difference between color values of pixels after the color codes have been truncated to provide the reduced color component codes, and to concatenate run lengths of such reduced color component codes which vary less than a given threshold value, or to further concatenate run lengths of the reduced color codes based upon variance of one or more of the decision parameters with respect to a corresponding threshold. As the run length code is typically at a maximum of four bits to he compatible with run length and color code combinations of 16 bits, with 16 bit computer buses in the current implementation, concatentation of a sequence of pixels for each run length would be expected to permit coding of up to sixteen pixels per run length. However, in the current implementation the values 0 to 15 are used to represent run lengths of from 2 to 17 pixels, since run lengths of 0 and 1 are not meaningful. Alternatively, longer run lengths may be determined initially as well, as may be compatible with different capacity computer buses, to permit run lengths of greater than 4 bits and run length color cede combinations greater than 16 bits.
As mentioned previously, it is expected that the limits of compression required for adequate smoothing of information in a real time sequencing of video pictures INTERNATIONAL SEARCH REPORT International Application No. PCT/US89/01198 1. CLASSIFICATION OF SUBJECT MATT iR (It several classfication symbols apply, indicate all) 6 According to International Patent Classfication (IPC) or to both National Classification and IPC IPC(4): H04N 11/04 U.S. C1. 35R/11 P<7L/US9/011" WO 89/10675 PCT/US89/01198 19 in telecommunication would be about 15 frames per second for transmission over conventional telephone lines. It would be possible to use a modem at 1200 bps (bits per second), but this would considerably slow the number of frames per second possible in the communication system. Ideally, the system is configured for half duplex mode, and a full duplex mode of configuration would be expected to require two telephone lines. Ideally the modem that is to be used is one which would utilize the largest bandwidth possible, and may be conventional 2400 bps or 9600 bps modem or special modems providing higher bit rates may be used. Although the invention has been described in the context of a video telephone conferencing system, the invention may be also be adapted for use in compressing color video data on magnetic media, such as magnetic floppy discs which may be used in storing and communicating such data via computer systems, magnetic hard disks for image storage or short video movie sequences, or on video discs for video disc players which could transmit the information in the form of a full length movie. In the foregoing description, it has been demonstrated that the method and system for compressing color video data can achieve a significant elimination of extraneous noise introduced by a video camera, and can result in a significant improvement in coding of the minimum amount of information necessary to reconstruct color video picture frames in a real time sequencing of video pictures. It will also be appreciated that the method and system for compressing color video data in a video communication system according to the invention reduces the digital word sizes of data encoded, and codes only the minimum necessary decision points in scan lines in video color pictures for reception, storage, and/or retrieval by a system for decompressing and decoding the color video information. 7 Claims (12) 2. The method of Claim 1, wherein said step of determining which of said pixels represent decision points comprises comparing the absolute value of said at least one decision parameter with at least one adaptive absolute difference threshold. 3. The method of Claim 1, wherein said step of determining which of said pixels represent decision points comprises determining the rate of change of said at least one decision parameter for each of said pixels for which said luminance function difference has been determined; and comparing said rate of change of said at least one decision parameter with a corresponding adaptive rate of change threshold. Ir t. ~z P I WO 89/10675 PCT/US89/01198 22 3 4. The method of Claim 1 N, wherein the step of comparing said rates of change with an adaptive rate of change threshold to determine said decision points includes comparing at least one said decision parameter with an adaptive cumulative difference threshold. The method of Claim 1, wherein the step of determining said luminance function comprises summing each of said three digital color component signals. 6. The method of Claim 5, wherein the step of determining said luminance function includes weighting of the sium of said three digital color component signals with respect to one or more of said three digital color component signals. 7. The method of Claim 1, wherein a first decision parameter is determined from the difference in said luminance function between a first pixel and a second proximate pixel a distance of one pixel away from said first pixel on a scan line. 8. The method of Claim 1, wherein a first decision parameter is determined from the difference in said luminance function between a first pixel and a second proximate pixel a distance of two pixels away froa said first pixel a scan line. 9. The method of Claim 7 or 8, wherein a cumulative decision parameter is determined from summing said proximate pixel differences. 7. or The method of Claim i4, wherein a second decision parameter is determined from differences between said proximate pixel differences. SWO 89/10675 PCT/US89/01198 -23- 11. The method of Claim 1, further including the step of concatenating run lengths in said plural- ity of combinations of run lengths and reduced color components in a scan line which are associate with reduced color components whose differences are less than a predetermined color difference threshold. 12. A system for compressing color video data in a video communication system having a camera for producing a color video signal for a plurality of video picture frames, with each picture frame compris- ing a plurality of scan lines composed of a plurality of pixels, each scan line having a starting pixel and an ending pixel and each pixel in said frame compris- ing three digital color component signals of first, second and third digital word sizes, respectively, said system comprising: a) means for determining a luminance func- tion for each pixel based upon at least one of said three digital color component signals; b) means for determining at least one deci- sion parameter for at ±east a substantial portion of the pixels in the scan lines of said picture frame based upon the difference of said at least one luminance function between pixels at least one predetermined distance from at least one other pixel on each scan line; c) means for determining which of said pixels represent decision points based upon at least one decision parameter and at least one adaptive threshold; d) means for reducing the word size of at least one of said three reduced digital color compon- ents signals to provide one to three digital color component signals of fourth, fifth, and sixth digital word sizes, respectively, for each said pixel; WO 89/10675 PCr/US89/198 24 e) misas for coding said plurality of pixels in each scan line as a plurality of combinations of pixels run lengths and said three respective reduced color component signals for each said run length, said run lengths being determined between said starting pixel for each scan line, intermediate points selected from the group of said decision points and points intermediate said decision points, and said ending pixel, and each run length being of a seventh digital word size. 13. The system of Claim 12, wherein said means for determining which of said pixels represents decision points comprises means for comparing the absolute values of said at least one decision parameter with at least one adaptive absolute difference threshold. 14. The system of Claim 12, wherein said means for determining which of said pixels represent decision points comprises: a) means for determining the rate of change of said at least one decision parameter for each of said pixels for which said luminance function difference has been determined; and b) means for comparing said rate of change of said at least one decision parameter with a corresponding adapative rate of change threshold to determine which of said pixels represent decision points. The system of Claim 12, wherein said means for determining said luminance function com- prises means for summing each of said three digital color component signals. K. it I A WO 89/10675 PCT/US89/01198 25 16. The system of Claim 15, wherein said means for determining said luminance function includes weighting of the sum of said three digital color component signals with respect to one or more of said three digital color component signals. 17o The system of Claim 12, further includ- ing means for concatenating run lengths in said plurality of combinations of run lengths and reduced color components in a scan line which are associated with reduced color components whose differences are less than a predetermined color difference threshold. I' AU33561/89A 1988-04-27 1989-03-23 Compressing color video data Ceased AU616262B2 (en) Applications Claiming Priority (4) Application Number Priority Date Filing Date Title US07/186,569 US4816901A (en) 1988-04-27 1988-04-27 Method and system for compressing color video data US07/186,573 US4849807A (en) 1988-04-27 1988-04-27 Method and system for compressing color video feature encoded data US186573 1988-04-27 US186569 1988-04-27 Publications (2) Publication Number Publication Date AU3356189A AU3356189A (en) 1989-11-24 AU616262B2 true AU616262B2 (en) 1991-10-24 Family ID=26882225 Family Applications (1) Application Number Title Priority Date Filing Date AU33561/89A Ceased AU616262B2 (en) 1988-04-27 1989-03-23 Compressing color video data Country Status (13) Country Link EP (1) EP0339938B1 (en) JP (1) JP2583628B2 (en) KR (1) KR950009459B1 (en) AT (1) ATE140114T1 (en) AU (1) AU616262B2 (en) BR (1) BR8906932A (en) CA (1) CA1324654C (en) DE (1) DE68926760T2 (en) FI (1) FI896257A0 (en) IL (1) IL90023A0 (en) NZ (1) NZ228807A (en) PT (1) PT90388B (en) WO (1) WO1989010675A1 (en) Families Citing this family (4) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title US6453067B1 (en) * 1997-10-20 2002-09-17 Texas Instruments Incorporated Brightness gain using white segment with hue and gain correction US7085424B2 (en) 2000-06-06 2006-08-01 Kobushiki Kaisha Office Noa Method and system for compressing motion image information US10888069B2 (en) 2017-11-07 2021-01-12 Towerstar Pets, Llc Pet toy including apertures for receiving treats USD843680S1 (en) 2018-02-21 2019-03-26 Towerstar Pets, Llc Pet chew toy Citations (3) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title US4772956A (en) * 1987-06-02 1988-09-20 Eastman Kodak Company Dual block still video compander processor AU3414789A (en) * 1988-04-27 1989-11-24 Bil (Far East Holdings) Limited Method and system for decompressing color video encoded data AU3354289A (en) * 1988-04-27 1989-11-24 Bil (Far East Holdings) Limited Method and system for compressing and decompressing digital color video statistically encoded data Family Cites Families (5) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title FR2524740B1 (en) * 1981-06-02 1986-09-19 Thomson Csf METHOD FOR COMPRESSING A DIGITAL IMAGE FR2529044B1 (en) * 1982-06-18 1986-06-20 Inst Nat Rech Inf Automat VISUAL TELECOMMUNICATIONS METHODS AND DEVICES, ESPECIALLY FOR THE USE OF THE DEAF DE3236281A1 (en) * 1982-09-30 1984-04-05 Siemens AG, 1000 Berlin und 8000 München METHOD FOR COLOR SPACE CODING OF DIGITAL COLOR VIDEO SIGNALS AND SYSTEM FOR IMPLEMENTING THE METHOD US4743959A (en) * 1986-09-17 1988-05-10 Frederiksen Jeffrey E High resolution color video image acquisition and compression system US4774587A (en) * 1987-06-02 1988-09-27 Eastman Kodak Company Still video transceiver processor 1989 1989-03-23 BR BR898906932A patent/BR8906932A/en unknown 1989-03-23 JP JP1503914A patent/JP2583628B2/en not_active Expired - Lifetime 1989-03-23 WO PCT/US1989/001198 patent/WO1989010675A1/en active Application Filing 1989-03-23 AU AU33561/89A patent/AU616262B2/en not_active Ceased 1989-03-23 KR KR1019890702455A patent/KR950009459B1/en not_active IP Right Cessation 1989-03-23 CA CA000594741A patent/CA1324654C/en not_active Expired - Fee Related 1989-04-19 NZ NZ228807A patent/NZ228807A/en unknown 1989-04-19 IL IL90023A patent/IL90023A0/en unknown 1989-04-25 EP EP89304115A patent/EP0339938B1/en not_active Expired - Lifetime 1989-04-25 AT AT89304115T patent/ATE140114T1/en not_active IP Right Cessation 1989-04-25 DE DE68926760T patent/DE68926760T2/en not_active Expired - Fee Related 1989-04-27 PT PT90388A patent/PT90388B/en not_active IP Right Cessation 1989-12-22 FI FI896257A patent/FI896257A0/en not_active IP Right Cessation Patent Citations (3) * Cited by examiner, † Cited by third party Publication number Priority date Publication date Assignee Title US4772956A (en) * 1987-06-02 1988-09-20 Eastman Kodak Company Dual block still video compander processor AU3414789A (en) * 1988-04-27 1989-11-24 Bil (Far East Holdings) Limited Method and system for decompressing color video encoded data AU3354289A (en) * 1988-04-27 1989-11-24 Bil (Far East Holdings) Limited Method and system for compressing and decompressing digital color video statistically encoded data Also Published As Publication number Publication date EP0339938A2 (en) 1989-11-02 JP2583628B2 (en) 1997-02-19 NZ228807A (en) 1991-02-26 JPH02504099A (en) 1990-11-22 CA1324654C (en) 1993-11-23 KR950009459B1 (en) 1995-08-22 IL90023A0 (en) 1989-12-15 PT90388A (en) 1989-11-10 BR8906932A (en) 1990-12-11 AU3356189A (en) 1989-11-24 ATE140114T1 (en) 1996-07-15 EP0339938A3 (en) 1991-07-10 DE68926760D1 (en) 1996-08-08 DE68926760T2 (en) 1996-10-31 KR900701130A (en) 1990-08-17 FI896257A0 (en) 1989-12-22 EP0339938B1 (en) 1996-07-03 PT90388B (en) 1994-04-29 WO1989010675A1 (en) 1989-11-02 Similar Documents Publication Publication Date Title US4816901A (en) 1989-03-28 Method and system for compressing color video data AU617478B2 (en) 1991-11-28 Video telecommunication system and method for compressing and decompressing digital color video data US4914508A (en) 1990-04-03 Method and system for compressing and statistically encoding color video data US4857993A (en) 1989-08-15 Method and system for decompressing digital color video statistically encoded data US4857991A (en) 1989-08-15 Method and system for decompressing color video feature encoded data US4541012A (en) 1985-09-10 Video bandwidth reduction system employing interframe block differencing and transform domain coding US5557330A (en) 1996-09-17 Encoding video signals using selective pre-filtering EP0888010B1 (en) 2004-10-06 Image encoding method and apparatus US4843466A (en) 1989-06-27 Method and system for decompressing color video slope encoded data US4849807A (en) 1989-07-18 Method and system for compressing color video feature encoded data AU616006B2 (en) 1991-10-17 Method and system for compressing and decompressing digital color video statistically encoded data US7212680B2 (en) 2007-05-01 Method and apparatus for differentially compressing images AU616262B2 (en) 1991-10-24 Compressing color video data EP0339947B1 (en) 1996-09-04 Method and system for decompressing colour video encoded data JPH07284101A (en) 1995-10-27 Image compression coding device Legal Events Date Code Title Description 2002-10-31 MK14 Patent ceased section 143(a) (annual fees not paid) or expired
Download PDF in English

None