Last updated: April 19, 2026
Application No. 19/058,985
TECHNIQUES FOR PERFORMING BOTH SCALAR QUANTIZATION AND TRELLIS CODED QUANTIZATION WHEN ENCODING VIDEO DATA

Non-Final OA §102
Filed
Feb 20, 2025
Examiner
HODGES, SUSAN E
Art Unit
2425
Tech Center
2400 — Computer Networks
Assignee
Netflix Inc.
OA Round
1 (Non-Final)
Interview Optional

— +14.4% interview lift. This examiner has a relatively high allow rate; a written response may suffice.
Based on 375 resolved cases, 2023–2026
Examiner Intelligence

HODGES, SUSAN E View full profile →
Grants 67% — above average
Career Allow Rate
250 granted / 375 resolved
+8.7% vs TC avg
Moderate +14% lift
Without
With
+14.4%
Interview Lift
resolved cases with interview
Typical timeline
2y 4m
Avg Prosecution
31 currently pending
Career history
406
Total Applications
across all art units
Statute-Specific Performance

§101
6.0%
-34.0% vs TC avg
§103
48.7%
+8.7% vs TC avg
§102
20.9%
-19.1% vs TC avg
§112
22.6%
-17.4% vs TC avg
Black line = Tech Center average estimate • Based on career data from 375 resolved cases
Office Action

§102
DETAILED ACTION
This office action is in response to the application filed on February 20, 2025. Claims 1 – 20 are pending.

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Priority
Acknowledgment is made of applicant's claim for priority based on U.S. provisional applications 63/647,364 filed on Mary 14, 2024.

Information Disclosure Statement
The information disclosure statement (IDS) was submitted on November 18, 2025.  The submission is in compliance with the provisions of 37 CFR 1.97.  Accordingly, the information disclosure statement is being considered by the Examiner.

Claim Rejections - 35 USC § 102
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the basis for the rejections under this section made in this Office action:
A person shall be entitled to a patent unless –

(a)(1) the claimed invention was patented, described in a printed publication, or in public use, on sale or otherwise available to the public before the effective filing date of the claimed invention.

Claims 1 - 20 are rejected under 35 U.S.C. 102(a)(1) as being anticipated by SCHWARZ et al., (US 2021/0136385 A1) referred to as SCHWARZ hereinafter.
Regarding Claim 1, SCHWARZ discloses a computer-implemented method for encoding video data (Fig. 1, Par. [0025], a method for encoding a block of transform coefficients), the method comprising:
performing one or more scalar quantization operations on a plurality of transform coefficients of prediction residues to generate a plurality of first indices (Fig. 2a, Scalar quantization, Par. [0063], The resulting transform blocks are coded using transform coding: A 2d transform is applied (i.e. performing) to the block of residual samples, the resulting transform coefficients are quantized (i.e. quantization operation) using independent scalar quantization, and the resulting transform coefficient levels (quantization indexes) (i.e. first indices) are entropy coded);
performing one or more trellis coded quantization operations on a first vector that includes the plurality of transform coefficients to generate a second vector that includes a plurality of second indices (Fig. 5B, dependent scalar quantization, Par. [0150] The state transition in dependent quantization can also be represented using a trellis structure, as is illustrated in FIG. 12. The trellis shown in this figure corresponds to the state transitions specified in Table 1. For each state (i.e. second vector), there are two paths that connect the state for a current transform coefficient with two possible states for the next transform coefficient in reconstruction order. Given an initial state (the state 0) (i.e. first vector), the path through the trellis (i.e. trellis quantization operation) is uniquely specified by the transmitted quantization indexes (i.e. second indices));
determining a first cost function value based on a third vector that includes the plurality of first indices and a second cost function value based on the second vector that includes the plurality of second indices (Par. [0108], The location of the first non-zero transform coefficient levels (i.e. third vector) is determined by comparing the Lagrangian costs that are obtained by choosing one of the non-zero transform coefficient levels as first non-zero transform coefficient levels (i.e. second cost function value) in coding order (the preceding transform coefficient levels are set equal to zero (i.e. first cost function value));
determining whether the second cost function value is less than the first cost function value (Fig. 17, Par. [0103] Since the selection of transform coefficient levels determines both the distortion (or reconstruction/approximation quality) and the bit rate, the quantization algorithm used has a substantial impact on the rate-distortion performance of the produced bitstream. Par. [0106], the quantization indexes q.sub.k for a transform block should be determined in a way so that the following cost measure is minimized. Par. [0189] Compare the costs of the 4 final nodes (for the last coefficient in coding order) and chose the node with minimum cost (i.e. less than first));
setting a fourth vector that includes a plurality of quantization indices equal to either the second vector or the third vector based on whether the second cost function value is less than the first cost function value (Par. [0189] Follow the chosen path (specified by the final node) (i.e. fourth vector) is reverse order and collect the quantization indexes that are associated with the connections between the trellis nodes); and
performing one or more entropy coding operations on the fourth vector that includes the plurality of quantization indices to generate encoded video data (Fig. 1, entropy coding, Par. [0113], dependent scalar quantization is combined with a modified entropy coding, in which the probability model selection (or, alternatively, the codeword table selection) for a transform coefficient depends on the set of admissible reconstruction levels).

Regarding Claim 2, SCHWARZ discloses claim 1. SCHWARZ further discloses wherein first metadata also is generated when the one or more scalar quantization operations are performed on the plurality of transform coefficients of prediction residues (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position).

Regarding Claim 3, SCHWARZ discloses claim 2. SCHWARZ further discloses wherein the second cost function value is not less than the first cost function value (Par. [0114], the set of admissible values for the second transform coefficient t′.sub.1 does not depend on the chosen value for the first reconstructed transform coefficient t′.sub.0.), and further comprising determining quantization metadata based on the first metadata (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position), and storing at least a portion of the quantization metadata in memory (Par. [0064], The reconstructed prediction error signal for a block (obtained by reconstructing the transform coefficients given the quantization indexes and an inverse transform) is added to the corresponding prediction signal and the result is written to a buffer for the current picture. After all blocks of a picture are reconstructed, one or more in-loop filters can be applied (for example, a deblocking filter and a sample adaptive offset filter). The final reconstructed picture is then stored in a decoded picture buffer).

Regarding Claim 4, SCHWARZ discloses claim 3. SCHWARZ further discloses wherein the quantization metadata is further determined based on the second cost function value (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position. Par. [0103] Since the selection of transform coefficient levels determines both the distortion (or reconstruction/approximation quality) and the bit rate, the quantization algorithm used has a substantial impact on the rate-distortion performance of the produced bitstream).

Regarding Claim 5, SCHWARZ discloses claim 1. SCHWARZ further discloses wherein second metadata also is generated when the one or more trellis coded quantization operations are performed on the first vector that includes the plurality of transform coefficients (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. second metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position) Fig. 5B, dependent scalar quantization, Par. [0150] The state transition in dependent quantization can also be represented using a trellis structure, as is illustrated in FIG. 12. The trellis shown in this figure corresponds to the state transitions specified in Table 1. For each state (i.e. second vector), there are two paths that connect the state for a current transform coefficient with two possible states for the next transform coefficient in reconstruction order. Given an initial state (the state 0) (i.e. first vector), the path through the trellis (i.e. trellis quantization operation) is uniquely specified by the transmitted quantization indexes).

Regarding Claim 6, SCHWARZ discloses claim 5. SCHWARZ further discloses wherein the second cost function value is less than the first cost function value (Par. [0051], an example trellis structure that can be exploited for determining sequences (or blocks) of quantization indexes that minimize a cost measures (such as an Lagrangian cost measure D+λ.Math.R). The trellis is shown for 8 transform coefficients (or quantization indexes). The first state (at the very left) represents an initial state, which is set equal to 0), and further comprising determining quantization metadata based on the second metadata (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position), and storing at least a portion of the quantization metadata in memory (Par. [0064], The reconstructed prediction error signal for a block (obtained by reconstructing the transform coefficients given the quantization indexes and an inverse transform) is added to the corresponding prediction signal and the result is written to a buffer for the current picture. After all blocks of a picture are reconstructed, one or more in-loop filters can be applied (for example, a deblocking filter and a sample adaptive offset filter). The final reconstructed picture is then stored in a decoded picture buffer).

Regarding Claim 7, SCHWARZ discloses claim 6. SCHWARZ further discloses wherein the quantization metadata is further determined based on the first cost function value (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position. Par. [0103] Since the selection of transform coefficient levels determines both the distortion (or reconstruction/approximation quality) and the bit rate, the quantization algorithm used has a substantial impact on the rate-distortion performance of the produced bitstream).

Regarding Claim 8, SCHWARZ discloses claim 1. SCHWARZ further discloses wherein the second cost function value is less than the first cost function value (Par. [0108], The location of the first non-zero transform coefficient levels (i.e. third vector) is determined by comparing the Lagrangian costs that are obtained by choosing one of the non-zero transform coefficient levels as first non-zero transform coefficient levels (i.e. second cost function value) in coding order (the preceding transform coefficient levels are set equal to zero (i.e. first cost function value)), and the fourth vector that includes the plurality of quantization indices is set equal to the second vector (Par. [0189] Follow the chosen path (specified by the final node) (i.e. fourth vector) is reverse order and collect the quantization indexes that are associated with the connections between the trellis nodes).

Regarding Claim 9, SCHWARZ discloses claim 1. SCHWARZ further discloses wherein the second cost function value is not less than the first cost function value (Par. [0114], the set of admissible values for the second transform coefficient t′.sub.1 does not depend on the chosen value for the first reconstructed transform coefficient t′.sub.0.), and the fourth vector that includes the plurality of quantization indices is set equal to the third vector (Par. [0189] Follow the chosen path (specified by the final node) (i.e. third vector) is reverse order and collect the quantization indexes that are associated with the connections between the trellis nodes).

Regarding Claim 10, SCHWARZ discloses claim 1. SCHWARZ further discloses wherein a cost function is used to compute the first cost function value and the second cost function value (Par. [0106] Quantization algorithms that aim to minimize a Lagrange function (i.e. cost function) D+λ.Math.R of distortion and rate are also referred to as rate-distortion optimized quantization (RDOQ). If we measure the distortion using the MSE or a weighted MSE, the quantization indexes q.sub.k for a transform block (i.e. first and second cost functions) should be determined in a way so that the following cost measure is minimized), wherein the cost function incorporates a tradeoff between an estimated number of bits needed by an entropy encoder (Fig. 1) to encode a sequence of transform coefficients (Par. [0187] a very good trade-off between distortion (reconstruction quality) and bit rate (i.e. estimated number of bits), the quantization indexes should be selected in a way that a Lagrangian cost measure is minimized) and a distortion corresponding to the sequence of transform coefficients (Par. [0051] FIG. 17 shows a schematic diagram of an example trellis structure that can be exploited for determining sequences (or blocks) of quantization indexes that minimize a cost measures (such as an Lagrangian cost measure D+λ.Math.R)).

Regarding Claim 11, it has limitations similar to those treated in the above rejection(s) of Claim 1, and is met by the references as discussed above.  Claim 11 however also recites one or more non-transitory, computer-readable media storing instructions that, when executed by one or more processors (Par. [0032] a non-transitory digital storage medium may have a computer program stored thereon to perform the inventive methods, when said computer program is run by a computer).

Regarding Claim 12, SCHWARZ discloses claim 11. SCHWARZ further discloses setting a flag value to indicate that the second cost function value is less than the first cost function value (Par. [0108]-[0112] The flags coded_sub_block_flag for the 4×4 subblocks are determined by comparing the Lagrangian costs for the following two cases: (a) The transform coefficient levels selected in step 1 are used; (b) The syntax element coded_sub_block_flag is set equal to zero and, thus, all transform coefficient levels of the 4×4 subblock are set equal to zero. The coded_block_flag is determined by comparing the Lagrangian costs for the sequence of transform coefficient levels obtained after step 3 and the case that all transform coefficient levels inside the transform block are set equal to zero).

Regarding Claim 13, SCHWARZ discloses claim 12. SCHWARZ further discloses wherein both the fourth vector that includes the plurality of quantization indices and the flag value are transmitted to an entropy coding engine that performs the one or more entropy coding operations (Par. [0076] For simplifying the following entropy coding, the admissible reconstruction levels are represented by quantization indexes (also referred to as transform coefficient levels) (i.e. fourth vector) , which are transmitted as part of the bitstream) Par. [0090] A syntax element coded_block_flag is transmitted, which signals whether there are any non-zero transform coefficient levels in the transform block).

Regarding Claim 14, SCHWARZ discloses claim 11. SCHWARZ further discloses wherein each of the first cost function value and the second cost function value includes a coding cost and a distortion (Par. [0106] Quantization algorithms that aim to minimize a Lagrange function D+λ.Math.R of distortion and rate are also referred to as rate-distortion optimized quantization (RDOQ). If we measure the distortion using the MSE or a weighted MSE, the quantization indexes q.sub.k for a transform block should be determined in a way so that the following cost measure is minimized).

Regarding Claim 15, SCHWARZ discloses claim 11. SCHWARZ further discloses wherein the second cost function value is less than the first cost function value (Par. [0108], The location of the first non-zero transform coefficient levels (i.e. third vector) is determined by comparing the Lagrangian costs that are obtained by choosing one of the non-zero transform coefficient levels as first non-zero transform coefficient levels (i.e. second cost function value) in coding order (the preceding transform coefficient levels are set equal to zero (i.e. first cost function value)), and the fourth vector that includes the plurality of quantization indices is set equal to the second vector (Par. [0189] Follow the chosen path (specified by the final node) (i.e. fourth vector) is reverse order and collect the quantization indexes that are associated with the connections between the trellis nodes).

Regarding Claim 16, SCHWARZ discloses claim 11. SCHWARZ further discloses wherein the second cost function value is not less than the first cost function value (Par. [0114], the set of admissible values for the second transform coefficient t′.sub.1 does not depend on the chosen value for the first reconstructed transform coefficient t′.sub.0.), and the fourth vector that includes the plurality of quantization indices is set equal to the third vector (Par. [0189] Follow the chosen path (specified by the final node) (i.e. third vector) is reverse order and collect the quantization indexes that are associated with the connections between the trellis nodes).

Regarding Claim 17, SCHWARZ discloses claim 11. SCHWARZ further discloses wherein a cost function is used to compute the first cost function value and the second cost function value (Par. [0106] Quantization algorithms that aim to minimize a Lagrange function (i.e. cost function) D+λ.Math.R of distortion and rate are also referred to as rate-distortion optimized quantization (RDOQ). If we measure the distortion using the MSE or a weighted MSE, the quantization indexes q.sub.k for a transform block (i.e. first and second cost functions) should be determined in a way so that the following cost measure is minimized), wherein the cost function incorporates a tradeoff between an estimated number of bits needed by an entropy encoder (Fig. 1) to encode a sequence of transform coefficients (Par. [0187] a very good trade-off between distortion (reconstruction quality) and bit rate (i.e. estimated number of bits), the quantization indexes should be selected in a way that a Lagrangian cost measure is minimized) and a distortion corresponding to the sequence of transform coefficients (Par. [0051] FIG. 17 shows a schematic diagram of an example trellis structure that can be exploited for determining sequences (or blocks) of quantization indexes that minimize a cost measures (such as an Lagrangian cost measure D+λ.Math.R)).

Regarding Claim 18, SCHWARZ discloses claim 11. SCHWARZ further discloses wherein first metadata also is generated when the one or more scalar quantization operations are performed on the plurality of transform coefficients of prediction residues (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position), and further comprising determining quantization metadata based on the first metadata (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position).

Regarding Claim 19, SCHWARZ discloses claim 11. SCHWARZ further discloses wherein second metadata also is generated when the one or more trellis coded quantization operations are performed on the first vector that includes the plurality of transform coefficients (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. second metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position) Fig. 5B, dependent scalar quantization, Par. [0150] The state transition in dependent quantization can also be represented using a trellis structure, as is illustrated in FIG. 12. The trellis shown in this figure corresponds to the state transitions specified in Table 1. For each state (i.e. second vector), there are two paths that connect the state for a current transform coefficient with two possible states for the next transform coefficient in reconstruction order. Given an initial state (the state 0) (i.e. first vector), the path through the trellis (i.e. trellis quantization operation) is uniquely specified by the transmitted quantization indexes), and further comprising determining quantization metadata based on the second metadata (Par. [0056] The binarization of the transform coefficient levels and the distribution of the binary decisions (also referred to as bins) over the multiple passes is chosen in a way that the data coded (i.e. metadata) in the first pass uniquely determine the set of admissible reconstruction levels for the next scan position).

Regarding Claim 20, it has limitations similar to those treated in the above rejection(s) of Claim 1, and is met by the references as discussed above.  Claim 20 however also recites a computer system, comprising: one or more memories storing instructions; and one or more processors that are coupled to the one or more memories and, when executing the instructions, are configured to perform the steps (Par. [0457] Some or all of the method steps may be executed by (or using) a hardware apparatus, like for example, a microprocessor (i.e. processor), a programmable computer or an electronic circuit. In some embodiments, one or more of the most important method steps may be executed by such an apparatus. Par. [0459] The invention can be implemented in hardware or in software. The implementation can be performed using a digital storage medium, for example a floppy disk, a DVD, a Blu-Ray, a CD, a ROM, a PROM, an EPROM, an EEPROM or a FLASH memory (i.e. memories), having electronically readable control signals stored thereon, which cooperate (or are capable of cooperating) with a programmable computer system (i.e. computer system) such that the respective method is performed. Therefore, the digital storage medium may be computer readable) Par. [0461] a computer program product with a program code (i.e. instructions), the program code being operative for performing one of the methods when the computer program product runs on a computer).

Conclusion
The prior art references made of record are not relied upon but are considered pertinent to applicant's disclosure. Chen et al. (US 2025/0055997 A1) teaches a method and an apparatus for video encoding and decoding with adaptive dependent quantization. 
Any inquiry concerning this communication should be directed to SUSAN E HODGES whose telephone number is (571)270-0498.  The Examiner can normally be reached on Monday - Friday from 8:00 am (EST) to 4:00 pm (EST).  
If attempts to reach the Examiner by telephone are unsuccessful, the Examiner's supervisor, Brian T. Pendleton, can be reached on (571) 272-7527. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of an application may be obtained from the Patent Application Information Retrieval (PAIR) system.  Status information for published applications may be obtained from either Private PAIR or Public PAIR.  Status information for unpublished applications is available through Private PAIR only.  For more information about the PAIR system, see http://portal.uspto.gov/external/portal. Should you have questions on access to the Private PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free).

/Susan E. Hodges/Primary Examiner, Art Unit 2425
Read full office action
Prosecution Timeline

Feb 20, 2025
Application Filed
Mar 20, 2026
Non-Final Rejection — §102 (current)
Precedent Cases

Applications granted by this same examiner with similar technology

18/862,647
Patent 12603982
STEREOSCOPIC HIGH DYNAMIC RANGE VIDEO
2y 5m to grant Granted Apr 14, 2026
18/923,643
Patent 12604008
ADAPTIVE CLIPPING IN MODELS PARAMETERS DERIVATIONS METHODS FOR VIDEO COMPRESSION
2y 5m to grant Granted Apr 14, 2026
18/716,344
Patent 12574558
Method and Apparatus for Sign Coding of Transform Coefficients in Video Coding System
2y 5m to grant Granted Mar 10, 2026
18/208,163
Patent 12568212
ADAPTIVE LOOP FILTERING ON OUTPUT(S) FROM OFFLINE FIXED FILTERING
2y 5m to grant Granted Mar 03, 2026
18/906,582
Patent 12556671
THREE DIMENSIONAL STROBO-STEREOSCOPIC IMAGING SYSTEMS AND ASSOCIATED METHODS
2y 5m to grant Granted Feb 17, 2026
Study what changed to get past this examiner. Based on 5 most recent grants.
AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.
Prosecution Projections

1-2
Expected OA Rounds
67%
Grant Probability
81%
With Interview (+14.4%)
2y 4m
Median Time to Grant
Low
PTA Risk
Based on 375 resolved cases by this examiner. Grant probability derived from career allow rate.