Last updated: April 19, 2026

Application No. 18/392,396

ARTIFICIAL INTELLIGENCE-BASED AUDIO DYNAMIC RANGE COMPRESSOR

Non-Final OA §102§103

Filed

Dec 21, 2023

Examiner

SAUNDERS JR, JOSEPH

Art Unit

2692

Tech Center

2600 — Communications

Assignee

Samsung Electronics Co., Ltd.

OA Round

1 (Non-Final)

Interview Optional

— +20.6% interview lift. This examiner has a relatively high allow rate; a written response may suffice.

Based on 740 resolved cases, 2023–2026

Examiner Intelligence

SAUNDERS JR, JOSEPH View full profile →

Grants 73% — above average

Career Allow Rate

538 granted / 740 resolved

+10.7% vs TC avg

Strong +21% interview lift

Without

With

+20.6%

Interview Lift

resolved cases with interview

Typical timeline

2y 9m

Avg Prosecution

27 currently pending

Career history

767

Total Applications

across all art units

Statute-Specific Performance

§101

5.1%

-34.9% vs TC avg

§103

40.0%

+0.0% vs TC avg

§102

29.6%

-10.4% vs TC avg

§112

14.6%

-25.4% vs TC avg

Black line = Tech Center average estimate • Based on career data from 740 resolved cases

Office Action

§102 §103

DETAILED ACTION

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA . This Office action is based on the communications filed December 21, 2023. Claims 1 – 20 are currently pending and considered below.

Information Disclosure Statement
The information disclosure statement (IDS) submitted on December 21, 2023 is in compliance with the provisions of 37 CFR 1.97.  Accordingly, the information disclosure statement is being considered by the examiner.

Claim Rejections - 35 USC § 102
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA  to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the basis for the rejections under this section made in this Office action:
A person shall be entitled to a patent unless –

(a)(1) the claimed invention was patented, described in a printed publication, or in public use, on sale, or otherwise available to the public before the effective filing date of the claimed invention.


(a)(2) the claimed invention was described in a patent issued under section 151, or in an application for patent published or deemed published under section 122(b), in which the patent or application, as the case may be, names another inventor and was effectively filed before the effective filing date of the claimed invention.

Claim(s) 1 – 5, 8 – 12, and 15 – 19 is/are rejected under 35 U.S.C. 102(a)(1) and 35 U.S.C. 102(a)(2) as being anticipated by Cassidy et al. (US 2018/0114536 A1), hereinafter Cassidy.

Claim 1: Cassidy discloses a computer-implemented method comprising: predicting, based on an artificial intelligence (AI) model (see at least, “Sensing circuitry 1110B can include a circuit to generate a statistical feature 1125-1, a circuit to generate statistical feature 1125-2 ... a circuit to generate statistical feature 1125-L, where each is coupled to distortion prediction computation 1127 that generates a predicted likelihood of distortion associated with the acoustic output from speaker 1130B. Distortion prediction computation 1127 can be realized as one or more computational circuits for
machine learning, statistical learning, predictive learning, or AI-based measure that can provide computation of the likelihood of distortion at loudspeaker output 1130B. Conventional circuits for such learning and AI techniques can be used. These circuits may include various types of memory
devices and one or more processors, DSPs, ASICs and other computational devices. The predicted likelihood of distortion can be output from sensing circuitry 1110B to a dynamic gain cell,” Cassidy [0065], see also Cassidy [0048] – [0050]), a look-ahead level to adjust one or more levels for an audio stream in real-time (see at least, “The preprocessing function can manage look-ahead buffers used for the protected gain applied to the signal in the next stage of the process,” Cassidy [0079], “Parameters of the notch filters such as notch frequencies can be configured offline with the dynamic control of the depths of the notches performed in real time. With respect to signal processing, by real time is meant completing some signal/data processing within a time that is sufficient to keep up with an external process, such as generating an acoustic output from serial processing of an input audio signal,” Cassidy [0039]); and providing level adjustments without user parametrization while improving sound quality for the audio stream (see at least, “Parameters of the notch filters such as notch frequencies can be configured offline with the dynamic control of the depths of the notches performed in real time,” Cassidy [0039], “Dynamic multi-feature distortion sensing and adaptive, multi-band distortion reduction can have application in the improvement of sound quality produced by a loudspeaker. Distortion-triggering spectral components of the signal for the audio output can be reduced or removed such that the number of components removed can be kept to a minimum,” Cassidy [0034], “Dynamic gain cell 315 can be coupled to receive the statistics from sensing circuitry 310 and coupled to multi-notch filter 305 to operatively modify depth of multinotch filter 305. Speaker 330 can be arranged to receive the filtered signal to generate the acoustic output. Dynamic gain cell 315 can be arranged to provide a gain to adjust the depth of the multi-notch filter based on a subsequent statistic based on the bandpass-energy-based statistic above and the total energy, where the bandpass-energy-based statistic and measured
total energy are provided by sensing circuitry 310,” Cassidy [0051]).

Claim 2: Cassidy discloses the method of claim 1, wherein adjusting the one or more levels for the audio stream controls a dynamic range of the audio stream, and produces zero-latency (see at least, “Parameters of the notch filters such as notch frequencies can be configured offline with the dynamic control of the depths of the notches performed in real time. With respect to signal processing, by real time is meant completing some signal/data processing within a time that is sufficient to keep up with an external process, such as generating an acoustic output from serial processing of an input audio signal,” Cassidy [0039]).

Claim 3: Cassidy discloses the method of claim 1, wherein at least one level of the one or more levels for the audio stream is determined based on mean energy per frame (see at least, “Because energy within a single band does not activate the distortion, the estimated amount of distortion activating energy can be calculated as an aggregate statistic such as the arithmetic or geometric mean of the energy across all bands,” Cassidy [0061], “Statistic 1126 can include one or more statistics from a group of statistics including a median, a mean, an arithmetic mean, a geometric mean, a maximum of the energy, a minimum of the energy, and energy of a nth largest energy among all band-pass energies computed with n being a selected positive integer,” Cassidy [0063], “sets of audio features computed from successive frames or blocks,” Cassidy [0050]).

Claim 4: Cassidy discloses the method of claim 1, wherein the audio stream is limited based on its energy on a per sample basis (see at least, “In order to compensate for the
increased harmonic energy,” Cassidy [0074], “Harmonics generation 1452 of the main process monitors the signal and computes how much headroom is available. This can be performed by finding the maximum values of both Land R channels on a block-per-block basis. These maxima are stored in a statistics history buffer of a selected length, and they can be averaged to give an average maximum per channel over a history of buffers of the selected length,” Cassidy [0077], “Lowpass filter 1453 serves to limit the highest frequency of harmonic content injected, and the generated harmonics can optionally be processed by additional filter, gain filter 1454, with low-pass gain characteristics. This can be performed in order to shape the generated harmonics, and give more emphasis to the lower harmonics,” Cassidy [0078]).

Claim 5: Cassidy discloses the method of claim 2, wherein the dynamic range of the audio stream is controlled independently in parallel frequency bands (see at least, “The bank of notch or band-reject filters can be arranged in cascade or in parallel,” Cassidy [0033], “Multi-notch filters such as multi-notch filter 305 of FIG. 4 and multi-notch filter 405 of FIG. 4 can be implemented as a series of independent IIR notch filter bands. Each notch band can be implemented using a linear combination of the signal and an all-pass filter set to the cutoff frequency and Q of the band. Using this method, the filter coefficients can be determined statically at initialization using the cutoff frequency and bandwidth parameters, and the time-varying depth of the notch can be adjusted efficiently such that the notch depth may move up and down in real-time as shown by the bidirectional arrow in FIG. 10,” Cassidy [0059]).

Claims 8 – 12 are directed to a non-transitory processor-readable medium that includes a program that when executed by a processor performs the method substantially similar in scope to claims 1 – 5, respectively, and therefore are rejected for the same reasons (see also at least, “According to various embodiments, there is provided a first example non-transitory computer-readable medium that stores computer instructions, that when executed by one or more processors, cause a system to perform the operations of: receiving a signal from an audio signal source at a speaker; receiving the signal at sensing
circuitry; generating statistics on the signal in the sensing circuitry; modifying the signal based on the generated statistics and directing the modified signal to the speaker; and generating, from the speaker, an acoustic output based on the received modified signal,” Cassidy [0144], “Optionally, in accordance with the preceding first example non-transitory computer-readable medium that stores computer instructions, other implementations can provide operations associated with executing any of the
preceding examples in accordance with the first example method and/or first example system,” Cassidy [0145]).

Claims 15 – 19 are directed to an apparatus comprising: a memory storing instructions; and at least one processor executes the instructions including a process configured to perform the method substantially similar in scope to claims 1 – 5, respectively, and therefore are rejected for the same reasons (see also at least, “The present invention relates generally to apparatus and methods of processing of audio signals,” Cassidy [0002], “The process actions of a method, process, or algorithm described in connection with the embodiments disclosed herein can be embodied directly in hardware, in a software module executed by a processor, or in any combination of the two. The software module can be contained in computer-readable media that can be accessed by a computing device. The computer-readable media can include both volatile and nonvolatile media, which are either removable, non-removable, or some combination thereof. The computer-readable media is used to store information such as computer-readable or computer-executable instructions, data structures, program modules, or other data. By way of example, and not limitation, computer readable media may comprise computer storage media and communication media,” Cassidy [0243]).

Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claim(s) 6 – 7, 13 – 14, and 20 is/are rejected under 35 U.S.C. 103 as being unpatentable over Cassidy in view of Yoshino et al. (JP 2014175838 A), with English citations provided from the corresponding machine translation, hereinafter Yoshino.

Claim 6: Cassidy discloses the method of claim 1, but does not disclose further comprising: determining an optimal frame size for the audio stream based on signal analysis applied to the audio stream. However, Cassidy is concerned with finding the “most optimal” parameters (see at least, Cassidy [0056]). Yoshino discloses in regards to a similar “dynamic range compression unit that adjusts the signal level to a fixed level when the signal level of the audio signal amplified by the audio signal processing unit exceeds the second threshold,” Yoshino [0015] further determines an optimal frame size for the audio stream based on signal analysis applied to the audio stream (see at least, “Moreover, FIG. 5 is a graph which shows the time transition of an input signal and gain. In the figure, the X axis indicates time, the Y axis indicates gain, and the Z axis indicates frequency. As shown in the figure, the audio signal processing unit 12 of the present embodiment fluctuates the gain (thick line in the figure) according to the change of the input signal (dotted line in the figure). That is, control can be performed so as to obtain a large gain only when the signal level of the low frequency band component of the input signal exceeds the threshold. For this reason, amplification can be performed with a large gain only at a moment that is necessary in time (for example, when “x⟩ γ”, such as a moment when a bass drum sound is emitted). That is, it is possible to amplify only a large low frequency band component such as a bass drum while preventing an adverse effect of unnecessarily amplifying an unintended low frequency band component and giving an excessive bass feeling. As a result, it becomes possible to operate so as to select and amplify bass drum sounds temporally,” Yoshino [0035], “Further, in the above embodiment, the threshold “γ” for switching amplification processing and the threshold “β” for adjusting the dynamic range have been described as fixed values, but may be variable values. For example, the music (sound signal) input from the sound source 2 may be analyzed, and these thresholds may be varied based on the analysis result. As an analysis result, for example, an average level of music, genre, tempo (BPM (beats per minute)) or the like can be considered,” Yoshino [0050], “Further, the selection of amplification processing is not limited to every unit time (several tens of msec to several hundreds of msec), and may be performed in beat units of the audio signal input from the sound source 2, in bar units of music, or in music units. In these cases, amplification processing may be selected according to
the average signal level, the maximum signal level, and the minimum signal level in each period,” Yoshino [0052]). It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to incorporate the aforementioned features of Yoshino in the invention of Cassidy thereby allowing for the advantage that the “amplification can be performed with a large gain only at a moment that is necessary in time (for example, when “x⟩ γ”,” while “preventing an adverse effect of unnecessarily amplifying an unintended low frequency band component and giving an excessive bass feeling,” Yoshino [0035].

Claim 7: Cassidy and Yoshino disclose the method of claim 6, wherein the signal analysis includes determining a beat rate for the audio stream (see at least, “Further, in the above embodiment, the threshold “γ” for switching amplification processing and the threshold “β” for adjusting the dynamic range have been described as fixed values, but may be variable values. For example, the music (sound signal) input from the sound source 2 may be analyzed, and these thresholds may be varied based on the analysis result. As an analysis result, for example, an average level of music, genre, tempo (BPM (beats per minute)) or the like can be considered. Further, instead of performing the music analysis by the audio device 1, the music analysis result may be acquired from the outside,” Yoshino [0050]).

Claims 13 – 14 are substantially similar in scope to claims 6 – 7, respectively, and therefore are rejected for the same reasons.

Claim 20 is substantially similar in scope to claim 7 respectfully and therefore is rejected for the same reasons. 

Conclusion
Any inquiry concerning this communication or earlier communications from the examiner should be directed to JOSEPH SAUNDERS whose telephone number is (571)270-1063. The examiner can normally be reached Monday-Thursday, 9:00 a.m. - 4 p.m., EST.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Carolyn R Edwards can be reached at (571)270-7136. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/JOSEPH SAUNDERS JR/Primary Examiner, Art Unit 2692                                                                                                                                                                                                        
/CAROLYN R EDWARDS/Supervisory Patent Examiner, Art Unit 2692

Read full office action

Prosecution Timeline

Dec 21, 2023

Application Filed

Dec 13, 2025

Non-Final Rejection — §102, §103

Feb 01, 2026

Interview Requested

Feb 11, 2026

Applicant Interview (Telephonic)

Feb 11, 2026

Examiner Interview Summary

Precedent Cases

Applications granted by this same examiner with similar technology

17/976,804

Patent 12596883

Audio Analysis for Text Generation

2y 5m to grant Granted Apr 07, 2026

18/284,588

Patent 12598420

AUDIO DEVICE WITH ELECTROSTATIC DISCHARGE PROTECTION

2y 5m to grant Granted Apr 07, 2026

18/424,655

Patent 12593190

User Experience Localizing Binaural Sound During a Telephone Call

2y 5m to grant Granted Mar 31, 2026

18/340,728

Patent 12585425

Light-function audio parameters

2y 5m to grant Granted Mar 24, 2026

18/465,496

Patent 12585422

DATA PROCESSING METHOD OF PROCESSING MULTITRACK AUDIO DATA AND DATA PROCESSING APPARATUS

2y 5m to grant Granted Mar 24, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.

Prosecution Projections

1-2

Expected OA Rounds

73%

Grant Probability

93%

With Interview (+20.6%)

2y 9m

Median Time to Grant

Low

PTA Risk

Based on 740 resolved cases by this examiner. Grant probability derived from career allow rate.