Last updated: May 29, 2026

Application No. 18/646,431

ACTIVE VOICE LIVENESS DETECTION SYSTEM

Final Rejection §103

Filed

Apr 25, 2024

Priority

Apr 28, 2023 — provisional 63/462,913 +1 more

Examiner

LELAND III, EDWIN S

Art Unit

2654

Tech Center

2600 — Communications

Assignee

Pindrop Security Inc.

OA Round

2 (Final)

Interview Optional

— -0.6% interview lift. Interview already conducted in this application's prosecution history. This examiner has a 75% grant rate with -0.6% interview lift. Since an interview has already been tried, recommend written response with narrowed claims based on precedent claim evolution patterns.

Based on 456 resolved cases, 2023–2026

Examiner Intelligence

LELAND III, EDWIN S View full profile →

Grants 75% — above average

Career Allowance Rate

342 granted / 456 resolved

+13.0% vs TC avg

Minimal -1% lift

Without

With

+-0.6%

Interview Lift

resolved cases with interview

Typical timeline

2y 5m

Avg Prosecution

14 currently pending

Career history

471

Total Applications

across all art units

Statute-Specific Performance

§101

11.1%

-28.9% vs TC avg

§103

67.9%

+27.9% vs TC avg

§102

10.9%

-29.1% vs TC avg

§112

5.6%

-34.4% vs TC avg

Black line = Tech Center average estimate • Based on career data from 456 resolved cases

Office Action

§103

DETAILED ACTION
Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Information Disclosure Statement
The information disclosure statements (IDS) submitted on 10/9/2024, 11/7/2024, 7/8/2025, 7/30/2025, 8/13/2025, 8/26/2025 and 10/27/2025 are in compliance with the provisions of 37 CFR 1.97.  Accordingly, the information disclosure statements are being considered by the examiner.

Status of Claims
Claims 1-20 are pending in this application.

Claim Rejections - 35 USC § 103
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA  to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 1, 5, 7-8, 12 and 14-15 are rejected under 35 U.S.C. 103 as being unpatentable over Chen et al. (U.S. Patent Application Publication 2021/0233541).
As per claim 8, Chen et al. discloses:
A system for detecting fraudulent speech in media data comprising: 
a computer comprising at least one processor (Paragraph [0096]) configured to: 
receive media data including a speech signal in an audio signal of the media data (Figure 7, item 712 and Paragraphs [0096-0097] – the audio signal including speech is received); 
determine a plurality of sections of the media data according to a section boundary (Figure 7, item 703 and Paragraph [0097] – the audio signal is parsed into segments); 
for each section of the media data, extract a segment fakeprint for the section using a plurality of section features extracted using a speech portion of the speech signal occurring in the section (Figure 7, item 704 and paragraphs [0099-0101] – the inbound spoofprint is the fakeprint for the section of the inbound signal); 
generate a section liveness score for the segment based upon a distance between the section fakeprint and a classification threshold value (Figure 7, items 716 & 718 and Paragraphs [0102-0103] – the distance score is the classification threshold value and the verification score is the liveness score); and 
identify the portion of the speech signal in the section as genuine or fraudulent based upon comparing the segment liveness score for the section against a fraud detection threshold (Paragraph [0103] – the distance score or other outputted values are compared to a threshold and the segment is determined to be a spoof or not based on the comparison).
Chen et al. fails to explicitly disclose that the segmenting boundaries during operation are preconfigured, but pre-processing techniques of application 17/079,082 (now U.S. Patent 12,087,319) are incorporated by reference (see paragraph [0028]) and in column 7, lines 7-22 of the patent it discloses a pre-processing technique that divides the signal into fixed length (i.e. preconfigured) frames (segments). It would be obvious for a person having ordinary skill in the art at the effective filing date of the invention to modify the method, system and computer readable medium of Chen et al. with the fixed length signal preprocessing technique of the incorporated patent because it is a case of simple substitution of one known element (Unspecified segment lengths in Chen et al.) for another (fixed length segments in the incorporated patent) to obtain predictable results. Since the technique is incorporated into Chen et al. by reference, the results are necessarily predictable.
Claim 1 is directed to the method of using the system of claim 8, so is rejected for similar reasons.
Claim 15 is directed to a non-transitory computer readable media containing instructions to cause a processor to act as the system of claim 8, so is rejected for similar reasons.

As per claims 5 and 12, Chen et al. discloses or suggests all of the limitations of claims 1 and 8 above. Chen et al. further discloses:
the segmenting boundary is based upon a uniform time interval. (column 7, lines 7-22 of incorporated patent)

As per claims 7 and 14, Chen et al. discloses or suggests all of the limitations of claims 1 and 8 above. Chen et al. further discloses:
in response to identifying the speech portion of the speech signal in the segment as fraudulent, generating, by the computer, an alert notification indicating a timestamp of the segment in the media data (Paragraph [0103] – a human readable indicator (e.g. plain language, visual display) indicates that the audio signal is a spoof attempt or the server may trigger one or more downstream applications). 
Chen et al. fails to explicitly disclose that the alert includes a timestamp of the segment.  It would have been obvious to one having ordinary skill in the art at the time the invention was made to include a timestamp, since it has been held to be within the general skill of a worker in the art to select a known material (information) on the basis of its suitability for the intended use as a matter of obvious design choice.  In re Leshin, 125 USPQ 416.

Claims 3-4, 6, 10-11 and 13 are rejected under 35 U.S.C. 103 as being unpatentable over Chen et al. (U.S. Patent Application Publication 2021/0233541) in view Basye et al. (U.S. Patent Application Publication 2020/0194032)
As per claims 3 and 10, Chen et al. discloses or suggests all of the limitations in claims 1 and 8 above. Chen et al. fails to disclose but Basye et al. in the same field of endeavor teaches:
determining, by the computer, the speech portion of the speech signal in the inbound audio signal occurring in the segment, wherein the computer ignores a non-speech portion of the input audio signal in the segment (Paragraph [0016] – ASR and beamforming allow non-speech audio inputs to be ignored).
It would be obvious for a person having ordinary skill in the art to modify the system, method and computer readable media of Chen et al. with the speech isolation capabilities of Basye et al. because improves speech processing accuracy (see paragraph [0010] of Basye et al.)

As per claims 4 and 11, the combination Chen et al. and Basye et al. discloses or suggests all of the limitations in claims 3 and 10 above. Basye et al. in the combination further discloses:
executing, by the computer, at least one of voice activity detection (VAD), an automatic speech recognition (ASR), or speaker diarization, to determine the speech portion of the speech signal in the inbound audio signal in the segment (Paragraph [0016] – ASR and beamforming allow non-speech audio inputs to be ignored and speech portions processed).

As per claims 6 and 13, Chen et al. discloses or suggests all of the limitations in claims 1 and 18 above. Chen et al. fails to disclose but Basye et al. in the same field of endeavor teaches:
the segmenting boundary is based on detecting a triggering condition associated with the speech signal (Paragraph [0032] – the segment boundaries are the beginning or end of speech).
It would be obvious for a person having ordinary skill in the art to modify the system, method and computer readable media of Chen et al. with the endpointing capabilities of Basye et al. because improves speech processing accuracy (see paragraphs [0010-0011] of Basye et al.)

Allowable Subject Matter
Claims 2, 9 and 16-20 are objected to as being dependent upon a rejected base claim, but would be allowable if rewritten in independent form including all of the limitations of the base claim and any intervening claims.



Response to Arguments
Applicant’s arguments, see page 7, filed 2/12/2026, with respect to the rejection of claims 1-20 under 35 U.S.C. 112 have been fully considered and are persuasive.  The ejection of claims 1-20 under 35 U.S.C. 112 has been withdrawn. 
Applicant’s arguments, see pages 7-8, filed 2/12/2026, with respect to the rejections of claims 1-20 under 35 U.S.C. 103 have been fully considered and are not persuasive. Specifically, the change from segment to section does not change the interpretation of the claims. Section can be segment or frame, and Chen processes the signal frame by frame. However, the limitations in dependent claims 2, 9 and 16 preclude that interpretation and would be allowable if incorporated into the independent claims. 

Examiner Notes
The Examiner cites particular columns and line numbers in the references as applied to the claims above for the convenience of the Applicant.  Although the specified citations are representative of the teachings in the art and are applied to the specific limitations within the individual claim, other passages and figures may apply as well.  It is respectfully requested that, in preparing responses, the Applicant fully considers the references in its entirety as potentially teaching all or part of the claimed invention, as well as the context of the passage as taught by the prior art or as disclosed by the Examiner. 
Communications via Internet e-mail are at the discretion of the applicant and require written authorization. Should the Applicant wish to communicate via e-mail, including the following paragraph in their response will allow the Examiner to do so:
“Recognizing that Internet communications are not secure, I hereby authorize the USPTO to communicate with me concerning any subject matter of this application by electronic mail. I understand that a copy of these communications will be made of record in the application file.”
Should e-mail communication be desired, the Examiner can be reached at Edwin.Leland@USPTO.gov

Conclusion
THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.
Any inquiry concerning this communication or earlier communications from the examiner should be directed to EDWIN S LELAND III whose telephone number is (571)270-5678. The examiner can normally be reached 8:00 - 5:00 M-F.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Hai Phan can be reached at 571-272-6338. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/EDWIN S LELAND III/Primary Examiner, Art Unit 2654

Read full office action

Prosecution Timeline

Apr 25, 2024

Application Filed

Nov 12, 2025

Non-Final Rejection mailed — §103

Feb 12, 2026

Applicant Interview (Telephonic)

Feb 12, 2026

Response Filed

Feb 12, 2026

Examiner Interview Summary

May 05, 2026

Final Rejection mailed — §103 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

18/315,789

Patent 12608548

METHODS AND SYSTEMS FOR PARSING A MIX OF FEATURES AND INSTRUCTIONS INTO A PROMPT

2y 11m to grant Granted Apr 21, 2026

18/654,795

Patent 12596869

DETECTING ARTIFICIAL INTELLIGENCE GENERATED TEXT

1y 11m to grant Granted Apr 07, 2026

17/936,873

Patent 12591602

TRAINING MACHINE LEARNING BASED NATURAL LANGUAGE PROCESSING FOR SPECIALTY JARGON

3y 6m to grant Granted Mar 31, 2026

17/993,063

Patent 12579370

MULTILINGUAL CHATBOT

3y 3m to grant Granted Mar 17, 2026

18/602,835

Patent 12579986

Systems and Methods for Distinguishing Between Human Speech and Machine Generated Speech

2y 0m to grant Granted Mar 17, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

3-4

Expected OA Rounds

75%

Grant Probability

74%

With Interview (-0.6%)

2y 5m (~4m remaining)

Median Time to Grant

Moderate

PTA Risk

Based on 456 resolved cases by this examiner. Grant probability derived from career allowance rate.

ACTIVE VOICE LIVENESS DETECTION SYSTEM

Interview Optional

Examiner Intelligence

Statute-Specific Performance

Office Action

Prosecution Timeline

Precedent Cases

Applications granted by this same examiner with similar technology

Strategy Recommendation AI-generated — please review before filing

Prosecution Projections

Ready to respond to this office action?

Sign in with your work email