Last updated: May 29, 2026

Application No. 18/680,594

AUTHENTIFICATION DEVICE AND AUTHENTIFICATION METHOD

Non-Final OA §101§102§103

Filed

May 31, 2024

Priority

Dec 03, 2021 — JP 2021-197225 +1 more

Examiner

HANG, VU B

Art Unit

2654

Tech Center

2600 — Communications

Assignee

Panasonic Intellectual Property Management Co., Ltd.

OA Round

1 (Non-Final)

Interview Optional

— +17.4% interview lift. Examiner has a relatively high allowance rate (75%); +17.4% interview lift. A written response may suffice.

Based on 625 resolved cases, 2023–2026

Examiner Intelligence

HANG, VU B View full profile →

Grants 75% — above average

Career Allowance Rate

467 granted / 625 resolved

+12.7% vs TC avg

Strong +17% interview lift

Without

With

+17.4%

Interview Lift

resolved cases with interview

Typical timeline

3y 1m

Avg Prosecution

8 currently pending

Career history

640

Total Applications

across all art units

Statute-Specific Performance

§101

3.3%

-36.7% vs TC avg

§103

79.9%

+39.9% vs TC avg

§102

5.6%

-34.4% vs TC avg

§112

3.9%

-36.1% vs TC avg

Black line = Tech Center average estimate • Based on career data from 625 resolved cases

Office Action

§101 §102 §103

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .
Claims 1-9 are pending.

Claim Rejections - 35 USC § 101
35 U.S.C. 101 reads as follows:
Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and requirements of this title.

Claims 1, 8 and 9 are rejected under 35 U.S.C. 101.
Regarding Claim 1, the claimed invention is directed to an abstract idea without significantly more. The claim recites “a device comprising: an acquisition unit configured to acquire and detect a voice signal of an utterance voice of a speaker; an authentication unit configured to authenticate whether the speaker is the person himself/herself based on collation between the voice signal detected by the acquisition unit and a database; a storage unit configured to store, as question example sentences, a plurality of questions for acquiring a voice signal used for authentication of the speaker by the authentication unit; a display interface configured to display the question example sentence for the speaker on a terminal device; and an example sentence selection unit configured to select a question example sentence to be displayed on the terminal device from the plurality of question example sentences stored in the storage unit.”
The limitations of the claim, as drafted, is a process that, under its broadest reasonable interpretation, covers performance of the limitation in the mind but for the recitation of generic computer components. That is, other than reciting a “device”, nothing in the claim element precludes the step from practically being performed in the mind. Each of the limitation in the claim can be performed in the human mind including observation, evaluation and judgement. For example, the limitation to “acquire and detect a voice signal of an utterance voice of a speaker” can be done by a person listening to another person during a conversation. The limitation to “authenticate whether the speaker is the person himself/herself based on collation between the voice signal detected by the acquisition unit and a database” can be completed by listening to the spoken words from an old friend and determine that it is the particular friend from the past (i.e. collation between the detected voice signal and a database). The limitation to “to store, as question example sentences, a plurality of questions for acquiring a voice signal used for authentication of the speaker” can be completed by preparing a list of questions to ask the old friend to verify that he or she is that particular friend from the past. The limitation to “display the question example sentence for the speaker on the terminal device” can be completed by typing the question on a computer device to display or writing the question on paper. The limitation to “select a question example sentence to be displayed on the terminal device from the plurality of question example sentences stored” can be completed by deciding which of the prepared list of questions to ask the friend. Therefore, if the limitations of the claim, under their broadest reasonable interpretation, cover concepts performed in the human mind but for the recitation of generic computer components, then they fall within the “Mental Process” grouping of abstract ideas. Accordingly, the claim recites an abstract idea.
This judicial exception is not integrated into a practical application. In particular, the claim only recites one additional element – using a device to perform the processing steps. The device is recited at a high-level of generality such that it amounts no more than mere instructions to apply the exception using a generic computer component. Accordingly, this additional element does not integrate the abstract idea into a practical application because it does not impose any meaningful limits on practicing the abstract idea. Therefore, the claim is directed to an abstract idea.
The claim does not include additional elements that are sufficient to amount to significantly more than the judicial exception. As discussed above with respect to the integration of the abstract idea into a practical application, the additional element of using a device to perform the processing steps amounts to no more than mere instructions to apply the exception using a generic computer component. Mere instructions to apply an exception using a generic computer component cannot provide an inventive concept. Therefore, claim is not patent eligible.
Regarding Claims 8 and 9, the rationale provided for the rejection of Claim 1 is incorporated herein.

Claim Rejections - 35 USC § 102
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the basis for the rejections under this section made in this Office action:
A person shall be entitled to a patent unless –

(a)(1) the claimed invention was patented, described in a printed publication, or in public use, on sale, or otherwise available to the public before the effective filing date of the claimed invention.


Claims 1, 8 and 9 are rejected under 35 U.S.C. 102(a)(1) as being anticipated by Takashima et al. (US Pub. 2022/0139402 A1).
Regarding Claims 1 and 9, Takashima teaches an authentication device (see Fig.1 (10) and paragraph [0029]), comprising:
an acquisition unit configured to acquire and detect a voice signal of an utterance voice of a speaker (see Fig.1 (14) and paragraph [0029]);
an authentication unit configured to authenticate whether the speaker is the person himself/herself based on collation between the voice signal detected by the acquisition unit and a database (see Fig.1 (12,15), Fig.2 (1B) and paragraph [0044], comparing voice features from the speaker to the features registered in the authentication database);
a storage unit configured to store, as question example sentences, a plurality of questions for acquiring a voice signal used for authentication of the speaker by the authentication unit (see Fig.1 (12), Fig.2 (2B) and paragraph [0035]);
a display interface configured to display the question example sentence for the speaker on a terminal device (see Fig.1 (13) and paragraph [0038], the output unit outputs instructions to display the questions on a display device);
and an example sentence selection unit configured to select a question example sentence to be displayed on the terminal device from the plurality of question example sentences stored in the storage unit (see Fig.1 (11,12,13), Fig.2 (2B) and paragraphs [0052-0055], the output unit selects and submits a question based on the acquires user input data).
Regarding Claim 8, Takashima further teaches wherein the question example sentence is a question sentence that makes the speaker answer at least one of a telephone number, a date of birth, and personal information of the speaker (see Fig.3 and paragraphs [0036-0037]).

Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 2-5 are rejected under 35 U.S.C. 103 as being unpatentable over Takashima et al. (US Pub. 2022/0139402 A1) in view of Ortiz et al. (US Pub. 2021/0173916 A1).
Regarding Claim 2, Takashima teaches the device of Claim 1 but fails to teach a phoneme analysis unit configured to extract phonemes contained in the voice signal of the speaker detected by the acquisition unit, wherein the example sentence selection unit selects a question example sentence that prompts an utterance including a phoneme not included in the voice signal, based on the phonemes extracted by the phoneme analysis unit.
Ortiz, however, teaches a processing unit configured to extract audio features for user authentication (see Fig.1 (101,109,113) paragraph [0161] and paragraph [0165], the stored machine learning model extracts the audio features from the spoken pass phrases), compare the phonemes of a user speaking a pass phrase to the stored reference phonemes (see Fig.1 (101,115) and paragraphs [0038-0039]) and present a specific question to a user when there is an overlap in the phonemes or phoneme transitions during phoneme comparison process (see Fig.1 (101,115) and paragraphs [0115-0116]).
It would have been obvious for one skilled in the art, before the effective filing date of the application, to include to Takashima’s authentication device a phoneme analysis unit configured to extract phonemes contained in the voice signal of the speaker detected by the acquisition unit, wherein the example sentence selection unit selects a question example sentence that prompts an utterance including a phoneme not included in the voice signal. The motivation would be to perform audio feature extraction from the user voice signal for comparison with the features stored in the user authentication database, and to present a specific question to the user in which the answer to the question includes phonemes that are not present in the voice signal.
Regarding Claim 3, Takashima further teaches wherein a display interface displays, on the terminal device, the question example sentence selected by the example sentence selection unit (see Fig.1 (13,23) and paragraph [0038]).
Regarding Claim 4, Takashima teaches wherein the display interface displays, on the terminal device, the question example sentence selected by the example sentence selection unit immediately after the start of authentication (see Fig.1 (11,21), paragraph [0033] and paragraph [0038], the user authentication starts after the user inputs identification information), but fails to teach wherein the authentication unit calculates a similarity between a voice signal of an utterance voice that the speaker answers to the question example sentence and a voice signal registered in the database, and authenticates the speaker when the similarity is equal to or larger than a predetermined threshold value.
Ortiz, however, teaches determining the confidence level of a user speaking a pass phrase by comparing the phonemes or speech characteristics with the stored reference phonemes and speech characteristics (see Fig.1 (101,115), paragraph [0041] and paragraph [0045]), and authenticating user if the determined confidence level is above a predetermined threshold value (see Fig.1 (101,115), paragraph [0041] and paragraph [0045]).
It would have been obvious for one skilled in the art, before the effective filing date of the application, to configure Takashima’s authentication device to calculate a similarity between a voice signal of an utterance voice that the speaker answers to the question example sentence and a voice signal registered in the database, and to authenticate the speaker when the similarity is equal to or larger than a predetermined threshold value. The motivation would be to set a confidence level criteria for authenticating a user.
Regarding Claim 5, Takashima teaches wherein the display interface displays, on the terminal device, the question example sentence selected by the example sentence selection unit when a determination result by the authentication unit is acquired (see Fig.1 (11,12,13), Fig.2B, paragraph [0038] and paragraphs [0052-0055]), but fails to teach wherein the authentication unit calculates a similarity between the voice signal acquired after the start of authentication, and determines whether display of the question example sentence is necessary when the similarity is equal to or less than a predetermined threshold value.
Ortiz, however, teaches determining the confidence level of a user speaking a pass phrase by comparing the phonemes or speech characteristics with the stored reference phonemes and speech characteristics (see Fig.1 (101,115), paragraph [0041] and paragraph [0045]), and authenticating user if the determined confidence level is above a predetermined threshold value (see Fig.1 (101,115), paragraph [0041] and paragraph [0045]).
It would have been obvious for one skilled in the art, before the effective filing date of the application, to configure Takashima’s authentication device to unit calculate a similarity between the voice signal acquired after the start of authentication, and to determine whether to display the question example sentence when the similarity is equal to or less than a predetermined threshold value. The motivation would be to display a second example question to the user when the determined confidence level of the user authentication process is equal to or less than a predetermined threshold value.
Claim 7 is rejected under 35 U.S.C. 103 as being unpatentable over Takashima et al. (US Pub. 2022/0139402 A1) in view of Park et al. (US Patent 10,566,011 B2).
Regarding Claim 7, Takashima teaches the authentication device of Claim 1 but fail to teach wherein the authentication device is configured to start authentication on the voice signal when a start button displayed on a terminal device is pressed.
Park, however, teaches a manually starting a speaker authentication process from a terminal device by pressing a start button (see Col.1, Line 31-34).
It would have been obvious for one skilled in the art, before the effective filing date of the application, to configure Takashima’s authentication device to start authentication on the voice signal when a start button displayed on a terminal device is pressed. The motivation would be to manually begin the speaker authentication process by pressing a button on a terminal device.
Claim 6 is rejected under 35 U.S.C. 103 as being unpatentable over Takashima et al. (US Pub. 2022/0139402 A1) in view of Ortiz et al. (US Pub. 2021/0173916 A1), and in further view of Huang et al. (US Pub. 2011/0184723 A1).
Regarding Claim 6, Takashima teaches an example sentence selection unit configured to select a question example sentence to be displayed on the terminal device from the plurality of question example sentences stored in the storage unit (see Fig.1 (11,12,13), Fig.2 (2B) and paragraphs [0052-0055], the output unit selects and submits a question based on the acquires user input data), and Ortiz teaches extracting audio features for user authentication (see Fig.1 (101,109,113) paragraph [0161] and paragraph [0165]) and comparing the phonemes of a user speaking a pass phrase to the stored reference phonemes (see Fig.1 (101,115) and paragraphs [0038-0039]), but Takashima and Ortiz fail to teach wherein the phoneme analysis unit calculates the number of extracted phonemes, and the example sentence selection unit selects a question example sentence including phonemes whose number is less than a predetermined threshold value.
Huang, however, teaches determining the number of phonemes in a query phoneme sequence and providing a length constraint for the query phoneme sequence to have less than a predetermined number of phonemes.
It would have been obvious for one skilled in the art, before the effective filing date of the application, to configure the authentication device to calculate the number of extracted phonemes, and select a question example sentence including phonemes whose number is less than a predetermined threshold value. The motivation would be to provide a constraint for the length or size of the example question to be displayed to the user.

Conclusion
Any inquiry concerning this communication or earlier communications from the examiner should be directed to VU B HANG whose telephone number is (571)272-0582.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Hai Phan, can be reached at (571)272-6338. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/VU B HANG/Primary Examiner, Art Unit 2654

Read full office action

Prosecution Timeline

May 31, 2024

Application Filed

Dec 22, 2025

Non-Final Rejection mailed — §101, §102, §103 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

18/257,403

Patent 12640136

METHOD FOR TRAINING MODEL, SPEECH RECOGNITION METHOD, APPARATUS, MEDIUM, AND DEVICE

2y 11m to grant Granted May 26, 2026

18/689,668

Patent 12626687

SPEECH RECOGNITION METHOD, APPARATUS AND DEVICE, AND STORAGE MEDIUM

2y 2m to grant Granted May 12, 2026

18/233,728

Patent 12614548

PRE-EMPTIVELY INITIALIZING AN AUTOMATED ASSISTANT ROUTINE AND/OR DISMISSING A SCHEDULED ALARM

2y 8m to grant Granted Apr 28, 2026

18/441,789

Patent 12609110

TECHNIQUES FOR UTTERANCE GROUPING AND FOR IMPROVED TRAINING OF MACHINE LEARNING MODELS USING GROUPED UTTERANCE DATA

2y 2m to grant Granted Apr 21, 2026

18/624,615

Patent 12603082

METHOD FOR TRAINING VOICE CONVERSION MODEL, ELECTRONIC DEVICE, AND STORAGE MEDIUM

2y 0m to grant Granted Apr 14, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

1-2

Expected OA Rounds

75%

Grant Probability

92%

With Interview (+17.4%)

3y 1m (~1y 1m remaining)

Median Time to Grant

Low

PTA Risk

Based on 625 resolved cases by this examiner. Grant probability derived from career allowance rate.