Last updated: April 18, 2026

Application No. 18/294,010

MASKING APPARATUS, MASKING METHOD, AND PROGRAM

Non-Final OA §103

Filed

Jan 31, 2024

Examiner

YU, NORMAN

Art Unit

2693

Tech Center

2600 — Communications

Assignee

NTT, Inc.

OA Round

1 (Non-Final)

Interview Optional

+13.5% interview lift. This examiner has a relatively high allow rate; a written response may suffice.

Based on 598 resolved cases, 2023–2026

Examiner Intelligence

YU, NORMAN

Grants 88% — above average

Career Allow Rate

525 granted / 598 resolved

+25.8% vs TC avg

Interview Lift: +13.5% (moderate), comparing resolved cases with and without an interview

Fast prosecutor

2y 1m

Avg Prosecution

35 currently pending

Career history

633

Total Applications

across all art units

Statute-Specific Performance

§101: 2.2% (-37.8% vs TC avg)

§103: 51.8% (+11.8% vs TC avg)

§102: 17.2% (-22.8% vs TC avg)

§112: 16.8% (-23.2% vs TC avg)

The Tech Center average is an estimate. Based on career data from 598 resolved cases.

Office Action

§103

Notice of Pre-AIA or AIA Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA.

Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claim(s) 1, 3, and 6-9 is/are rejected under 35 U.S.C. 103 as being unpatentable over Satoyoshi (US 2013/0170655) in view of Benway (US 2018/0151168).

Regarding claim 1, Satoyoshi teaches a masking device comprising: a spoken voice volume evaluation circuitry configured to generate an evaluation value for a volume of a spoken voice (hereinafter referred to as a spoken voice volume evaluation value) from a spoken voice signal by using, as the spoken voice signal, a sound collection signal output by a microphone installed for collecting the spoken voice which is a voice of a speaking person (Satoyoshi ¶0049, “a masking sound is in a mode where the volume is changed in accordance with the level of the picked up speaker voice. In the case where the level of the picked up speaker voice is low… In the case where the level of the picked up speaker voice is high”); a masking sound signal generation circuitry configured to generate a signal for emitting a masking sound from a speaker (hereinafter referred to as a masking sound signal) (Satoyoshi figure 2, masking sound producing section 73) corresponding to the spoken voice volume evaluation value (Satoyoshi ¶0049, “Therefore, also the level of the masking sound can be lowered…masking sound is set to high”), the masking sound preventing the spoken voice from being heard by surrounding persons other than the speaking person (Satoyoshi ¶0033, “This causes the third persons H3 to hear the voice of the speaker H1 and the masking sound from the same position, and the cocktail party effect is adequately suppressed”); however, Satoyoshi does not explicitly teach a masking video signal generation circuitry configured to generate a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device.

Benway teaches a masking video signal generation circuitry configured to generate a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device (Benway ¶0028, “masking application 6 includes or interfaces with a digital audio player and a digital video player at computing device 4. Noise masking application 6 outputs (i.e., plays) the selected noise masking sound audio file at loudspeaker(s) 14 and outputs (i.e., plays) the selected video file at video display 16. Although only a single video display 16 is shown, multiple displays may be utilized to output the selected video file”).

Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to use the known technique of Benway to improve the known masking device of Satoyoshi to achieve the predictable result of increasing the user’s psychological comfort with an increased noise masking level (Benway ¶0024).
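The level-dependent behavior quoted from Satoyoshi ¶0049 (a lower masking level for a quiet voice, a higher one for a loud voice) can be illustrated with a minimal Python sketch. Nothing below is taken from either reference: the RMS level in dB as the evaluation value, the linear mapping, and every name and threshold (voice_level_db, masking_gain, the -50 dB and -10 dB limits) are assumptions chosen for illustration only.

```python
import math

def voice_level_db(samples, floor_db=-120.0):
    """Evaluation value (assumed metric): RMS level of the voice frame in dB re full scale."""
    if not samples:
        return floor_db
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    if rms <= 0.0:
        return floor_db
    return max(floor_db, 20.0 * math.log10(rms))

def masking_gain(level_db, low_db=-50.0, high_db=-10.0, min_gain=0.1, max_gain=1.0):
    """Map the evaluation value to a gain: quiet voice gives a low gain, loud voice a high gain."""
    t = (level_db - low_db) / (high_db - low_db)
    t = min(1.0, max(0.0, t))  # clamp to the assumed operating range
    return min_gain + t * (max_gain - min_gain)

def masking_signal(noise, voice_frame):
    """Scale a masking noise frame by the gain derived from the current voice frame."""
    gain = masking_gain(voice_level_db(voice_frame))
    return [gain * n for n in noise]
```

In this sketch the masking level tracks the voice level frame by frame; a real device would likely smooth the gain over time, which is omitted here.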

Regarding claim 3, Satoyoshi teaches a masking device comprising: a microphone array processing circuitry configured to generate an integrated sound collection signal from N (where N is an integer of 2 or more) sound collection signals output by a microphone array including N microphones installed (Satoyoshi figure 2, microphone array 1) for collecting a spoken voice that is a voice of a speaking person and to set (see pertinent art Laroche ¶0093; it is obvious and well known in the art that audio devices use a VAD to determine that the signal picked up by a microphone is a voice signal) the integrated sound collection signal as a spoken voice signal (Satoyoshi figure 1, ¶0033 “picks up the voice of the speaker H1”); a spoken voice volume evaluation circuitry configured to generate an evaluation value for a volume of the spoken voice (hereinafter referred to as a spoken voice volume evaluation value) from the spoken voice signal (Satoyoshi ¶0049, “a masking sound is in a mode where the volume is changed in accordance with the level of the picked up speaker voice. In the case where the level of the picked up speaker voice is low… In the case where the level of the picked up speaker voice is high”); a masking sound signal generation circuitry configured to generate a signal for emitting a masking sound (hereinafter referred to as a masking sound signal) corresponding to the spoken voice volume evaluation value (Satoyoshi ¶0049, “Therefore, also the level of the masking sound can be lowered…masking sound is set to high”) from a speaker array including M (where M is an integer of 2 or more) speakers (Satoyoshi figure 2, masking sound producing section 73 and speaker array 2), the masking sound preventing the spoken voice from being heard by surrounding persons other than the speaking person (Satoyoshi ¶0033, “This causes the third persons H3 to hear the voice of the speaker H1 and the masking sound from the same position, and the cocktail party effect is adequately suppressed”); and a speaker array processing circuitry configured to generate M individual masking sound signals for emitting sound from the speakers included in the speaker array from the masking sound signal (Satoyoshi figure 2 and ¶0033); however, Satoyoshi does not explicitly teach a masking video signal generation circuitry configured to generate a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device.

Benway teaches a masking video signal generation circuitry configured to generate a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device (Benway ¶0028, “masking application 6 includes or interfaces with a digital audio player and a digital video player at computing device 4. Noise masking application 6 outputs (i.e., plays) the selected noise masking sound audio file at loudspeaker(s) 14 and outputs (i.e., plays) the selected video file at video display 16. Although only a single video display 16 is shown, multiple displays may be utilized to output the selected video file”).

Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to use the known technique of Benway to improve the known masking device of Satoyoshi to achieve the predictable result of increasing the user’s psychological comfort with an increased noise masking level (Benway ¶0024).
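The array limitations mapped for claim 3 (one integrated signal from N microphone signals, and M individual signals from one masking sound signal) can be pictured with a short Python sketch. Neither the claim language nor the cited passages say how the signals are combined or split, so the delay-and-sum averaging with known integer sample delays and the fixed per-speaker gains below are assumptions, and the function names are hypothetical.

```python
def integrate_microphones(channels, delays):
    """Delay-and-sum (assumed method): align N channels by integer sample delays, then average."""
    if not channels or len(channels) != len(delays):
        raise ValueError("need at least one channel and one delay per channel")
    # Only keep the samples for which every delayed channel has data.
    n = min(len(ch) - d for ch, d in zip(channels, delays))
    return [
        sum(ch[i + d] for ch, d in zip(channels, delays)) / len(channels)
        for i in range(n)
    ]

def split_to_speakers(masking, gains):
    """Produce M individual masking signals by scaling one masking signal per speaker."""
    return [[g * s for s in masking] for g in gains]
```

For example, two microphones whose second channel lags by one sample would be combined with delays [0, 1], and a two-speaker array with gains [0.5, 1.0] would receive two scaled copies of the same masking signal.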

Regarding claim 6, Satoyoshi teaches a masking method comprising: a spoken voice volume evaluation step of generating, by a masking device, an evaluation value for a volume of a spoken voice (hereinafter referred to as a spoken voice volume evaluation value) from a spoken voice signal by using, as the spoken voice signal (Satoyoshi ¶0049, “a masking sound is in a mode where the volume is changed in accordance with the level of the picked up speaker voice. In the case where the level of the picked up speaker voice is low… In the case where the level of the picked up speaker voice is high”), a sound collection signal output by a microphone installed for collecting the spoken voice which is a voice of a speaking person (Satoyoshi figure 2, microphone array 1); a masking sound signal generation step of generating, by the masking device, a signal for emitting a masking sound from a speaker (hereinafter referred to as a masking sound signal) corresponding to the spoken voice volume evaluation value (Satoyoshi ¶0049, “Therefore, also the level of the masking sound can be lowered…masking sound is set to high”), the masking sound preventing the spoken voice from being heard by surrounding persons other than the speaking person (Satoyoshi ¶0033, “This causes the third persons H3 to hear the voice of the speaker H1 and the masking sound from the same position, and the cocktail party effect is adequately suppressed”); however, Satoyoshi does not explicitly teach a masking video signal generation step of generating, by the masking device, a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device.

Benway teaches a masking video signal generation step of generating, by the masking device, a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device (Benway ¶0028, “masking application 6 includes or interfaces with a digital audio player and a digital video player at computing device 4. Noise masking application 6 outputs (i.e., plays) the selected noise masking sound audio file at loudspeaker(s) 14 and outputs (i.e., plays) the selected video file at video display 16. Although only a single video display 16 is shown, multiple displays may be utilized to output the selected video file”).

Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to use the known technique of Benway to improve the known masking device of Satoyoshi to achieve the predictable result of increasing the user’s psychological comfort with an increased noise masking level (Benway ¶0024).

Regarding claim 7, Satoyoshi teaches a masking method comprising: a microphone array processing step of generating, by a masking device, an integrated sound collection signal from N (where N is an integer of 2 or more) sound collection signals output by a microphone array including N microphones installed (Satoyoshi figure 2, microphone array 1) for collecting a spoken voice that is a voice of a speaking person and to set (see pertinent art Laroche ¶0093; it is obvious and well known in the art that audio devices use a VAD to determine that the signal picked up by a microphone is a voice signal) the integrated sound collection signal as a spoken voice signal (Satoyoshi figure 1, ¶0033 “picks up the voice of the speaker H1”); a spoken voice volume evaluation step of generating, by the masking device, an evaluation value for a volume of the spoken voice (hereinafter referred to as a spoken voice volume evaluation value) from the spoken voice signal (Satoyoshi ¶0049, “a masking sound is in a mode where the volume is changed in accordance with the level of the picked up speaker voice. In the case where the level of the picked up speaker voice is low… In the case where the level of the picked up speaker voice is high”); a masking sound signal generation step of generating, by the masking device, a signal for emitting a masking sound (hereinafter referred to as a masking sound signal) corresponding to the spoken voice volume evaluation value (Satoyoshi ¶0049, “Therefore, also the level of the masking sound can be lowered…masking sound is set to high”) from a speaker array including M (where M is an integer of 2 or more) speakers (Satoyoshi figure 2, masking sound producing section 73 and speaker array 2), the masking sound preventing the spoken voice from being heard by surrounding persons other than the speaking person (Satoyoshi ¶0033, “This causes the third persons H3 to hear the voice of the speaker H1 and the masking sound from the same position, and the cocktail party effect is adequately suppressed”); and a speaker array processing step of generating, by the masking device, M individual masking sound signals for emitting sound from the speakers included in the speaker array from the masking sound signal (Satoyoshi figure 2 and ¶0033); however, Satoyoshi does not explicitly teach a masking video signal generation step of generating, by the masking device, a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device.

Benway teaches a masking video signal generation step of generating, by the masking device, a signal for presenting a video corresponding to the masking sound (hereinafter referred to as a masking video signal) from a video presentation device (Benway ¶0028, “masking application 6 includes or interfaces with a digital audio player and a digital video player at computing device 4. Noise masking application 6 outputs (i.e., plays) the selected noise masking sound audio file at loudspeaker(s) 14 and outputs (i.e., plays) the selected video file at video display 16. Although only a single video display 16 is shown, multiple displays may be utilized to output the selected video file”).

Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to use the known technique of Benway to improve the known masking device of Satoyoshi to achieve the predictable result of increasing the user’s psychological comfort with an increased noise masking level (Benway ¶0024).

Regarding claim 8, Satoyoshi in view of Benway teaches a program causing a computer to function as the masking device according to claim 1 (Satoyoshi figure 2).

Regarding claim 9, Satoyoshi in view of Benway teaches a program causing a computer to function as the masking device according to claim 3 (Satoyoshi figure 2).

Claim(s) 2 and 4 is/are rejected under 35 U.S.C. 103 as being unpatentable over Satoyoshi (US 2013/0170655) in view of Benway (US 2018/0151168) in further view of Kobayashi (US 2013/0163772).

Regarding claims 2 and 4, Satoyoshi in view of Benway does not explicitly teach a masking sound erasing circuitry configured to generate a signal in which a component caused by the masking sound included in the sound collection signal is erased by using the sound collection signal and the masking sound signal, and to use the signal as the spoken voice signal.

Kobayashi teaches a masking sound erasing circuitry configured to generate a signal in which a component caused by the masking sound included in the sound collection signal is erased by using the sound collection signal and the masking sound signal, and to use the signal as the spoken voice signal (Kobayashi figure 2, and ¶0029 echo cancelling section 12).

Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to use the known technique of Kobayashi to improve the known masking device of Satoyoshi in view of Benway to achieve the predictable result of a masking sound with minimal echo (Kobayashi ¶0015).
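The erasing limitation of claims 2 and 4 (removing the masking component from the sound collection signal by using the sound collection signal and the masking sound signal) resembles classic echo cancellation. The office action cites only Kobayashi's echo cancelling section 12 and does not describe an algorithm, so the following Python sketch swaps in a normalized LMS (NLMS) adaptive filter as an assumed, commonly used technique. The function name, tap count, and step size are hypothetical.

```python
def erase_masking(mic, masking, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptively estimated masking component from the microphone signal (NLMS)."""
    weights = [0.0] * taps   # estimate of the speaker-to-microphone path
    history = [0.0] * taps   # most recent masking samples, newest first
    residual = []
    for d, x in zip(mic, masking):
        history = [x] + history[:-1]
        estimate = sum(w * h for w, h in zip(weights, history))
        e = d - estimate      # what remains after removing the estimated masking component
        power = sum(h * h for h in history) + eps
        weights = [w + mu * e * h / power for w, h in zip(weights, history)]
        residual.append(e)
    return residual
```

The returned residual would then serve as the spoken voice signal for the volume evaluation. The sketch adapts continuously, including while the person is speaking; practical cancellers usually add double-talk handling, which is left out here.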

Claim(s) 5 is/are rejected under 35 U.S.C. 103 as being unpatentable over Satoyoshi (US 2013/0170655) in view of Kim (US 2016/0080537).

Regarding claim 5, Satoyoshi teaches a masking device comprising: a microphone array processing circuitry configured to generate an integrated sound collection signal from N (where N is an integer of 2 or more) sound collection signals output by a microphone array including N microphones installed (Satoyoshi figure 2, microphone array 1) for collecting a spoken voice that is a voice of a speaking person and to set (see pertinent art Laroche ¶0093; it is obvious and well known in the art that audio devices use a VAD to determine that the signal picked up by a microphone is a voice signal) the integrated sound collection signal as a spoken voice signal (Satoyoshi figure 1, ¶0033 “picks up the voice of the speaker H1”); a spoken voice volume evaluation circuitry configured to generate an evaluation value for a volume of the spoken voice (hereinafter referred to as a spoken voice volume evaluation value) from the spoken voice signal (Satoyoshi ¶0049, “a masking sound is in a mode where the volume is changed in accordance with the level of the picked up speaker voice. In the case where the level of the picked up speaker voice is low… In the case where the level of the picked up speaker voice is high”); a masking sound signal generation circuitry configured to generate a signal for emitting a masking sound (hereinafter referred to as a masking sound signal) corresponding to the spoken voice volume evaluation value (Satoyoshi ¶0049, “Therefore, also the level of the masking sound can be lowered…masking sound is set to high”) from a speaker array including M (where M is an integer of 2 or more) speakers (Satoyoshi figure 2, masking sound producing section 73 and speaker array 2), the masking sound preventing the spoken voice from being heard by surrounding persons other than the speaking person (Satoyoshi ¶0033, “This causes the third persons H3 to hear the voice of the speaker H1 and the masking sound from the same position, and the cocktail party effect is adequately suppressed”); and a speaker array processing circuitry configured to generate M individual masking sound signals for emitting sound from the speakers included in the speaker array from the masking sound signal (Satoyoshi figure 2 and ¶0033), is a signal such that the higher the spoken voice volume evaluation value indicates, the greater the sound emitted by the signal is (Satoyoshi ¶0049, “In the case where the level of the picked up speaker voice is low, the speaker voice reaches the third persons H3 at a low level, and the content of a conversation is hardly understood. Therefore, also the level of the masking sound can be lowered. In the case where the level of the picked up speaker voice is high, by contrast, the speaker voice reaches the third persons H3 at a high level”); however, Satoyoshi does not explicitly teach wherein, of the M individual masking sound signals, an individual masking sound signal directed to a direction of the speaking person.

Kim teaches wherein, of the M individual masking sound signals, an individual masking sound signal directed to a direction of the speaking person (Kim ¶0048, “the locations/directions/phases of the speakers in device 300 may be adjusted such that the masking noise is nullified at or around the ear of the user while the masking noise sounds with a greater volume in the surroundings of the ear”).

Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to use the known technique of Kim to improve the known masking device of Satoyoshi to achieve the predictable result of controlled noise masking at wanted and unwanted areas.

Conclusion
The prior art made of record and not relied upon is considered pertinent to applicant's disclosure: Laroche (US 2021/0104222).

Any inquiry concerning this communication or earlier communications from the examiner should be directed to NORMAN YU whose telephone number is (571)270-7436.  The examiner can normally be reached on Mon - Fri 11am-7pm.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Ahmad Matar can be reached on 571-272-7488.  The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Any response to this action should be mailed to:
    Commissioner of Patents and Trademarks
    P.O. Box 1450
    Alexandria, Va. 22313-1450
Or faxed to:
    (571) 273-8300, for formal communications intended for entry and for informal or draft communications, please label “PROPOSED” or “DRAFT”.
Hand-delivered responses should be brought to:
    Customer Service Window
    Randolph Building
    401 Dulany Street
    Arlington, VA 22314

Information regarding the status of an application may be obtained from the Patent Application Information Retrieval (PAIR) system.  Status information for published applications may be obtained from either Private PAIR or Public PAIR.  Status information for unpublished applications is available through Private PAIR only.  For more information about the PAIR system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative or access to the automated information system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.

/NORMAN YU/Primary Examiner, Art Unit 2693

Prosecution Timeline

Jan 31, 2024

Application Filed

Dec 10, 2025

Non-Final Rejection — §103

Apr 01, 2026

Response Filed

Precedent Cases

Applications granted by this same examiner with similar technology

18/205,362

Patent 12604123

APPARATUS AND VEHICULAR APPARATUS INCLUDING THE SAME

2y 5m to grant; granted Apr 14, 2026

18/188,055

Patent 12598409

IN-EAR WEARABLE DEVICE

2y 5m to grant; granted Apr 07, 2026

18/312,253

Patent 12594882

AUTOMOTIVE SOUND AMPLIFICATION

2y 5m to grant; granted Apr 07, 2026

18/327,873

Patent 12593165

ACOUSTIC INPUT-OUTPUT DEVICES

2y 5m to grant; granted Mar 31, 2026

18/343,228

Patent 12581238

BINDING BAND ASSEMBLY FOR HEADSET AND HEADSET

2y 5m to grant; granted Mar 17, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Prosecution Projections

1-2

Expected OA Rounds

88%

Grant Probability

99%

With Interview (+13.5%)

2y 1m

Median Time to Grant

Low

PTA Risk

Based on 598 resolved cases by this examiner. Grant probability derived from career allow rate.
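The headline figures can be cross-checked with simple arithmetic, shown here as a small Python sketch. The page says only that grant probability is derived from the career allow rate; treating the 99% interview figure as the allow rate plus the lift, capped at 99%, is an assumption, as is reading each "vs TC avg" delta as a plain difference in percentage points. All function names are hypothetical.

```python
def allow_rate_pct(granted, resolved):
    """Career allow rate as a percentage of resolved cases."""
    return 100.0 * granted / resolved

def with_interview_pct(base_pct, lift_pct, cap_pct=99.0):
    """Allow rate plus interview lift, capped (the cap value is an assumption)."""
    return min(cap_pct, base_pct + lift_pct)

def implied_average_pct(value_pct, delta_pct):
    """Baseline implied by a value and its delta, assuming the delta is a simple difference."""
    return value_pct - delta_pct

base = allow_rate_pct(525, 598)             # about 87.8, displayed on the page as 88%
boosted = with_interview_pct(base, 13.5)    # uncapped sum exceeds 100, so the cap applies
baseline_103 = implied_average_pct(51.8, 11.8)
```

Under these assumptions, the four statute-specific deltas all imply the same Tech Center baseline of about 40%, which is consistent with the page describing that baseline as an estimate.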