DETAILED ACTION
Claims 1, 3-9, 11, 13-19, and 51-54 are pending.
Notice of Pre-AIA or AIA Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA.
Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.
Claim(s) 1, 3, 5, 7-9, 11, 13, 15, 17-19, 51-52 is/are rejected under 35 U.S.C. 103 as being unpatentable over Bhattacharjee et al. (US 2018/0302687) in view of Vaughn et al. (US 2020/0135225), and further in view of Mahyar et al. (US 11,216,684), and further in view of Kahn (US 2018/0160190).
Claim 1, Bhattacharjee teaches a method comprising:
analyzing, using control circuitry, media content to determine one or more parameters (i.e. scenes where captions can be displayed) associated with the media content, and wherein the one or more parameters indicate at least one of the location (i.e. ‘scenes’ is the location within the video file), appearance, or language of the first captions (p. 0020);
generating, using control circuitry, metadata (i.e. tagging content) storing the one or more parameters (i.e. stored on server) (p. 0020);
receiving, using control circuitry, a request (i.e. a request for closed captioning) to display the media content with captioning enabled (p. 0025-0032); and
generating, using control circuitry, for display on the media content, second captions (i.e. group-based aggregated captions) based at least in part on the metadata (i.e. tagged scene location) and the user preferences (i.e. provides personalized closed captioning) (p. 0020, 0030-0032).
Bhattacharjee is silent regarding the specific features of:
“wherein the media content includes first captions”, and
“wherein the first captions are burned into pixels of one or more frames of the media content”;
determining, using control circuitry, based at least in part on the metadata associated with the media content, whether the first captions meet one or more user preferences;
based, at least in part on determining the first captions do not meet the one or more user preferences;
modifying, using control circuitry, the one or more frames of the media content, wherein the modification comprises changing a color of the pixels of the one or more frames;
“wherein the generating the second captions comprises inpainting an area covered by the first captions.”
Vaughn teaches the specific features of:
“wherein the media content includes first captions” (i.e. media content and associated subtitle data) (p. 0003), and
determining, using control circuitry, based at least in part on the metadata (i.e. raw subtitle data) associated with the media content, whether the first captions meet one or more user preferences (i.e. captions in desired language) (p. 0070-0072);
based, at least in part on determining the first captions do not meet the one or more user preferences (i.e. captions not in the desired language call for augmented subtitle data) (p. 0070-0072).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided augmented subtitle data as taught by Vaughn to the system of Bhattacharjee to provide a group-preferred language within the subtitles (p. 0072).
Mahyar teaches the specific features of:
“wherein the first captions are burned into pixels of one or more frames of the media content” (i.e. burned-in subtitles) (col. 1, line 43 to col. 2, line 13);
modifying, using control circuitry, the one or more frames of the media content, wherein the modification comprises changing a color of the pixels of the one or more frames (i.e. color and brightness correction) (col. 1, line 43 to col. 2, line 13).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided subtitle correction as taught by Mahyar to the system of Bhattacharjee to remove burned-in subtitles (col. 1, line 43 to col. 2, line 13).
Kahn teaches the method of claim 1, wherein generating the second captions comprises inpainting an area covered by the first captions (i.e. the ability to move, resize, or change the opacity of the captions) (p. 0064).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided modified caption data packaging as taught by Kahn to the system of Bhattacharjee to efficiently provide caption data through communication channels (p. 0050).
Claim 3, Bhattacharjee teaches the method of claim 1, the method comprising:
receiving a request to display the second captions in a language (i.e. common language for group) (p. 0025-0032);
accessing the metadata in response to the request (i.e. determining group preferences) (p. 0025-0032);
determining, based on the metadata, whether the language of the requested second captions matches a language of the first captions (i.e. matching occurs during the process of selecting captioning data for one group or single profile versus another) (p. 0025-0033); and
in response to the language of the requested second captions matching the language of the first captions, disregarding the request to display the second captions (i.e. matching occurs during the same selection process; here, the language matches the group profile, so no adjustment is needed) (p. 0025-0033).
Claims 5 and 7, Bhattacharjee teaches the method of claim 1, the method comprising:
generating modified media content (i.e. adjusting caption data) by removing the first captions (i.e. changing content length) (p. 0032);
determining a user preference (p. 0010);
comparing the user preference with the metadata (i.e. user profile compared to caption data stored in database) (p. 0014-0015).
Bhattacharjee is silent regarding the method of claim 1, the method comprising:
generating a stream for transmitting the media content, the stream having a version of the modified media content and a version of unmodified media content encoded therein;
decoding, for display, the unmodified media content when the user preference matches a parameter stored in the metadata; or
decoding, for display, the modified media content when the user preference does not match a parameter stored in the metadata.
Kahn teaches the method of claim 1, the method comprising:
generating a stream for transmitting the media content (i.e. packaging component), the stream having a version of the modified media content and a version of unmodified media content encoded therein (i.e. creating packages of closed caption data or modified closed caption data) (p. 0045-0046, 0050);
decoding, for display, the unmodified media content when the user preference matches a parameter stored in the metadata (i.e. no modified closed caption data) (p. 0045-0046, 0050); or
decoding, for display, the modified media content when the user preference does not match a parameter stored in the metadata (i.e. modified closed caption data to be included in the package) (p. 0045-0046, 0050).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided modified caption data packaging as taught by Kahn to the system of Bhattacharjee to efficiently provide caption data through communication channels (p. 0050).
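For illustration only, a minimal sketch of the dual-variant selection logic described above follows; all names (e.g. select_variant, user_preference) are hypothetical stand-ins, and the sketch is not drawn from Kahn's actual implementation.

# Illustrative sketch only (Python); names are hypothetical. A stream carries
# both a modified variant (first captions removed) and an unmodified variant;
# the decoder selects one depending on whether the user preference matches a
# parameter stored in the metadata.
def select_variant(stream: dict, user_preference: str) -> bytes:
    """Return the encoded variant to decode for display."""
    metadata = stream["metadata"]  # e.g. {"caption_language": "en"}
    if user_preference in metadata.values():
        # Preference matches a stored parameter: decode the unmodified content.
        return stream["unmodified_variant"]
    # Preference does not match: decode the modified content.
    return stream["modified_variant"]

stream = {
    "metadata": {"caption_language": "en"},
    "unmodified_variant": b"frames with original captions",
    "modified_variant": b"frames with captions removed",
}
assert select_variant(stream, "en") == stream["unmodified_variant"]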
Claim 8, Bhattacharjee teaches the method of claim 1, the method comprising:
determining whether a quality of the first captions is less than a quality value (i.e. size not large enough for the visually impaired) (p. 0031); and
receiving a request to display the media content with second captions in response to determining that the quality of the first captions is less than the quality value (i.e. increasing font size) (p. 0031).
Claim 9, Bhattacharjee is silent regarding the method of claim 1, the method comprising at least one of:
displaying the second captions at a position on the media content to not obscure the first captions; or
displaying the second captions at a position on the media content to obscure the first captions.
Kahn teaches the method of claim 1, the method comprising at least one of:
displaying the second captions at a position on the media content to not obscure the first captions (i.e. modified captions are displayed in a different location) (fig. 5; p. 0057); or
displaying the second captions at a position on the media content to obscure the first captions (i.e. the ability to move or resize the captions allows the user the ability to obscure the original captions) (p. 0064).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided modified caption data packaging as taught by Kahn to the system of Bhattacharjee to efficiently provide caption data through communication channels (p. 0050).
Claim 11 is analyzed and interpreted as an apparatus of claim 1.
Claim 13 is analyzed and interpreted as an apparatus of claim 3.
Claim 15 is analyzed and interpreted as an apparatus of claim 5.
Claim 17 is analyzed and interpreted as an apparatus of claim 7.
Claim 18 is analyzed and interpreted as an apparatus of claim 8.
Claim 19 is analyzed and interpreted as an apparatus of claim 9.
Claim 51, Bhattacharjee teaches the method of claim 1, wherein the request is a first request, and wherein the one or more user preferences are one or more first user preferences, the method further comprising:
receiving, using control circuitry, a second request to display the media content (i.e. requests are automatically made when viewing content; watching a second item of video content constitutes a second automatic request) (p. 0025).
Bhattacharjee is silent regarding the specific features of:
determining, using control circuitry, based at least in part on the metadata associated with the media content, whether the first captions meet one or more second user preferences; and
based, at least in part on determining the first captions do meet the one or more user preferences, generating, using control circuitry, the media content with the first captions.
Vaughn teaches the specific features of:
determining, using control circuitry, based at least in part on the metadata associated with the media content (i.e. media content and associated subtitle data), whether the first captions meet one or more second user preferences (i.e. personal user preferences or group preferences) (p. 0070-0072); and
based, at least in part on determining the first captions do meet the one or more user preferences (i.e. current language is preferred), generating, using control circuitry, the media content with the first captions (i.e. raw subtitle data) (p. 0070-0072).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided augmented subtitle data as taught by Vaughn to the system of Bhattacharjee to provide a group-preferred language within the subtitles (p. 0072).
Claim 52 is analyzed and interpreted as an apparatus of claim 51.
Claim(s) 4, 6, 14, 16 is/are rejected under 35 U.S.C. 103 as being unpatentable over Bhattacharjee et al. (US 2018/0302687) in view of Vaughn et al. (US 2020/0135225), and further in view of Mahyar et al. (US 11216684), and further in view of Kahn (US 2018/0160190), and further in view of Chen et al. (US 2019/0306563).
Claim 4, Bhattacharjee teaches the method of claim 1, wherein analyzing the media content comprises at least one of:
analyzing the media content to determine one or more portions of the media content having the first captions (i.e. database holds captions for content) (p. 0014-0015);
analyzing the media content to determine a visual parameter of the first captions (e.g. font size) (p. 0023-0024);
accessing metadata of the media content (i.e. video captioning program can access the database 112) (p. 0014-0015).
Bhattacharjee is silent regarding the method of claim 1, wherein analyzing the media content comprises at least one of:
analyzing the media content to determine an audio parameter of the media content.
Chen teaches the method of claim 1, wherein analyzing the media content comprises at least one of:
analyzing the media content to determine an audio parameter of the media content (i.e. detect low volume) (p. 0030).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided audio detection as taught by Chen to the system of Bhattacharjee to provide automatic enabling of closed captioning (p. 0030).
Claim 6, Bhattacharjee is silent regarding the method of claim 5, the method comprising:
determining a requested volume level of the media content; and
displaying, by default, the unmodified version of the media content when the requested volume level is below a predetermined volume level.
Chen teaches the method of claim 5, the method comprising:
determining a requested volume level of the media content (i.e. detecting volume) (p. 0030); and
displaying, by default, the unmodified version of the media content when the requested volume level is below a predetermined volume level (i.e. streaming without captions when the volume is not low enough) (p. 0030).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided audio detection as taught by Chen to the system of Bhattacharjee to provide automatic enabling of closed captioning (p. 0030).
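For illustration only, a minimal sketch of the volume-based default display logic recited in claim 6 follows; the threshold value and names are hypothetical and not drawn from Chen, and the non-recited else branch is an assumption.

# Illustrative sketch only (Python); names and the threshold are hypothetical.
PREDETERMINED_VOLUME_LEVEL = 0.2  # hypothetical threshold on a 0.0-1.0 scale

def choose_version(requested_volume: float) -> str:
    if requested_volume < PREDETERMINED_VOLUME_LEVEL:
        # Requested volume is below the predetermined level: display, by
        # default, the unmodified version of the media content.
        return "unmodified media content"
    # Not recited by the claim; assumed for illustration: display the
    # modified version (e.g. with second captions) otherwise.
    return "modified media content"

print(choose_version(0.1))  # unmodified media content
print(choose_version(0.8))  # modified media content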
Claim 14 is analyzed and interpreted as an apparatus of claim 4.
Claim 16 is analyzed and interpreted as an apparatus of claim 6.
Claim(s) 53-54 is/are rejected under 35 U.S.C. 103 as being unpatentable over Bhattacharjee et al. (US 2018/0302687) in view of Vaughn et al. (US 2020/0135225), and further in view of Mahyar et al. (US 11216684), and further in view of Kahn (US 2018/0160190), and further in view of Karia et al. (US 2024/0404525).
Claim 53, Bhattacharjee is silent regarding the method of claim 1, wherein the inpainting the area covered by the first captions further comprises implementing at least one inpainting algorithm comprising at least one of a texture synthesis based image inpainting algorithm or an isophote driven inpainting algorithm.
Karia teaches the method of claim 1, wherein the inpainting the area covered by the first captions (i.e. inpainting an overlay) further comprises implementing at least one inpainting algorithm comprising at least one of a texture synthesis based image inpainting algorithm or an isophote driven inpainting algorithm (i.e. any inpainting technique) (p. 0098).
Therefore, it would have been obvious to one of ordinary skill in the art before the effective filing date of the present invention to have provided inpainting techniques as taught by Karia to the system of Bhattacharjee to provide an overlay for new captions (p. 0098).
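For illustration only, a minimal sketch of isophote-driven inpainting applied to a caption area follows, using OpenCV's cv2.inpaint with the Navier-Stokes based flag (cv2.INPAINT_NS), which is commonly characterized as isophote-driven; the bounding box is a hypothetical stand-in for metadata identifying the caption location, and the sketch is not drawn from Karia.

# Illustrative sketch only (Python); not drawn from the cited references.
import cv2
import numpy as np

def inpaint_caption_area(frame: np.ndarray, box: tuple) -> np.ndarray:
    """Inpaint the rectangular area covered by the first captions."""
    x, y, w, h = box  # caption bounding box, assumed known from metadata
    mask = np.zeros(frame.shape[:2], dtype=np.uint8)
    mask[y:y + h, x:x + w] = 255  # mark caption pixels for replacement
    # cv2.INPAINT_NS is the Navier-Stokes based (isophote-driven) method;
    # cv2.INPAINT_TELEA (fast marching) is an alternative flag.
    return cv2.inpaint(frame, mask, inpaintRadius=3, flags=cv2.INPAINT_NS)

frame = np.full((360, 640, 3), 128, dtype=np.uint8)  # toy gray frame
cleaned = inpaint_caption_area(frame, (100, 300, 440, 40))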
Claim 54 is analyzed and interpreted as an apparatus of claim 53.
Response to Arguments
Applicant's arguments filed 12/22/2025 have been fully considered but they are not persuasive.
Claim 1, Applicant argues that Kahn [0064] merely discusses a user's "ability to move, remove, resize, or change the opacity of a closed caption box." Moving, resizing, or changing the opacity of a closed captions box is not the same as "inpainting an area covered by the first captions," as recited by Applicant's amended claim 1. Inpainting is a technique that replaces certain parts of an image using an algorithm, for example, "a texture synthesis based image inpainting algorithm or an isophote driven inpainting algorithm" as recited by new claims 53 and 54. Kahn makes no reference to inpainting or inpainting algorithms.
The Examiner respectfully disagrees. Reading the claims in the broadest reasonable sense, the limitation recites “inpainting an area covered by the first captions”. One of ordinary skill in the art would recognize “inpainting” as encompassing a modification of the captions. Kahn discloses that captions can be modified and then returned to the original captions. This is interpreted as “inpainting” (modifying) the area in which the original caption would have been displayed. Therefore, the modified, or inpainted, captions are in place of, or cover, the area of the first captions.
Conclusion
Claims 1, 3-9, 11, 13-19, and 51-54 are rejected.
Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.
Inquiries
Any inquiry concerning this communication or earlier communications from the examiner should be directed to MUSHFIKH I ALAM whose telephone number is (571) 270-1710. The examiner can normally be reached 1:00 PM to 9:00 PM.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Nasser Goodarzi can be reached on 571-272-4195. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.
MUSHFIKH I. ALAM
Primary Examiner
Art Unit 2426
/MUSHFIKH I ALAM/Primary Examiner, Art Unit 2426 2/3/2026