Last updated: April 19, 2026

Application No. 19/325,482

Semantically Interpreted Video Method, System, and Apparatus

Final Rejection §112

Filed

Sep 10, 2025

Examiner

KY, KEVIN

Art Unit

2671

Tech Center

2600 — Communications

Assignee

Revealit Corporation

OA Round

2 (Final)

Interview Optional

— +25.3% interview lift. This examiner has a relatively high allow rate; a written response may suffice.

Based on 549 resolved cases, 2023–2026

Examiner Intelligence

KY, KEVIN View full profile →

Grants 76% — above average

Career Allow Rate

420 granted / 549 resolved

+14.5% vs TC avg

Strong +25% interview lift

Without

With

+25.3%

Interview Lift

resolved cases with interview

Typical timeline

2y 6m

Avg Prosecution

33 currently pending

Career history

582

Total Applications

across all art units

Statute-Specific Performance

§101

17.6%

-22.4% vs TC avg

§103

46.5%

+6.5% vs TC avg

§102

20.8%

-19.2% vs TC avg

§112

9.9%

-30.1% vs TC avg

Black line = Tech Center average estimate • Based on career data from 549 resolved cases

Office Action

§112

DETAILED ACTION
Claim Rejections - 35 USC § 112
The following is a quotation of the first paragraph of 35 U.S.C. 112(a):
(a) IN GENERAL.—The specification shall contain a written description of the invention, and of the manner and process of making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it pertains, or with which it is most nearly connected, to make and use the same, and shall set forth the best mode contemplated by the inventor or joint inventor of carrying out the invention.

The following is a quotation of the first paragraph of pre-AIA  35 U.S.C. 112:
The specification shall contain a written description of the invention, and of the manner and process of making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it pertains, or with which it is most nearly connected, to make and use the same, and shall set forth the best mode contemplated by the inventor of carrying out his invention.

Claims 1-20 are rejected under 35 U.S.C. 112(a) or 35 U.S.C. 112 (pre-AIA ), first paragraph, as failing to comply with the written description requirement. The claim(s) contains subject matter which was not described in the specification in such a way as to reasonably convey to one skilled in the relevant art that the inventor or a joint inventor, or for applications subject to pre-AIA  35 U.S.C. 112, the inventor(s), at the time the application was filed, had possession of the claimed invention.
Claims 1, 9 and 17 requires “generating automatically an interpretation the first plurality of syntactical elements by applying a first trained computer-implemented neural network, wherein the first trained computer implemented neural network is trained using first training syntactical elements and the first trained neural network learns to make semantic-based inferences from received syntactical elements; identifying probabilistically by applying a second trained computer implemented neural network the one or more objects in accordance with the interpreting of the first plurality of syntactical elements, wherein the second trained neural network is trained using a second training set that comprises second training syntactical elements and training images and the second trained neural network learns during training correspondences between each of a plurality of subsets of the second training syntactical elements and one or more patterns of pixels in the training images; generating a communication comprising a second plurality of syntactical elements by applying a third trained computer-implemented neural network, wherein the second plurality of syntactical elements refers to a plurality of attributes that are associated with the identified one or more objects”. The specifications do not disclose sufficient detail to demonstrate possession of these claimed neural-networks. In particular, the specifications fails to disclose the structure or architecture of any of the recite neural networks, how “semantic-based inferences” are generated or represented, how correspondences between syntactical elements and pixel patterns are learned, or any specific algorithm, model topology, or training regime enabling the claimed learning behavior. The disclosure at various paragraphs of the specifications (¶26-30, ¶33-34, ¶39-40, ¶164-165, ¶190-191) merely states that neural network perform these functions, without describing how those functions are achieved. Such high-level, result-oriented descriptions do not demonstrate possession of the full scope of the claimed invention.  Dependent claims 2-8, 10-16 and 18-20 do not cure the deficiencies identified in respective independent claims 1, 9 and 17. Accordingly, these dependent claims are not supported by adequate written description for at least the same reasons as claims 1, 9 and 17.

Claims 1-20 are rejected under 35 U.S.C. 112(a) or 35 U.S.C. 112 (pre-AIA ), first paragraph, as failing to comply with the enablement requirement.  The claim(s) contains subject matter which was not described in the specification in such a way as to enable one skilled in the art to which it pertains, or with which it is most nearly connected, to make and/or use the invention.
Claims 1, 9 and 17 broadly encompasses interpretation of textual and oral syntactical elements, probabilistic object identification in video, semantic context inference, latency-aware object identification, and training and deployment of one or more neural networks performing all of the claimed functions. However, the specifications various paragraphs (¶26-30, ¶33-34, ¶39-40, ¶164-165, ¶190-191) does not teach how to train the claimed neural networks to perform semantic inference, how probabilities and contextual inferences are computed, how latency considerations affect identification, how audio interpretation is integrated in video processing, and how a single neural network can be trained to perform all recited roles (claim 6, 14, and 20). As a result, a person of ordinary skill in the art would be required to engage in undue experimentation, including designing architectures, selecting representations, and inventing training strategies not disclosed in the specifications. Therefore, claims 1-20 are not enabled across their full scope. Dependent claims 2-8, 10-16 and 18-20 do not cure the deficiencies identified in respective independent claims 1, 9 and 17. Accordingly, these dependent claims are not enabled across their full scope for at least the same reasons as claims 1, 9 and 17.

Response to Arguments
Applicant's arguments filed 3/11/2026 directed to 35 U.S.C. 112(a) have been fully considered but they are not persuasive.
While the specifications does describe the use of neural networks in a general sense, it does not reasonably convey to one of ordinary skill in the art that the inventors had possession of the specific architecture now being claimed, namely, a system comprising three distinct trained neural networks, each having a different functional role and training paradigm as recited in claims 1, 9 and 17.
In particular, the specification consistently describes an “adaptive system” that may employ neural networks for various tasks, such as object identification or response generation, but these disclosures are generic and interchangeable, rather than indicative of a modular, multi-network pipeline. For example, the specifications explains that object identification may be performed through application of a neural network and that responses may be generated using neural networks of statistical learning approaches. However, such disclosures merely demonstrate that neural networks may be used within the system, not that the inventors possessed three separately trained neural networks with the distinct roles now claimed. The present claim requires (i) a first trained computer-implemented neural network trained using first training syntactical elements and the first trained neural network learns to make semantic-based inferences from the first plurality of syntactical elements received, (ii) a second trained neural network trained using a second training set that comprises second training syntactical elements and training images, and (iii) a third neural network that generates a communication comprising syntactical elements referring to the object attributes. The specifications does not describe this tripartite architecture, nor does it separate training processes or datasets corresponding to each of these recited neural networks. 
Applicant’s reliance on disclosures relating to semantic processing and syntactical elements is likewise not persuasive. Although the specifications discusses parsing queries into syntactical elements and forming semantic chains, these descriptions are mainly framed in terms of rule-based or structure representations (e.g. semantic chains, RDF-like relationships), rather than a trained neural network that learns to make semantic-based inferences from syntactical inputs, as claimed. These cited portions of the specifications do not describe a neural network being trained on syntactical elements to perform the claimed interpretive function.
With respect to the second neural network, the specifications does disclose training using labeled images, where labels may correspond to words or phrases, and matching pixel patterns to such labels. However, this disclosure does not support the claimed requirement that the neural network is trained using “a plurality of subsets of syntactical elements” and learns “correspondences between each of those subsets…and one or more patterns of pixels”. The specifications describes associating labels with images, but does not describe partitional syntactical elements into subsets or using such subsets in a training process to learn structured correspondences with image features. Nor does the specifications describe a multimodal training paradigm in which syntactical structures, as opposed to simple labels, are jointly learned with image data in a manner required by the claim.
With respect to the third neural network, the specifications describes generating responses or explanations including assembling outputs from words or phrases and providing recommendations or attributes. However, these disclosures are consistent with template-based or rule-based generation, and do not demonstrate possession of a trained neural network that generates a communication comprising syntactical elements as claimed. The specifications does not describe training a neural network for natural language generation or otherwise indicate that the output communication is produced by a trained neural model, as opposed to deterministic or statistical assembly techniques. 
Taken together, the cited portions of the specifications demonstrate that neural networks may be used generally within the disclosed system, but they do not convey possession of the specific combination of three distinct trained neural networks, each with the particular training inputs, learned relationships, and functional roles as claimed. Accordingly, the specifications does not provide adequate written description support for the full scope of the claimed invention, and the rejections under 35 U.S.C. 112(a) is maintained.

Conclusion
THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.
Any inquiry concerning this communication or earlier communications from the examiner should be directed to KEVIN KY whose telephone number is (571)272-7648. The examiner can normally be reached Monday-Friday 9-5PM.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Vincent Rudolph can be reached at 571-272-8243. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/KEVIN KY/Primary Examiner, Art Unit 2671

Read full office action

Prosecution Timeline

Sep 10, 2025

Application Filed

Jan 07, 2026

Non-Final Rejection — §112

Mar 11, 2026

Response Filed

Mar 23, 2026

Final Rejection — §112 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

17/676,432

Patent 12597158

POSE ESTIMATION

2y 5m to grant Granted Apr 07, 2026

18/814,687

Patent 12597291

IMAGE ANALYSIS FOR PERSONAL INTERACTION

2y 5m to grant Granted Apr 07, 2026

18/222,090

Patent 12586393

KNOWLEDGE-DRIVEN SCENE PRIORS FOR SEMANTIC AUDIO-VISUAL EMBODIED NAVIGATION

2y 5m to grant Granted Mar 24, 2026

18/570,168

Patent 12586559

METHOD AND APPARATUS FOR GENERATING SPEECH OUTPUTS IN A VEHICLE

2y 5m to grant Granted Mar 24, 2026

19/080,452

Patent 12579382

NATURAL LANGUAGE GENERATION USING KNOWLEDGE GRAPH INCORPORATING TEXTUAL SUMMARIES

2y 5m to grant Granted Mar 17, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.

Prosecution Projections

3-4

Expected OA Rounds

76%

Grant Probability

99%

With Interview (+25.3%)

2y 6m

Median Time to Grant

Moderate

PTA Risk

Based on 549 resolved cases by this examiner. Grant probability derived from career allow rate.