Prosecution Insights
Last updated: April 18, 2026
Application No. 18/504,889

EFFICIENT EXECUTION OF MACHINE LEARNING MODELS USING PARTITIONING

Final Rejection: §103
Filed: Nov 08, 2023
Examiner: KY, KEVIN
Art Unit: 2671
Tech Center: 2600 (Communications)
Assignee: Qualcomm Incorporated
OA Round: 2 (Final)

Grant Probability: 76% (Favorable)
Expected OA Rounds: 3-4
Estimated Time to Grant: 2y 6m
Grant Probability with Interview: 99%

Examiner Intelligence

Career Allow Rate: 76% (420 granted / 549 resolved); +14.5% vs Tech Center average (above average)
Interview Lift: +25.3% among resolved cases with an interview
Typical Timeline: 2y 6m average prosecution; 33 applications currently pending
Career History: 582 total applications across all art units

Statute-Specific Performance

§101: 17.6% (-22.4% vs TC avg)
§103: 46.5% (+6.5% vs TC avg)
§102: 20.8% (-19.2% vs TC avg)
§112: 9.9% (-30.1% vs TC avg)
Tech Center comparison figures are estimates, based on career data from 549 resolved cases.

Office Action

§103
DETAILED ACTION

Claim Rejections - 35 USC § 103

The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:

A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claim(s) 1-3, 5, 7, 13-15, 17, 19, 25-30 is/are rejected under 35 U.S.C. 103 as being unpatentable over Pemmaraju et al (US 20210319298) in view of Nimmagadda et al (US 20210390460).

Regarding claim 1, Pemmaraju discloses a processor-implemented method of graphical machine learning (¶14 deep learning models by selectively providing for subgraphs; ¶15 Given a deep learning model and a target backend device to be executed on, the technology creates subgraphs that are based on supported operators or nodes.), comprising: receiving a graph for a machine learning model (¶15 The technology helps improve the overall performance of deep learning models by selectively providing for subgraphs to be run on a backend in a manner to reduce or eliminate unnecessary fallbacks between the default runtime and the backend), the graph for the machine learning model including a plurality of subgraphs representing different portions of the machine learning model (¶19 The graph partitioner 120 takes the pretrained model architecture, as marked by the operator capability manager 110, and partitions (e.g., divides) the model into subgraphs (i.e., groups of operators, or clusters). The subgraphs are allocated into two groups—supported subgraphs and unsupported subgraphs); generating an inference based on executing the machine learning model across the plurality of process domains (¶23 The inference engine 170 controls execution of the model code on the various hardware units that are employed for the particular model optimization. The inference engine 170 reads the input data and compiled graphs, instantiates inference on the selected hardware, and returns the output of the inference; ¶35-37 running the efficient subgraphs on the hardware backend, where the efficient subgraphs are as selected based on the compute-based graph partitioning process).

Pemmaraju fails to teach, but Nimmagadda teaches, instantiating the machine learning model across a plurality of process domains associated with a same application based on the plurality of subgraphs in the graph for the machine learning model (¶24 The graph partitioner 120 takes the pretrained model architecture, as marked by the operator capability manager 110, and partitions (e.g., divides) the model into subgraphs (i.e., groups of operators, or clusters). The subgraphs are allocated into two groups—supported subgraphs and unsupported subgraphs; ¶66 The system 158 may partition and distribute subgraphs of the AI model to execute across the AI accelerators 148, graphics processor 132, host processor 134 and/or the one or more edge nodes) and on maximum addressable memory spaces of the plurality of process domains (¶20 an AI model graph (or IR of the AI model graph) may be partitioned based on computations and required memory resources of the AI model graph as well as supported computations and memory capacities of edge devices to reduce network communication and load balance; ¶42 a first portion 356a of the first subgraph 356 is maintained as part of the first subgraph 356. The first portion 356a may include a maximum amount of layers that have memory resource requirements less than the memory capacity of the first accelerator 360a. In contrast, a second portion 356b from the first subgraph 356 are reassigned and pushed into the second subgraph 358 for execution); taking one or more actions based on the generated inference (¶28 The inference engine 170 reads the input data and compiled graphs, instantiates inference on the selected hardware, and returns the output of the inference).

Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of instantiating the machine learning model across a plurality of process domains associated with a same application based on the plurality of subgraphs in the graph for the machine learning model, and on maximum addressable memory spaces of the plurality of process domains, and taking one or more actions based on the generated inference from Nimmagadda into the method as disclosed by Pemmaraju. The motivation for doing this is to improve processing architectures that execute artificial intelligence (AI) processing.
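The memory-capacity partitioning the examiner maps to Nimmagadda ¶20 and ¶42 (keep as many layers as fit a device's memory capacity, push the overflow to the next device) can be sketched as a simple greedy assignment. This is an illustrative sketch by the editor, not code from either cited reference; all names and the greedy strategy are assumptions.

```python
# Illustrative sketch: greedily assign model "layers" to process domains,
# spilling to the next domain when a layer exceeds the current domain's
# remaining addressable memory. Loosely analogous to Nimmagadda's push-down
# of layers that do not fit the first accelerator; names are hypothetical.

def partition_by_memory(layer_sizes, domain_capacities):
    """Return assignments[i] = index of the domain hosting layer i."""
    assignments = []
    domain = 0
    remaining = list(domain_capacities)
    for size in layer_sizes:
        # Current domain is full for this layer: spill to the next one.
        while domain < len(remaining) and remaining[domain] < size:
            domain += 1
        if domain == len(remaining):
            raise MemoryError("model does not fit in the available domains")
        remaining[domain] -= size
        assignments.append(domain)
    return assignments

# Example: five layers across two domains of 100 and 200 memory units.
print(partition_by_memory([40, 50, 30, 60, 90], [100, 200]))  # → [0, 0, 1, 1, 1]
```

A real partitioner would also weigh operator support and communication cost, as both references discuss; the sketch isolates only the memory constraint.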
Regarding claim 2, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, wherein instantiating the machine learning model across the plurality of process domains comprises instantiating a portion of the machine learning model represented by a corresponding subgraph from the plurality of subgraphs based on an amount of memory associated with the portion of the machine learning model and an available amount of memory on a process domain from the plurality of process domains (Nimmagadda ¶20 an AI model graph (or IR of the AI model graph) may be partitioned based on computations and required memory resources of the AI model graph as well as supported computations and memory capacities of edge devices to reduce network communication and load balance; ¶40 The process 350 modifies the first subgraph 356 based on a memory capacity of the first accelerator 360a, 362. That is, the process 350 modifies the first subgraph 356 based on memory resources (e.g., size of weights and activation tensor sizes) required by the first subgraph 356 and a memory capacity of a first accelerator 360a.; the first subgraph 356 may be readjusted and modified to reduce the memory requirements of the first subgraph 356). The motivation to combine the references is discussed above in the rejection of claim 1. 
Regarding claim 3, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, wherein instantiating the machine learning model across the plurality of process domains comprises mapping outputs of a first subgraph from the plurality of subgraphs to inputs of a second subgraph of the plurality of subgraphs (Pemmaraju ¶51 block 310 provides for generating a first set of subgraphs based on supported nodes of a model graph, wherein the supported nodes have operators that are supported by a hardware backend device separate from a default runtime; Illustrated processing block 330 provides for selecting, from the first set of subgraphs, a second set of subgraphs to be run on the hardware backend device based on the evaluated compute efficiency).

Regarding claim 5, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, wherein executing the machine learning model across the plurality of process domains comprises sequentially executing the plurality of subgraphs based on outputs of a first subgraph in the plurality of subgraphs corresponding to inputs of a second subgraph in the plurality of subgraphs (Pemmaraju Fig. 3A & ¶51 illustrated processing block 310 provides for generating a first set of subgraphs based on supported nodes of a model graph, wherein the supported nodes have operators that are supported by a hardware backend device separate from a default runtime. Illustrated processing block 320 provides for evaluating a compute efficiency of each subgraph of the first set of subgraphs relative to the hardware backend device and to a default CPU associated with the default runtime. Illustrated processing block 330 provides for selecting, from the first set of subgraphs, a second set of subgraphs to be run on the hardware backend device based on the evaluated compute efficiency.).
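The output-to-input mapping and sequential execution recited in claims 3 and 5 amount to a pipeline in which each subgraph consumes the previous subgraph's output. A minimal sketch, with subgraphs modeled as plain callables (the names and structure are the editor's assumptions, not code from Pemmaraju):

```python
# Illustrative sketch: run subgraphs in sequence, feeding the outputs of
# each subgraph to the inputs of the next (cf. claims 3 and 5). Subgraphs
# are modeled as callables; all names are hypothetical.

def run_sequential(subgraphs, model_input):
    """Execute subgraphs in order; return the final inference result."""
    activations = model_input
    for subgraph in subgraphs:
        activations = subgraph(activations)  # output -> next subgraph's input
    return activations

# Example: a three-stage "model" whose stages double, increment, and negate.
stages = [lambda x: 2 * x, lambda x: x + 1, lambda x: -x]
print(run_sequential(stages, 10))  # 10 -> 20 -> 21 -> -21
```

In a multi-process-domain setting, each stage would additionally marshal the activations across the domain boundary; the sketch shows only the sequencing.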
Regarding claim 7, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, further comprising releasing the plurality of process domains to terminate execution of the machine learning model (Pemmaraju ¶66 After completion of execution of the operations specified by the code instructions, back end logic 58 retires the instructions of code 42).

Regarding claim(s) 13-15, 17 and 19 (drawn to a system): The rejection/proposed combination of Pemmaraju and Nimmagadda, explained in the rejection of method claim(s) 1-3, 5 and 7, anticipates/renders obvious the steps of the system of claim(s) 13-15, 17 and 19 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 1-3, 5 and 7 is/are equally applicable to claim(s) 13-15, 17 and 19. See further Pemmaraju ¶55-56.

Regarding claim(s) 25-29 (drawn to a system): The rejection/proposed combination of Pemmaraju and Nimmagadda, explained in the rejection of method claim(s) 1-3, 5 and 7, anticipates/renders obvious the steps of the system of claim(s) 25-29 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 1-3, 5 and 7 is/are equally applicable to claim(s) 25-29. See further Pemmaraju ¶55-56.

Regarding claim(s) 30 (drawn to a CRM): The rejection/proposed combination of Pemmaraju and Nimmagadda, explained in the rejection of method claim(s) 1, anticipates/renders obvious the steps of the computer readable medium of claim(s) 30 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 1 is/are equally applicable to claim(s) 30. See further Pemmaraju ¶55-56.

Claim(s) 4 and 16 is/are rejected under 35 U.S.C. 103 as being unpatentable over the combination of Pemmaraju and Nimmagadda as applied to claim 1 and 13 above, and further in view of Kovvuri et al (US 20190286973).

Regarding claim 4, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, but fail to teach, where Kovvuri teaches, wherein each respective subgraph of the plurality of subgraphs is associated with a respective memory space shared within the same application (Kovvuri ¶43 techniques can be applied to such operations to reduce the demands for computation as well as memory bandwidth in a given system; ¶49 the memory interface 140 manages allocation of virtual memory, expanding the available main memory 145). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein each respective subgraph of the plurality of subgraphs is associated with a respective memory space shared within the same application from Kovvuri into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve performance and cost of machine learning models.

Regarding claim(s) 16 (drawn to a system): The rejection/proposed combination of Pemmaraju, Nimmagadda and Kovvuri, explained in the rejection of method claim(s) 4, anticipates/renders obvious the steps of the system of claim(s) 16 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 4 is/are equally applicable to claim(s) 16. See further Pemmaraju ¶55-56.

Claim(s) 6 and 18 is/are rejected under 35 U.S.C. 103 as being unpatentable over the combination of Pemmaraju and Nimmagadda as applied to claim 5 and 17 above, and further in view of Eidelman et al (US 20230214754).
Regarding claim 6, the combination of Pemmaraju and Nimmagadda disclose the method of claim 5, but fail to teach, where Eidelman teaches, wherein sequentially executing the plurality of subgraphs comprises an atomic operation (¶441-442 Subgraph merging module may create a subgraph with a mix of keyed and non-keyed nodes and keyed and non-keyed relationships; subgraph merging module may execute the merging logic as an atomic operation). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein sequentially executing the plurality of subgraphs comprises an atomic operation from Eidelman into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve merging logic of graphs and subgraphs.

Regarding claim(s) 18 (drawn to a system): The rejection/proposed combination of Pemmaraju, Nimmagadda and Eidelman, explained in the rejection of method claim(s) 6, anticipates/renders obvious the steps of the system of claim(s) 18 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 6 is/are equally applicable to claim(s) 18. See further Pemmaraju ¶55-56.

Claim(s) 8-9 and 20-21 is/are rejected under 35 U.S.C. 103 as being unpatentable over the combination of Pemmaraju and Nimmagadda as applied to claim 1 and 13 above, and further in view of Freed et al (US 20240378666).

Regarding claim 8, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, but fail to teach, where Freed teaches, wherein the machine learning model comprises a generative artificial intelligence model (¶29 a generative artificial intelligence model may be developed to provide analysis and assist users). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein the machine learning model comprises a generative artificial intelligence model from Freed into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve methods for providing insight and analysis of data.

Regarding claim 9, the combination of Pemmaraju, Nimmagadda, and Freed disclose the method of claim 8, wherein the one or more actions comprise generating a response to an input query using the generative artificial intelligence model (Freed ¶8-9 a generative artificial intelligence model configured to receive as input query and the borrower profile and generate predictive responses to the query). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein the one or more actions comprise generating a response to an input query using the generative artificial intelligence model from Freed into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve methods for providing insight and analysis of data.

Regarding claim(s) 20-21 (drawn to a system): The rejection/proposed combination of Pemmaraju, Nimmagadda and Freed, explained in the rejection of method claim(s) 8-9, anticipates/renders obvious the steps of the system of claim(s) 20-21 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 8-9 is/are equally applicable to claim(s) 20-21. See further Pemmaraju ¶55-56.

Claim(s) 10-12 and 22-24 is/are rejected under 35 U.S.C. 103 as being unpatentable over the combination of Pemmaraju and Nimmagadda as applied to claim 1 and 13 above, and further in view of Michalakis et al (US 20190384291).

Regarding claim 10, the combination of Pemmaraju and Nimmagadda disclose the method of claim 1, but fail to teach, where Michalakis teaches, wherein the machine learning model comprises a classifier neural network (¶79 the artificial neural network 508 may include a feature extractor 510, a classifier 512). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein the machine learning model comprises a classifier neural network from Michalakis into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve machine learning models, such as the object detection models.

Regarding claim 11, the combination of Pemmaraju, Nimmagadda and Michalakis disclose the method of claim 10, wherein the one or more actions comprise generating one or more control signals to control an autonomous vehicle based on a classification of one or more objects in a scene generated by the classifier neural network (Michalakis ¶88 at block 612, the autonomous vehicle controls one or more actions based on the adjusted autonomous driving system). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein the one or more actions comprise generating one or more control signals to control an autonomous vehicle based on a classification of one or more objects in a scene generated by the classifier neural network from Michalakis into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve machine learning models, such as the object detection models.
Regarding claim 12, the combination of Pemmaraju, Nimmagadda and Michalakis disclose the method of claim 10, wherein the one or more actions comprise applying different levels of compression to different portions of an image based on classifications of different objects in the image generated by the classifier neural network (Michalakis ¶46 the sub-sampling may compress the frame via a compression standard, such as Dirac, moving picture experts group (MPEG)-4, high efficiency video coding (HEVC), etc. Furthermore, the sub-sampling may adjust the video's frame-rate. For example, the frame rate may be adjusted from thirty frames per second to three frames per second; ¶67 At block 408, the sub-sampling model sub-samples the frame corresponding to the extracted features. The sub-sampling may include reducing a frame rate, reducing the frame's resolution, adjusting attributes of objects in the frame, compressing the frame, encoding the frame, reducing a number of channels of the frame, filtering elements out of the frame, and/or other techniques from reducing a memory footprint of the frame.). Therefore, it would have been obvious to one with ordinary skill in the art before the effective filing date of the invention to have implemented the teaching of wherein the one or more actions comprise applying different levels of compression to different portions of an image based on classifications of different objects in the image generated by the classifier neural network from Michalakis into the method as disclosed by the combination of Pemmaraju and Nimmagadda. The motivation for doing this is to improve machine learning models, such as the object detection models. 
Regarding claim(s) 22-24 (drawn to a system): The rejection/proposed combination of Pemmaraju, Nimmagadda and Michalakis, explained in the rejection of method claim(s) 10-12, anticipates/renders obvious the steps of the system of claim(s) 22-24 because these steps occur in the operation of the proposed combination as discussed above. Thus, the arguments similar to that presented above for claim(s) 10-12 is/are equally applicable to claim(s) 22-24. See further Pemmaraju ¶55-56.

Response to Arguments

Applicant's arguments with respect to claim(s) 1-30 have been considered but are moot because the new ground of rejection does not rely on any reference applied in the prior rejection of record for any teaching or matter specifically challenged in the argument.

Conclusion

Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).

A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.

Any inquiry concerning this communication or earlier communications from the examiner should be directed to KEVIN KY, whose telephone number is (571) 272-7648. The examiner can normally be reached Monday-Friday, 9 AM-5 PM.

Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO-supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Vincent Rudolph, can be reached at 571-272-8243. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.

Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.

/KEVIN KY/
Primary Examiner, Art Unit 2671

Prosecution Timeline

Nov 08, 2023: Application Filed
Sep 29, 2025: Non-Final Rejection (§103)
Dec 29, 2025: Response Filed
Apr 06, 2026: Final Rejection (§103) (current)

Precedent Cases

Applications granted by this same examiner with similar technology

Patent 12597158: POSE ESTIMATION (granted Apr 07, 2026; 2y 5m to grant)
Patent 12597291: IMAGE ANALYSIS FOR PERSONAL INTERACTION (granted Apr 07, 2026; 2y 5m to grant)
Patent 12586393: KNOWLEDGE-DRIVEN SCENE PRIORS FOR SEMANTIC AUDIO-VISUAL EMBODIED NAVIGATION (granted Mar 24, 2026; 2y 5m to grant)
Patent 12586559: METHOD AND APPARATUS FOR GENERATING SPEECH OUTPUTS IN A VEHICLE (granted Mar 24, 2026; 2y 5m to grant)
Patent 12579382: NATURAL LANGUAGE GENERATION USING KNOWLEDGE GRAPH INCORPORATING TEXTUAL SUMMARIES (granted Mar 17, 2026; 2y 5m to grant)
Study what changed to get past this examiner. Based on 5 most recent grants.


Prosecution Projections

Expected OA Rounds: 3-4
Grant Probability: 76%
Grant Probability with Interview: 99% (+25.3%)
Median Time to Grant: 2y 6m
PTA Risk: Moderate
Based on 549 resolved cases by this examiner. Grant probability derived from career allow rate.
