Last updated: May 29, 2026

Application No. 18/353,637

EFFICIENT TENSOR REMATERIALIZATION FOR NEURAL NETWORKS

Non-Final OA §101§102§103§112

Filed

Jul 17, 2023

Priority

Jan 25, 2023 — provisional 63/441,150

Examiner

HICKS, AUSTIN JAMES

Art Unit

2142

Tech Center

2100 — Computer Architecture & Software

Assignee

Qualcomm Incorporated

OA Round

1 (Non-Final)

Interview Optional

— +24.7% interview lift. Examiner has a relatively high allowance rate (75%); +24.7% interview lift. A written response may suffice.

Based on 411 resolved cases, 2023–2026

Examiner Intelligence

HICKS, AUSTIN JAMES View full profile →

Grants 75% — above average

Career Allowance Rate

310 granted / 411 resolved

+20.4% vs TC avg

Strong +25% interview lift

Without

With

+24.7%

Interview Lift

resolved cases with interview

Typical timeline

3y 2m

Avg Prosecution

30 currently pending

Career history

462

Total Applications

across all art units

Statute-Specific Performance

§101

3.8%

-36.2% vs TC avg

§103

82.6%

+42.6% vs TC avg

§102

9.1%

-30.9% vs TC avg

§112

3.9%

-36.1% vs TC avg

Black line = Tech Center average estimate • Based on career data from 411 resolved cases

Office Action

§101 §102 §103 §112

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Claim Interpretation
The following is a quotation of 35 U.S.C. 112(f):
(f) Element in Claim for a Combination. – An element in a claim for a combination may be expressed as a means or step for performing a specified function without the recital of structure, material, or acts in support thereof, and such claim shall be construed to cover the corresponding structure, material, or acts described in the specification and equivalents thereof. 

 Claims 22-28 use means-for language. The claims in this application are given their broadest reasonable interpretation using the plain meaning of the claim language in light of the specification as it would be understood by one of ordinary skill in the art.  The broadest reasonable interpretation of a claim element (also commonly referred to as a claim limitation) is limited by the description in the specification when 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph, is invoked. 
As explained in MPEP § 2181, subsection I, claim limitations that meet the following three-prong test will be interpreted under 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph:
(A)	the claim limitation uses the term “means” or “step” or a term used as a substitute for “means” that is a generic placeholder (also called a nonce term or a non-structural term having no specific structural meaning) for performing the claimed function; 
(B)	the term “means” or “step” or the generic placeholder is modified by functional language, typically, but not always linked by the transition word “for” (e.g., “means for”) or another linking word or phrase, such as “configured to” or “so that”; and 
(C)	the term “means” or “step” or the generic placeholder is not modified by sufficient structure, material, or acts for performing the claimed function. 
Use of the word “means” (or “step”) in a claim with functional language creates a rebuttable presumption that the claim limitation is to be treated in accordance with 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph. The presumption that the claim limitation is interpreted under 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph, is rebutted when the claim limitation recites sufficient structure, material, or acts to entirely perform the recited function. 
Absence of the word “means” (or “step”) in a claim creates a rebuttable presumption that the claim limitation is not to be treated in accordance with 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph. The presumption that the claim limitation is not interpreted under 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph, is rebutted when the claim limitation recites function without reciting sufficient structure, material or acts to entirely perform the recited function. 
Claim limitations in this application that use the word “means” (or “step”) are being interpreted under 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph, except as otherwise indicated in an Office action. Conversely, claim limitations in this application that do not use the word “means” (or “step”) are not being interpreted under 35 U.S.C. 112(f) or pre-AIA  35 U.S.C. 112, sixth paragraph, except as otherwise indicated in an Office action.

Claim Rejections - 35 USC § 112
The following is a quotation of 35 U.S.C. 112(b):
(b)  CONCLUSION.—The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the subject matter which the inventor or a joint inventor regards as the invention.

Claims 10, 17 and 14 recite the limitation "the second precedence".  There is insufficient antecedent basis for this limitation in the claim.
Claim 3 is rejected under 35 U.S.C. 112(b) or 35 U.S.C. 112 (pre-AIA ), second paragraph, as being indefinite for failing to particularly point out and distinctly claim the subject matter which the inventor or a joint inventor (or for applications subject to pre-AIA  35 U.S.C. 112, the applicant), regards as the invention. In claim 3, Applicant claims the “precedence constraint for each of the multiple nodes…” This contradicts the first claimed precedence constraint in claim 1, where it’s “a precedence constraint for the multiple nodes...” Amend to claim a second precedence constraint for each of the multiple nodes.
Claims 5, 12, 19 and 26 recite the limitation "the node".  There is insufficient antecedent basis for this limitation in the claim. There is no defined one node. The defined nodes are multiple nodes and “each node”.


Claim Rejections - 35 USC § 101
35 U.S.C. 101 reads as follows:
Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and requirements of this title.


Claims 1-28 are rejected under 35 U.S.C. 101 because the claimed invention is directed to an abstract idea of a mathematical relationship without significantly more. The claims recite receiving a graph NN, determining parameters of the graph and its nodes. This judicial exception is not integrated into a practical application because this method does not improve the functioning of a computer and is merely linked to the technical field of computing. The claims do not include additional elements that are sufficient to amount to significantly more than the judicial exception because the additional elements of memory and processors are generic computer part.



Claim Rejections - 35 USC § 102
The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the basis for the rejections under this section made in this Office action:
A person shall be entitled to a patent unless –

(a)(1) the claimed invention was patented, described in a printed publication, or in public use, on sale, or otherwise available to the public before the effective filing date of the claimed invention.

Claims 1-6, 8-13, 15-20 and 22-27 are rejected under 35 U.S.C. 102(a)(1) as being described by POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging by Patil et al

Patil teaches claims 1, 8, 15 and 22. A processor-implemented method, performed by at least one processor, the method comprising:
receiving, by the at least one processor, a graph representing an artificial neural network (ANN), the graph including multiple nodes connected by edges and each node represents an operation; (Patil abs. “POET, an algorithm to enable training large neural networks on memory-scarce battery-operated edge devices.” The neural network and its nodes are shown in fig. 1, below.)

    PNG
    media_image1.png
    184
    728
    media_image1.png
    Greyscale

determining, by the at least one processor, retention intervals for the multiple nodes based on a precedence constraint for the multiple nodes, the retention intervals corresponding to a time interval for retaining each node output in a local memory; and (Patil sec. 5.2 “consider a graph where the output depends on the result of two operations v1 and v2 where both nodes have equivalent memory costs but v2 is cheaper to evaluate. A paging strategy may evict v2 which would force rematerialization to recompute the more expensive v1 rather than v2. We represent a schedule as a series of nodes that are either being saved SRAM, (re)computed R or paged from secondary storage SAUX.” The schedule is the retention intervals, the precedence is the series of nodes.)
determining, by the at least one processor, a node of the multiple nodes to recompute based on the retention intervals. (Patil sec. 5.2 “A paging strategy may evict v2 which would force rematerialization to recompute the more expensive v1 rather than v2. We represent a schedule as a series of nodes that are either being saved SRAM, (re)computed R or paged from secondary storage SAUX.”)

Patil teaches claims 2, 9, 16, 23. The processor-implemented method of claim 1, further comprising determining, by the at least one processor, an order for execution of the multiple nodes based on the precedence constraint and a memory constraint. (Patil sec. 5.1 “Given a directed acyclic dataflow graph G = (V,E) with n nodes, a topological ordering {v1,...,vn} is computed which constrains execution to that order of instructions. Two key decision variables are introduced: (1) R ∈ {0,1}n×n where rt,i represents the decision to (re)materialize an operation vi at timestep t and (2) S ∈ {0,1}n× n where st,i represents whether the result of an operation vi is resident in memory at timestep t.”)

Patil teaches claims 3, 10, 17, 24. The processor-implemented method of claim 2, further comprising determining, by the at least one processor, the precedence constraint for each of the multiple nodes based on the retention intervals. (Patil sec. 5.1 “Given a directed acyclic dataflow graph G = (V,E) with n nodes, a topological ordering {v1,...,vn} is computed which constrains execution to that order of instructions. Two key decision variables are introduced: (1) R ∈ {0,1}n×n where rt,i represents the decision to (re)materialize an operation vi at timestep t and (2) S ∈ {0,1}n× n where st,i represents whether the result of an operation vi is resident in memory at timestep t.”)

Patil teaches claims 4, 11, 18, 25. The processor-implemented method of claim 2, further comprising determining, by the at least one processor, the memory constraint based on a physical memory capacity of the local memory and the retention intervals. (Patil sec. 5.1 “From the rematerialzation matrix R, and the storage matrix S, we define a series of constraints to maintain graph dependencies. All arguments for an operation j must be resident in memory prior to running that operation, yielding constraint …. Similarly, the result of an operation is only resident in memory in one of the two cases: a) if it was already res ident in memory before, or b) if it was (re)materialized …. To adhere to the strict constraints on the peak memory used during training, an intermediate variable U ∈ Rn×n is defined. Ut,i is the total memory used by the system during training at timestep t when evaluating operation i. By bounding the maximum value of… to the user-specified memory limit µRAM, we limit the total memory consumption during training.”)

Patil teaches claims 5, 12, 19 and 26. The processor-implemented method of claim 1, in which the retention intervals are determined based on a recompute constraint, the recompute constraint defining a number of times that the node is permitted to be recomputed. (Patil sec. 5.0 “we do not limit rematerialization to occur once. We assume auxiliary storage (e.g., flash/ SD card) is available. However, if auxiliary storage is not available, the POET optimizer will fall back to only performing rematerialization.” The rest of that last sentence is only performing rematierialization once. This makes sense because the previous sentence states that the default is one rematerialization, and without auxiliary memory, there isn’t time for multiple saves for rematerialization. Patil sec 5.0 “POET finds an energy optimal schedule by choosing to either a) rematerialize or b) page the tensors to/from secondary storage such as an SD card.” Choosing to rematerialize/recompute the tensors only once if there is no SD card, is a schedule based on a recompute limit.)

Patil teaches claims 6, 13, 20 and 27. The processor-implemented method of claim 1, in which the precedence constraint is determined based on the edges connecting the multiple nodes. (Patil sec. 4 “When the deleted tensor is needed again, for example, to compute gradients during backpropagation, it is recomputed from the other dependent activations as dictated by the lineage.” The lineage is the string of edges connecting to the other nodes from the deleted tensor.)

Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 7, 14, 21 and 28 is/are rejected under 35 U.S.C. 103 as being unpatentable over POET: Training Neural Networks on Tiny Devices with Integrated Rematerialization and Paging by Patil et al and TensorDIMM: A Practical Near-Memory Processing Architecture for Embeddings and Tensor Operations in Deep Learning by Kwon et al.

Patil teaches claims 7, 14, 21 and 28. The processor-implemented method of claim 1, in which the local memory  (Patil sec 5.0 “POET finds an energy optimal schedule by choosing to either a) rematerialize or b) page the tensors to/from secondary storage such as an SD card.”) 
	Patil doesn’t teach tightly-coupled memory.
	However, Kwon teaches the local memory comprises a tightly-coupled memory. (Kwon abs “We present our vertically integrated hardware/software co-design, which includes a custom DIMM module enhanced with near-memory processing cores tailored for DL tensor operations.” Near-memory cores make the memory tightly-coupled.)
	Kwon, Patil and the claims all teach machine learning memory operations. It would have been obvious to a person having ordinary skill in the art, at the time of filing, to use tightly-coupled memory because “tensor reduction operations should be conducted ‘near memory’ using a lightweight NMP core, incurring minimum power and area overheads to the buffered DIMM device.” Kwon sec. 4.2.


Conclusion
Any inquiry concerning this communication or earlier communications from the examiner should be directed to Austin Hicks whose telephone number is (571)270-3377. The examiner can normally be reached Monday - Thursday 8-4 PST.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Mariela Reyes can be reached at (571) 270-1006. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/AUSTIN HICKS/Primary Examiner, Art Unit 2142

Read full office action

Prosecution Timeline

Jul 17, 2023

Application Filed

Feb 24, 2026

Non-Final Rejection mailed — §101, §102, §103 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

17/638,748

Patent 12639558

NEURAL NETWORK PROCESSOR SYSTEM AND METHODS OF OPERATING AND FORMING THEREOF

4y 3m to grant Granted May 26, 2026

17/956,638

Patent 12626157

IDENTIFYING IDLE-CORES IN DATA CENTERS USING MACHINE-LEARNING (ML)

3y 7m to grant Granted May 12, 2026

18/047,397

Patent 12626178

QUANTUM COMPUTING BASED KERNEL ALIGNMENT FOR A SUPPORT VECTOR MACHINE TASK

3y 6m to grant Granted May 12, 2026

17/694,063

Patent 12608441

SCREENING FOR FLUCTUATING ENERGY RELAXATION TIMES

4y 1m to grant Granted Apr 21, 2026

17/786,650

Patent 12591767

NEURAL NETWORK ACCELERATION CIRCUIT AND METHOD

3y 9m to grant Granted Mar 31, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

1-2

Expected OA Rounds

75%

Grant Probability

99%

With Interview (+24.7%)

3y 2m (~3m remaining)

Median Time to Grant

Low

PTA Risk

Based on 411 resolved cases by this examiner. Grant probability derived from career allowance rate.