Prosecution Insights
Last updated: May 29, 2026
Application No. 18/259,479

SYSTEM AND METHOD FOR LOCAL SPATIAL FEATURE POOLING FOR FINE-GRAINED REPRESENTATION LEARNING

Final Rejection §103
Filed
Jun 27, 2023
Priority
Feb 16, 2021 — provisional 63/149,714 +1 more
Examiner
ISLAM, MEHRAZUL NMN
Art Unit
2662
Tech Center
2600 — Communications
Assignee
Carnegie Mellon University
OA Round
3 (Final)
60%
Grant Probability
Moderate
4-5
OA Rounds
4m
Est. Remaining
87%
With Interview

Examiner Intelligence

Grants 60% of resolved cases
60%
Career Allowance Rate
32 granted / 53 resolved
-1.6% vs TC avg
Strong +26% interview lift
Without
With
+26.2%
Interview Lift
resolved cases with interview
Typical timeline
3y 3m
Avg Prosecution
31 currently pending
Career history
98
Total Applications
across all art units

Statute-Specific Performance

§101
1.2%
-38.8% vs TC avg
§103
96.7%
+56.7% vs TC avg
§102
1.2%
-38.8% vs TC avg
Black line = Tech Center average estimate • Based on career data from 53 resolved cases

Office Action

§103
DETAILED ACTION Notice of Pre-AIA or AIA Status The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA . Continued Examination Under 37 CFR 1.114 A request for continued examination under 37 CFR 1.114, including the fee set forth in 37 CFR 1.17(e), was filed in this application after final rejection. Since this application is eligible for continued examination under 37 CFR 1.114, and the fee set forth in 37 CFR 1.17(e) has been timely paid, the finality of the previous Office action has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 04/07/2026 has been entered. Response to Amendment 5. In light of Applicant’s amendment of the claims, the claim objections with respect to claims 4-12 are withdrawn. Status of Claims Claims 1-14 are pending. Claims 1, 2 and 4-14 are amended. Response to Arguments Applicant’s amendment of independent Claim 1, which has altered the scope of the claims of the instant application, has necessitated the new ground(s) of rejection presented in this office action with respect to claims of the instant application. Accordingly, in response to Applicant’s arguments that are merely directed to the amended portion of the claims, new analyses have been presented below, which make Applicant’s arguments moot. Claim Rejections - 35 USC § 103 The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action: A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made. The factual inquiries for establishing a background for determining obviousness under 35 U.S.C. 103 are summarized as follows: 1. Determining the scope and contents of the prior art. 2. Ascertaining the differences between the prior art and the claims at issue. 3. Resolving the level of ordinary skill in the pertinent art. 4. Considering objective evidence present in the application indicating obviousness or nonobviousness. This application currently names joint inventors. In considering patentability of the claims the examiner presumes that the subject matter of the various claims was commonly owned as of the effective filing date of the claimed invention(s) absent any evidence to the contrary. Applicant is advised of the obligation under 37 CFR 1.56 to point out the inventor and effective filing dates of each claim that was not commonly owned as of the effective filing date of the later invention in order for the examiner to consider the applicability of 35 U.S.C. 102(b)(2)(C) for any potential 35 U.S.C. 102(a)(2) prior art against the later invention. Claims 1-7 and 9-14 are rejected under 35 U.S.C. 103 as being unpatentable over Fu et al. (US 2020/0160124 A1), in view of Sriram et al. (US 2020/0151489 A1) and in further view of Liao et al. (US 2021/0201071 A1). Regarding claim 1, Fu teaches, A method comprising: extracting (Fu, ¶0113: “method in the subject matter described herein, comprising: extracting”) key local landmarks from an input image; the key local landmarks representing (Fu, ¶0033: “sub-network 222 performs the feature extraction at a finer scale than the full-image scale, the extracted feature 223 may be referred to as a local feature”) that, when considered together, are indicative of a structure of an object in the input image; (Fu, ¶0022: “extracting features from each of the regions and recognizing a specific category of the object based on these features”) global feature representations (Fu, ¶0037: “global feature information of the image 170, and thus may be combined with the local feature 223”) produced by the deep CNN model (Fu, ¶0034: “sub-network 222 may be comprised of a CNN network”) to create combined feature representations. (Fu, ¶0046: “the local feature 333 may be combined with the global feature 213”). However, Fu does not explicitly teach, mapping the pixel locations of the key local landmarks into a feature map of an intermediate convolutional layer of a deep CNN model at positions in the intermediate convolutional layer corresponding to the pixel locations of the key local landmarks from the input image; extracting local feature representations of the key landmarks from the feature map of the intermediate convolutional layer of the deep CNN model containing the mapped key local landmarks. In an analogous field of endeavor, Sriram teaches, mapping the pixel locations of the key local landmarks (Sriram, ¶0031: “the bounding shapes may be computed by the object detector 106 as pixel locations”) into a feature map of an intermediate convolutional layer of a deep CNN model at positions in the intermediate convolutional layer (Sriram, ¶0082: “CNN 112D may more accurately account for the portions of the feature map computed by the feature extractor 504 that correspond to the person region 518, the object region 520, and the union region 522”) corresponding to the pixel locations of the key local landmarks (Sriram, ¶0072: “object coordinates (e.g., coordinates corresponding to the object region, such as the bounding shape delineating the object region”) from the input image; (Sriram, ¶0033: “For a given object detected within an instance of the sensor data 102, pixel coordinate(s) corresponding to the object may be used”). Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to modify Fu using the teachings of Sriram to introduce mapping landmark pixel locations representing the boundary/structure of an object in a CNN based feature map. A person skilled in the art would be motivated to combine the known elements as described above and achieve the predictable result of classification of an object using key pixel coordinates representing the object. Therefore, it would have been obvious to combine the analogous arts Fu and Sriram to obtain the above-described limitations in claim 1. However, the combination of Fu and Sriram does not explicitly teach, extracting local feature representations of the key landmarks from the feature map of the intermediate convolutional layer of the deep CNN model containing the mapped key local landmarks. In another analogous field of endeavor, Liao teaches, extracting local feature representations of the key landmarks from the feature map of the intermediate convolutional layer of the deep CNN model (Liao, ¶0091: “feature map is extracted from the fifth convolutional layer of the learning network 510 even though the feature map from any other layer may be used”) containing the mapped key local landmarks. (Liao, ¶0032: “objects contained in the image such as persons, animals, natural features, buildings and the like”). Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to modify Fu in view of Sriram using the teachings of Liao to introduce extracting features from an intermediate convolutional layer. A person skilled in the art would be motivated to combine the known elements as described above and achieve the predictable result of obtaining multiscale representation of an object for improved object classification. Therefore, it would have been obvious to combine the analogous arts Fu, Sriram and Liao to obtain the invention in claim 1. Regarding claim 2, Fu in view of Sriram and in further view of Liao teaches, The method of claim 1, further comprising: sending the combined feature representations to a classifier to be used (Fu, ¶0037: “FC layer 217 is concatenated to the output of the FC layer 227… input to a further Softmax layer”) to classify the object in the input image from which the key local landmarks were extracted. (Fu, ¶0037: “input to a further Softmax layer (not shown) to determine the category of the object in the image”). Regarding claim 3, Fu in view of Sriram and in further view of Liao teaches, The method of claim 1, further comprising: selecting a subset of the local feature representations (Fu, ¶0084: “second attention region is comprised in the first attention region and comprises a discriminative sub-portion of the object in the image”) to be combined with the global feature representations. (Fu, ¶0046: “the local feature 333 may be combined with the global feature 213 and/or local feature 223 to determine the category of the object in the image 170”). Regarding claim 4, Fu in view of Sriram and in further view of Liao teaches, The method of claim 3, wherein the subset of local feature representations is selected based on a weighting scheme wherein a predetermined number of higher-weighted local feature representations are selected. (Sriram, ¶0039: “generate a plurality of confidences associated with each object/person pair, and the highest confidence 116 may be used”). Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to modify Fu, in view of Sriram, in further view of Liao, using the additional teachings of Sriram to introduce confidence scoring. A person skilled in the art would be motivated to combine the known elements as described above and achieve the predictable result of selecting the objects with highest confidence. Therefore, it would have been obvious to combine the analogous arts Fu, Sriram and Liao to obtain the invention in claim 4. Regarding claim 5, Fu in view of Sriram and in further view of Liao teaches, The method of claim 4, wherein the weighting scheme is a learned weighting scheme. (Fu, ¶0024: “accurate discriminative portion localization can promote learning fine-grained features, which in turn can further help to accurately localize the discriminative portions”). Regarding claim 6, Fu in view of Sriram and in further view of Liao teaches, The method of claim 5, wherein the learned weighting scheme assigns weights depending on the ability of the local feature representations to discriminate (Sriram, ¶0033: “a confidence 116 may be determined for each object”) between objects in the input image belonging to different subclasses. (Fu, ¶0021: “for different species of birds, the differences may lie in the colors and/or patterns of their necks, backs or tails, the shapes and/or colors of their beaks or claws, or the like. Such a portion that is applicable to determine a specific category of an object may be referred to as a discriminative portion of the object”). Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to modify Fu, in view of Sriram, in further view of Liao, using the additional teachings of Sriram to introduce assigning confidence based on accuracy of detection. A person skilled in the art would be motivated to combine the known elements as described above and achieve the predictable result of only selecting the feature representations with higher detection accuracy. Therefore, it would have been obvious to combine the analogous arts Fu, Sriram and Liao to obtain the invention in claim 6. Regarding claim 7, Fu in view of Sriram and in further view of Liao teaches, The method of claim 4, wherein the predetermined number is a learned number. (Sriram, ¶0048: “the parameters may be learned by the machine learning model(s) 112 during training”). Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to modify Fu, in view of Sriram, and in further view of Liao, using the additional teachings of Sriram, to introduce a learned parameter. A person skilled in the art would be motivated to combine the known elements as described above and achieve the predictable result of learning to optimize a selection parameter during training. Therefore, it would have been obvious to combine the analogous arts Fu, Sriram and Liao to obtain the invention in claim 7. Regarding claim 9, Fu in view of Sriram and in further view of Liao teaches, The method of claim 3, wherein the subset of local feature representations is selected based on explicit knowledge of a domain of objects depicted in the input image. (Fu, ¶0052: “if the learning network structure in FIG. 2 or 3 is to be trained as being capable of recognizing a plurality of species of birds, the training images may include images of different species of birds”). Regarding claim 10, Fu in view of Sriram and in further view of Liao teaches, The method of claim 1, wherein the local feature representations are combined with the global feature representations by concatenation. (Fu, ¶0037: “FC layer 217 is concatenated to the output of the FC layer 227”). Regarding claim 11, Fu in view of Sriram and in further view of Liao teaches, The method of claim 1, wherein the key local landmarks in the input image are mapped to the feature map (Fu, ¶0035: "after a third convolutional layer of the deep CNN model. (Li, ¶0091: “feature map is extracted from the fifth convolutional layer of the learning network 510 even though the feature map from any other layer may be used”). The proposed combination as well as the motivation for combining Fu, Sriram and Liao references presented in the rejection of claim 1, apply to claim 11 and are incorporated herein by reference. Thus, the method recited in claim 11 is met by Fu, Sriram and Liao. Regarding claim 12, Fu in view of Sriram and in further view of Liao teaches, The method of claim 1, wherein extracting key local landmarks from an input image comprises exposing the input image to a CNN model trained with a dataset comprising images with annotated landmarks. (Fu, ¶0022: “determining specific regions in the image by known bounding boxes or region annotations in a supervised fashion; then, extracting features from each of the regions and recognizing a specific category of the object based on these features”). Regarding claim 13, Fu in view of Sriram and in further view of Liao teaches, A system, comprising: a processor; and memory, storing software that, when executed by the processor, performs the method of claim 1. (Fu, ¶0105: “The device comprises: a processing unit; and a memory coupled to the processing unit and having instructions stored thereon. The instructions, when executed by the processing unit, cause the device to perform acts”). Regarding claim 14, Fu in view of Sriram and in further view of Liao teaches, A system, comprising: a processor; and memory, storing software that, when executed by the processor, performs the method of claim 4. (Fu, ¶0105: “The device comprises: a processing unit; and a memory coupled to the processing unit and having instructions stored thereon. The instructions, when executed by the processing unit, cause the device to perform acts”). Claim 8 is rejected under 35 U.S.C. 103 as being unpatentable over Fu et al. (US 2020/0160124 A1), in view of Sriram et al. (US 2020/0151489 A1), in further view of Liao et al. (US 2021/0201071 A1) and still in further view of Iwamoto et al. (US 2014/0328543 A1). Regarding claim 8, Fu in view of Sriram, in further view of Liao teaches, The method of claim 7, (Fu, ¶0021: “for different species of birds, the differences may lie in the colors and/or patterns of their necks, backs or tails, the shapes and/or colors of their beaks or claws, or the like. Such a portion that is applicable to determine a specific category of an object may be referred to as a discriminative portion of the object”). However, the combination of Fu, Sriram and Liao does not explicitly teach, wherein the predetermined number is learned based on an optimal number of local feature presentations needed. In an analogous field of endeavor, Iwamoto teaches, wherein the predetermined number is learned based on an optimal number of local feature presentations needed (Iwamoto, ¶0100: “the selection number determining unit 50 may be configured so as to determine the number of feature points and the number of dimensions so that at least one of the number of feature points and the number of dimensions is reduced”). Therefore, it would have been obvious to a person of ordinary skill in the art before the effective filing date of the claimed invention to modify Fu, in view of Li and in further view of Liao using the teachings of Iwamoto, to introduce selecting an optimal number of feature points. A person skilled in the art would be motivated to combine the known elements as described above and achieve the predictable result of increasing the speed of operation. Therefore, it would have been obvious to combine the analogous arts Fu, Sriram, Liao and Iwamoto to obtain the invention in claim 8. Conclusion Any inquiry concerning this communication or earlier communications from the examiner should be directed to MEHRAZUL ISLAM whose telephone number is (571)270-0489. The examiner can normally be reached Monday-Friday: 8am-5pm. Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Saini Amandeep can be reached at (571) 272-3382. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300. Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. /MEHRAZUL ISLAM/Examiner, Art Unit 2662 /AMANDEEP SAINI/Supervisory Patent Examiner, Art Unit 2662
Read full office action

Prosecution Timeline

Jun 27, 2023
Application Filed
Sep 11, 2025
Non-Final Rejection mailed — §103
Dec 10, 2025
Response Filed
Jan 23, 2026
Final Rejection mailed — §103
Apr 07, 2026
Request for Continued Examination
Apr 13, 2026
Response after Non-Final Action
Apr 22, 2026
Non-Final Rejection mailed — §103 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

Patent 12632950
ROBOTIC BUILDING INSPECTION
4y 7m to grant Granted May 19, 2026
Patent 12607459
REAL-TIME THREE-DIMENSIONAL SHAPE MEASUREMENT SYSTEM AND SHAPE MEASUREMENT METHOD USING DIAGONAL LINE PATTERN IRRADIATION METHOD
2y 9m to grant Granted Apr 21, 2026
Patent 12602808
METHOD FOR INSPECTING AN OBJECT
4y 9m to grant Granted Apr 14, 2026
Patent 12592075
REMOTE SENSING FOR INTELLIGENT VEGETATION TRIM PREDICTION
3y 6m to grant Granted Mar 31, 2026
Patent 12579695
Method of Generating Target Image Data, Electrical Device and Non-Transitory Computer Readable Medium
3y 2m to grant Granted Mar 17, 2026
Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.
Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

4-5
Expected OA Rounds
60%
Grant Probability
87%
With Interview (+26.2%)
3y 3m (~4m remaining)
Median Time to Grant
High
PTA Risk
Based on 53 resolved cases by this examiner. Grant probability derived from career allowance rate.

Sign in with your work email

Enter your email to receive a magic link. No password needed.

Personal email addresses (Gmail, Yahoo, etc.) are not accepted.

Free tier: 3 strategy analyses per month