Prosecution Insights
Last updated: May 29, 2026
Application No. 18/466,773

POINT CLOUD SEARCH USING MULTI-MODAL EMBEDDINGS

Non-Final OA §103
Filed
Sep 13, 2023
Examiner
PATEL, JAYESH A
Art Unit
2677
Tech Center
2600 — Communications
Assignee
GM Cruise Holdings LLC
OA Round
2 (Non-Final)
83%
Grant Probability
Favorable
2-3
OA Rounds
2m
Est. Remaining
89%
With Interview

Examiner Intelligence

Grants 83% — above average
83%
Career Allowance Rate
746 granted / 894 resolved
+21.4% vs TC avg
Moderate +5% lift
Without
With
+5.3%
Interview Lift
resolved cases with interview
Typical timeline
2y 11m
Avg Prosecution
15 currently pending
Career history
921
Total Applications
across all art units

Statute-Specific Performance

§101
2.4%
-37.6% vs TC avg
§103
77.1%
+37.1% vs TC avg
§102
7.5%
-32.5% vs TC avg
§112
9.3%
-30.7% vs TC avg
Black line = Tech Center average estimate • Based on career data from 894 resolved cases

Office Action

§103
Notice of Pre-AIA or AIA Status The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA . Response to Arguments Applicant’s arguments with respect to claims 1-20 have been considered but are moot because the new ground of rejection does not rely on any reference applied in the prior rejection of record for any teaching or matter specifically challenged in the argument. Claim Rejections - 35 USC § 103 In the event the determination of the status of the application as subject to AIA 35 U.S.C. 102 and 103 (or as subject to pre-AIA 35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status. The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action: A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made. Claims 1-2, 4-9, 11-16 and 18-20 are rejected under 35 U.S.C. 103 as being unpatentable over O’ Hara et al., (US12158762) hereafter O’Hara in view of NPL1 (CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data, Yihan Zeng et al., arXiv, 26 mar 2023, Pages 1-15) hereafter NPL1 . 1. Regarding claim 1, O’Hara discloses an apparatus (figs 1, 2A-2C and 3, col 9 lines 32 through col 10 lines 4, col 12 lines 10 through col 17 lines 60 shows and discloses an apparatus) comprising: at least one memory (fig 1 memory 124, col 9 lines 32 through col 10 lines 4 shows and describes at least one memory); and at least one processor coupled to the at least one memory, the at least one processor configured (fig 1 processor 122 coupled to memory 124, col 9 lines 32 through col 10 lines 4 shows and describes the at least one processor coupled to the at least one memory configured to process the instructions) to: receive road data, wherein the road data represents a real-world environment encountered by an autonomous vehicle (AV) and wherein the road data comprises point cloud data representing a plurality of objects (figs 1, 2B-2C,3 (301), col 12 lines 10-34, col 14 lines 65 through col 15 lines 10 shows and discloses a LIDAR image that contains the point cloud (depth information) of captured objects in front of the autonomous vehicle on the road meeting the claim limitations) and; generate, for each of the plurality of objects, a corresponding set of first embeddings (fig 2C, col 14 lines 21-27 shows and discloses generate by the text encoder 293, for each of the plurality of objects, a corresponding set of first embeddings TE_1, TE_2------ TE_M corresponding to the plurality of the objects A, B----M meeting the claim limitations) receive a text string corresponding to a searched object (figs 2C col 14 lines 1-27 shows and discloses receiving by the text encoder a text string corresponding to a searched object (i.e for example Text string describing object A, Text string B describing object B—etc.) meeting the above claim limitations); generate a second embedding corresponding to the searched object (figs 2C, col 14 lines 27-44 shows and discloses generating an image embedding X corresponding to the searched object (for example object A, object B etc.) meeting the claim limitations) identify a matching object among the plurality of objects based on a comparison of the set of first embeddings and the second embedding (fig 2C, col 14 lines 30-51 discloses identifying a particular object based on the similarity measures between the image embedding X (second embedding) and the text embeddings TE_1----TE_M (first set of the embeddings) and determines that image X matches the particular object (i.e a matching object) meeting the above claim limitations). As seen above O’Hara discloses set of embeddings. O’Hara is silent and however fails to disclose wherein each embedding in the first set of embeddings is a multi-modal embedding including at least text data, image data and point-cloud data and wherein the second embedding is a multi-modal embedding including at least text data, image data and point-cloud data. NPL1 discloses wherein each embedding in the first set of embeddings is a multi-modal embedding including at least text data, image data and point-cloud data (fig 2, page 3 shows Text encoder (embedding), Image encoder (embedding) and PC encoder (Point-cloud embedding) in the cross-modal (multi-modal) pretraining (generating) meeting the above claim limitations) and wherein the second embedding is a multi-modal embedding including at least text data, image data and point-cloud data (fig 2, page 3 shows the recognition (i.e the searched object) output data ET (Text data), EP (point cloud data) and EI (Image data) in the CLIP Zero-shot recognition meeting the above claim limitations). Before the effective filing date of the invention was made, O’Hara and NPL1 are combinable because they are from the same filed of endeavor and are analogous art of image processing. The suggestion/motivation would be an optimized, enhanced and increased quality system/method on page 2 col 2 lines 14-27. Therefore, it would be obvious and within one of ordinary skill in the art to have recognized the advantages of NPL1 in the apparatus of O’Hara to obtain the invention as specified in claim 1. 2. Regarding claim 2, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein the comparison is based on a set of distances between the second embedding and the set of first embeddings (col 3 lines 28-38 and col 19 lines 13-62 discloses wherein the comparison is based on a set of distances (i.e cosine distances) between the second embedding (i.e the image embedding) and the set of first embeddings (i.e plurality of text embeddings) meeting the claim limitations). 3. Regarding claim 4, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein the second embedding is generated using the text string (figs 2A, 2C, col 14 lines 1-51 shows and discloses wherein the second embedding (i.e the image embedding) is generated using the text string meeting the claim limitations). 4. Regarding claim 5, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein the at least one processor is further configured to: receive an image corresponding to the searched object (fig 2C shows and discloses wherein the at least one processor is further configured to: receive an image corresponding to the searched object). 5. Regarding claim 6, O’Hara and NPL1 discloses the apparatus of claim 5. O’Hara discloses further wherein the second embedding is based on the image (figs 2C, col 14 lines 27-44 shows and discloses generating an image embedding X corresponding to the searched object (for example object A, object B etc.) meeting the claim limitations). 6. Regarding claim 7, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein for each object of the plurality of objects, embedding in the corresponding set of first embeddings comprises a vector representing characteristics of the object in the plurality of the objects (fig 2C and col 11 lines 1-11 discloses wherein for each object of the plurality of objects, embedding in the corresponding set of first embeddings comprises a vector representing characteristics of the object in the plurality of the objects). 7. Claim 8 is a corresponding method claim of claim 1. See the corresponding explanation of claim 1. 8. Claim 9 is a corresponding method claim of claim 2. See the corresponding explanation of claim 2. 9. Claim 11 is a corresponding method claim of claim 4. See the corresponding explanation of claim 4. 10. Claim 12 is a corresponding method claim of claim 5. See the corresponding explanation of claim 5. 11. Claim 13 is a corresponding method claim of claim 6. See the corresponding explanation of claim 6. 12. Claim 14 is a corresponding method claim of claim 7. See the corresponding explanation of claim 7. 13. Claim 15 is a corresponding non-transitory computer readable medium claim of claim 1. See the corresponding explanation of claim 1. 14. Claim 16 is a corresponding non-transitory computer readable medium claim of claim 2. See the corresponding explanation of claim 2. 15. Claim 18 is a corresponding non-transitory computer readable medium claim of claim 4. See the corresponding explanation of claim 4. 16. Claim 19 is a corresponding non-transitory computer readable medium claim of claim 5. See the corresponding explanation of claim 5. 17. Claim 20 is a corresponding non-transitory computer readable medium claim of claim 6. See the corresponding explanation of claim 6. Claims 3, 10 and 17 are rejected under 35 U.S.C. 103 as being unpatentable over O’ Hara in view of NPL1 and in further view of CHEN et al.(US20230047160) hereafter CHEN. 18. Regarding claim 3, O’Hara and NPL1 discloses the apparatus of claim 2. O’Hara discloses further wherein the matching object is a lowest (col 3 lines 28-38, col 11 lines 31-56 and col 19 lines 13-62 discloses wherein the matching object is the lowest cosine distance in the set of distances). O’Hara discloses other similarity distance measures as seen in the disclosure. O’Hara and NPL1 are however silent and fails to disclose a lowest Euclidean distance. CHEN discloses a lowest Euclidean distance of the matching object in para 0064. Before the effective filing date of the invention was made, CHEN, O’Hara and NPL1 are combinable because they are from the same filed of endeavor and are analogous art of vehicle data processing. The suggestion/motivation would be an efficient system (i.e minimizes aggregate loss) at para 0016. Therefore, it would be obvious and within one of ordinary skill in the art to have recognized the advantages of CHEN in the apparatus of O’Hara and NPL1 to obtain the invention as specified in claim 3. 19. Claim 10 is a corresponding method claim of claim 3. See the corresponding explanation of claim 3. 20. Claim 17 is a corresponding non-transitory computer readable medium claim of claim 3. See the corresponding explanation of claim 3. Examiner's Note: Examiner has cited figures, and paragraphs in the references as applied to the claims above for the convenience of the applicant. Although the specified citations are representative of the teachings in the art and are applied to the specific limitations within the individual claim, other passages and figures may apply as well. It is respectfully requested for the applicant, in preparing the responses, to fully consider the references in entirety as potentially teaching all or part of the claimed invention, as well as the context of the passage as taught by the prior art or disclosed by the examiner. Examiner has also cited references in PTO892 but not relied on, which are relevant and pertinent to the applicant’s disclosure, and may also be reading (anticipatory/obvious) on the claims and claimed limitations. Applicant is advised to consider the references in preparing the response/amendments in-order to expedite the prosecution. Conclusion Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action. Any inquiry concerning this communication or earlier communications from the examiner should be directed to JAYESH PATEL whose telephone number is (571)270-1227. The examiner can normally be reached IFW Mon-FRI. Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Andrew Bee can be reached at 571-270-5183. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300. Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. JAYESH PATEL Primary Examiner Art Unit 2677 /JAYESH A PATEL/Primary Examiner, Art Unit 2677
Read full office action

Prosecution Timeline

Show 1 earlier event
Sep 05, 2025
Non-Final Rejection mailed — §103
Nov 24, 2025
Interview Requested
Dec 10, 2025
Applicant Interview (Telephonic)
Dec 10, 2025
Examiner Interview Summary
Dec 11, 2025
Response Filed
Jan 16, 2026
Final Rejection mailed — §103
Feb 19, 2026
Interview Requested
Mar 03, 2026
Response after Non-Final Action

Precedent Cases

Applications granted by this same examiner with similar technology

Patent 12639938
ASYNCHRONOUS MULTIMODAL FEATURE FUSION
2y 6m to grant Granted May 26, 2026
Patent 12632978
CONFIGURATION OF ROBOT OPERATIONAL ENVIRONMENT INCLUDING LAYOUT OF SENSORS
3y 7m to grant Granted May 19, 2026
Patent 12626396
ELECTRONIC DEVICE AND CONTROLLING METHOD OF ELECTRONIC DEVICE
3y 1m to grant Granted May 12, 2026
Patent 12620260
INFORMATION PROCESSING METHOD, COMPUTER DEVICE, AND STORAGE MEDIUM
2y 4m to grant Granted May 05, 2026
Patent 12620208
FACIAL BEAUTY PREDICTION METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM
2y 0m to grant Granted May 05, 2026
Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.
Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

2-3
Expected OA Rounds
83%
Grant Probability
89%
With Interview (+5.3%)
2y 11m (~2m remaining)
Median Time to Grant
Moderate
PTA Risk
Based on 894 resolved cases by this examiner. Grant probability derived from career allowance rate.

Sign in with your work email

Enter your email to receive a magic link. No password needed.

Personal email addresses (Gmail, Yahoo, etc.) are not accepted.

Free tier: 3 strategy analyses per month