Last updated: May 29, 2026

Application No. 18/466,773

POINT CLOUD SEARCH USING MULTI-MODAL EMBEDDINGS

Non-Final OA §103

Filed

Sep 13, 2023

Examiner

PATEL, JAYESH A

Art Unit

2677

Tech Center

2600 — Communications

Assignee

GM Cruise Holdings LLC

OA Round

2 (Non-Final)

Interview Optional

— +5.3% interview lift. Interview already conducted in this application's prosecution history. This examiner has a 83% grant rate with +5.3% interview lift. Since an interview has already been tried, recommend written response with narrowed claims based on precedent claim evolution patterns.

Based on 894 resolved cases, 2023–2026

Examiner Intelligence

PATEL, JAYESH A View full profile →

Grants 83% — above average

Career Allowance Rate

746 granted / 894 resolved

+21.4% vs TC avg

Moderate +5% lift

Without

With

+5.3%

Interview Lift

resolved cases with interview

Typical timeline

2y 11m

Avg Prosecution

15 currently pending

Career history

921

Total Applications

across all art units

Statute-Specific Performance

§101

2.4%

-37.6% vs TC avg

§103

77.1%

+37.1% vs TC avg

§102

7.5%

-32.5% vs TC avg

§112

9.3%

-30.7% vs TC avg

Black line = Tech Center average estimate • Based on career data from 894 resolved cases

Office Action

§103

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Response to Arguments
Applicant’s arguments with respect to claims 1-20 have been considered but are moot because the new ground of rejection does not rely on any reference applied in the prior rejection of record for any teaching or matter specifically challenged in the argument.

Claim Rejections - 35 USC § 103
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA  to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  

The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 1-2, 4-9, 11-16 and 18-20 are rejected under 35 U.S.C. 103 as being unpatentable over O’ Hara et al., (US12158762) hereafter O’Hara in view of NPL1 (CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data, Yihan Zeng et al., arXiv, 26 mar 2023, Pages 1-15) hereafter NPL1 .

1. Regarding claim 1, O’Hara discloses an apparatus (figs 1, 2A-2C and 3, col 9 lines 32 through col 10 lines 4, col 12 lines 10 through col 17 lines 60 shows and discloses an apparatus) comprising: 
at least one memory (fig 1 memory 124, col 9 lines 32 through col 10 lines 4 shows and describes at least one memory); and 
at least one processor coupled to the at least one memory, the at least one processor configured (fig 1 processor 122 coupled to memory 124, col 9 lines 32 through col 10 lines 4 shows and describes the at least one processor coupled to the at least one memory configured to process the instructions) to: 
receive road data, wherein the road data represents a real-world environment encountered by an autonomous vehicle (AV) and wherein the road data comprises point cloud data representing a plurality of objects (figs 1, 2B-2C,3 (301), col 12 lines 10-34, col 14 lines 65 through col 15 lines 10 shows and discloses a LIDAR image that contains the point cloud (depth information) of captured objects in front of the autonomous vehicle on the road meeting the claim limitations) and; 
generate, for each of the plurality of objects, a corresponding set of first embeddings (fig 2C, col 14 lines 21-27 shows and discloses generate by the text encoder 293, for each of the plurality of objects, a corresponding set of first embeddings TE_1, TE_2------ TE_M corresponding to the plurality of the objects A, B----M meeting the claim limitations) 
receive a text string corresponding to a searched object (figs 2C col 14 lines 1-27 shows and discloses receiving by the text encoder a text string corresponding to a searched object (i.e for example Text string describing object A, Text string B describing object B—etc.) meeting the above claim limitations); 
generate a second embedding corresponding to the searched object (figs 2C, col 14 lines 27-44 shows and discloses generating an image embedding X corresponding to the searched object (for example object A, object B etc.) meeting the claim limitations) 
identify a matching object among the plurality of objects based on a comparison of the set of first embeddings and the second embedding (fig 2C, col 14 lines 30-51 discloses identifying a particular object based on the similarity measures between the image embedding X (second embedding) and the text embeddings TE_1----TE_M (first set of the embeddings) and determines that image X matches the particular object (i.e a matching object) meeting the above claim limitations). As seen above O’Hara discloses set of embeddings. O’Hara is silent and however fails to disclose wherein each embedding in the first set of embeddings is a multi-modal embedding including at least text data, image data and point-cloud data
and wherein the second embedding is a multi-modal embedding including at least text data, image data and point-cloud data.
NPL1 discloses wherein each embedding in the first set of embeddings is a multi-modal embedding including at least text data, image data and point-cloud data (fig 2, page 3 shows Text encoder (embedding), Image encoder (embedding) and PC encoder (Point-cloud embedding) in the cross-modal (multi-modal) pretraining (generating) meeting the above claim limitations) and wherein the second embedding is a multi-modal embedding including at least text data, image data and point-cloud data (fig 2, page 3 shows the recognition (i.e the searched object) output data ET (Text data), EP (point cloud data) and EI (Image data) in the CLIP Zero-shot recognition meeting the above claim limitations). Before the effective filing date of the invention was made, O’Hara and NPL1 are combinable because they are from the same filed of endeavor and are analogous art of image processing. The suggestion/motivation would be an optimized, enhanced and increased quality system/method on page 2 col 2 lines 14-27. Therefore, it would be obvious and within one of ordinary skill in the art to have recognized the advantages of NPL1 in the apparatus of O’Hara to obtain the invention as specified in claim 1.

2. Regarding claim 2, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein the comparison is based on a set of distances between the second embedding and the set of first embeddings (col 3 lines 28-38 and col 19 lines 13-62 discloses wherein the comparison is based on a set of distances (i.e cosine distances) between the second embedding (i.e the image embedding) and the set of first embeddings (i.e plurality of text embeddings) meeting the claim limitations).
3. Regarding claim 4, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein the second embedding is generated using the text string (figs 2A, 2C, col 14 lines 1-51 shows and discloses wherein the second embedding (i.e the image embedding) is generated using the text string meeting the claim limitations).  

4. Regarding claim 5, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein the at least one processor is further configured to: receive an image corresponding to the searched object (fig 2C shows and discloses wherein the at least one processor is further configured to: receive an image corresponding to the searched object).  

5. Regarding claim 6, O’Hara and NPL1 discloses the apparatus of claim 5. O’Hara discloses further wherein the second embedding is based on the image (figs 2C, col 14 lines 27-44 shows and discloses generating an image embedding X corresponding to the searched object (for example object A, object B etc.) meeting the claim limitations).  

6. Regarding claim 7, O’Hara and NPL1 discloses the apparatus of claim 1. O’Hara discloses further wherein for each object of the plurality of objects, embedding in the corresponding set of first embeddings comprises a vector representing characteristics of the object in the plurality of the objects (fig 2C and col 11 lines 1-11 discloses wherein for each object of the plurality of objects, embedding in the corresponding set of first embeddings comprises a vector representing characteristics of the object in the plurality of the objects). 

7. Claim 8 is a corresponding method claim of claim 1. See the corresponding explanation of claim 1.

8. Claim 9 is a corresponding method claim of claim 2. See the corresponding explanation of claim 2.

9. Claim 11 is a corresponding method claim of claim 4. See the corresponding explanation of claim 4.

10. Claim 12 is a corresponding method claim of claim 5. See the corresponding explanation of claim 5.

11. Claim 13 is a corresponding method claim of claim 6. See the corresponding explanation of claim 6.

12. Claim 14 is a corresponding method claim of claim 7. See the corresponding explanation of claim 7.

13. Claim 15 is a corresponding non-transitory computer readable medium claim of claim 1. See the corresponding explanation of claim 1.
14. Claim 16 is a corresponding non-transitory computer readable medium claim of claim 2. See the corresponding explanation of claim 2.

15. Claim 18 is a corresponding non-transitory computer readable medium claim of claim 4. See the corresponding explanation of claim 4.

16. Claim 19 is a corresponding non-transitory computer readable medium claim of claim 5. See the corresponding explanation of claim 5.

17. Claim 20 is a corresponding non-transitory computer readable medium claim of claim 6. See the corresponding explanation of claim 6.

Claims 3, 10 and 17 are rejected under 35 U.S.C. 103 as being unpatentable over O’ Hara in view of NPL1 and in further view of CHEN et al.(US20230047160) hereafter CHEN.

18. Regarding claim 3, O’Hara and NPL1 discloses the apparatus of claim 2. O’Hara discloses further wherein the matching object is a lowest (col 3 lines 28-38, col 11 lines 31-56 and col 19 lines 13-62 discloses wherein the matching object is the lowest cosine distance in the set of distances). O’Hara discloses other similarity distance measures as seen in the disclosure. O’Hara and NPL1 are however silent and fails to disclose a lowest Euclidean distance. 
	CHEN discloses a lowest Euclidean distance of the matching object in para 0064. Before the effective filing date of the invention was made, CHEN, O’Hara and NPL1 are combinable because they are from the same filed of endeavor and are analogous art of vehicle data processing. The suggestion/motivation would be an efficient system (i.e minimizes aggregate loss) at para 0016. Therefore, it would be obvious and within one of ordinary skill in the art to have recognized the advantages of CHEN in the apparatus of O’Hara and NPL1 to obtain the invention as specified in claim 3.
19. Claim 10 is a corresponding method claim of claim 3. See the corresponding explanation of claim 3.

20. Claim 17 is a corresponding non-transitory computer readable medium claim of claim 3. See the corresponding explanation of claim 3.

Examiner's Note: Examiner has cited figures, and paragraphs in the references as applied to the claims above for the convenience of the applicant. Although the specified citations are representative of the teachings in the art and are applied to the specific limitations within the individual claim, other passages and figures may apply as well. It is respectfully requested for the applicant, in preparing the responses, to fully consider the references in entirety as potentially teaching all or part of the claimed invention, as well as the context of the passage as taught by the prior art or disclosed by the examiner. Examiner has also cited references in PTO892 but not relied on, which are relevant and pertinent to the applicant’s disclosure, and may also be reading (anticipatory/obvious) on the claims and claimed limitations. Applicant is advised to consider the references in preparing the response/amendments in-order to expedite the prosecution.
Conclusion
Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.
Any inquiry concerning this communication or earlier communications from the examiner should be directed to JAYESH PATEL whose telephone number is (571)270-1227. The examiner can normally be reached IFW Mon-FRI.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Andrew Bee can be reached at 571-270-5183. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.

JAYESH PATEL
Primary Examiner
Art Unit 2677



/JAYESH A PATEL/Primary Examiner, Art Unit 2677

Read full office action

Prosecution Timeline

Show 1 earlier event

Sep 05, 2025

Non-Final Rejection mailed — §103

Nov 24, 2025

Interview Requested

Dec 10, 2025

Applicant Interview (Telephonic)

Dec 10, 2025

Examiner Interview Summary

Dec 11, 2025

Response Filed

Jan 16, 2026

Final Rejection mailed — §103

Feb 19, 2026

Interview Requested

Mar 03, 2026

Response after Non-Final Action

Precedent Cases

Applications granted by this same examiner with similar technology

18/509,026

Patent 12639938

ASYNCHRONOUS MULTIMODAL FEATURE FUSION

2y 6m to grant Granted May 26, 2026

17/918,627

Patent 12632978

CONFIGURATION OF ROBOT OPERATIONAL ENVIRONMENT INCLUDING LAYOUT OF SENSORS

3y 7m to grant Granted May 19, 2026

18/194,205

Patent 12626396

ELECTRONIC DEVICE AND CONTROLLING METHOD OF ELECTRONIC DEVICE

3y 1m to grant Granted May 12, 2026

18/396,971

Patent 12620260

INFORMATION PROCESSING METHOD, COMPUTER DEVICE, AND STORAGE MEDIUM

2y 4m to grant Granted May 05, 2026

18/702,055

Patent 12620208

FACIAL BEAUTY PREDICTION METHOD, APPARATUS, DEVICE AND STORAGE MEDIUM

2y 0m to grant Granted May 05, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

2-3

Expected OA Rounds

83%

Grant Probability

89%

With Interview (+5.3%)

2y 11m (~2m remaining)

Median Time to Grant

Moderate

PTA Risk

Based on 894 resolved cases by this examiner. Grant probability derived from career allowance rate.