Last updated: April 19, 2026
Application No. 18/157,100
INFORMATION PROCESSING APPARATUS, LEARNING APPARATUS, IMAGE RECOGNITION APPARATUS, INFORMATION PROCESSING METHOD, LEARNING METHOD, IMAGE RECOGNITION METHOD, AND NON-TRANSITORY-COMPUTER-READABLE STORAGE MEDIUM

Non-Final OA §103
Filed
Jan 20, 2023
Examiner
VANCHY JR, MICHAEL J
Art Unit
2666
Tech Center
2600 — Communications
Assignee
Canon Kabushiki Kaisha
OA Round
1 (Non-Final)
Interview Optional

— +20.1% interview lift. This examiner has a relatively high allow rate; a written response may suffice.
Based on 606 resolved cases, 2023–2026
Examiner Intelligence

VANCHY JR, MICHAEL J View full profile →
Grants 67% — above average
Career Allow Rate
404 granted / 606 resolved
+4.7% vs TC avg
Strong +20% interview lift
Without
With
+20.1%
Interview Lift
resolved cases with interview
Typical timeline
3y 4m
Avg Prosecution
16 currently pending
Career history
622
Total Applications
across all art units
Statute-Specific Performance

§101
11.7%
-28.3% vs TC avg
§103
60.8%
+20.8% vs TC avg
§102
8.4%
-31.6% vs TC avg
§112
10.4%
-29.6% vs TC avg
Black line = Tech Center average estimate • Based on career data from 606 resolved cases
Office Action

§103
DETAILED ACTION Notice of Pre-AIA or AIA Status The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA. Claim Interpretation 112(f) The following is a quotation of 35 U.S.C. 112(f): (f) Element in Claim for a Combination. – An element in a claim for a combination may be expressed as a means or step for performing a specified function without the recital of structure, material, or acts in support thereof, and such claim shall be construed to cover the corresponding structure, material, or acts described in the specification and equivalents thereof. The following is a quotation of pre-AIA 35 U.S.C. 112, sixth paragraph: An element in a claim for a combination may be expressed as a means or step for performing a specified function without the recital of structure, material, or acts in support thereof, and such claim shall be construed to cover the corresponding structure, material, or acts described in the specification and equivalents thereof. This application includes one or more claim limitations that do not use the word “means,” but are nonetheless being interpreted under 35 U.S.C. 112(f) or pre-AIA 35 U.S.C. 112, sixth paragraph, because the claim limitation(s) uses a generic placeholder that is coupled with functional language without reciting sufficient structure to perform the recited function and the generic placeholder is not preceded by a structural modifier. Such claim limitation(s) is/are: As to claims 1-15 and 21-25, the “generation units” are considered to read on a computer that executes the computer program corresponding to each of the functional units, thereby performing the function of the functional unit; wherein the functional units may be implemented by hardware (Fig. 2; [0052]). As to claims 10 and 11, the “acquisition units” are considered to read on a computer that executes the computer program corresponding to each of the functional units, thereby performing the function of the functional unit; wherein the functional units may be implemented by hardware (Fig. 2; [0052]). As to claim 11, the “identification units” are considered to read on a computer that executes the computer program corresponding to each of the functional units, thereby performing the function of the functional unit; wherein the functional units may be implemented by hardware (Fig. 2; [0052]). As to claims 12-15 and 22-25, the “learning units” are considered to read on a computer that executes the computer program corresponding to each of the functional units, thereby performing the function of the functional unit; wherein the functional units may be implemented by hardware (Fig. 2; [0052]). As to claims 12-15, 17-20, and 22-25 , the “ detection units” are considered to read on a computer that executes the computer program corresponding to each of the functional units, thereby performing the function of the functional unit; wherein the functional units may be implemented by hardware (Fig. 2; [0052]). As to claims 15 and 25, the “formation units” are considered to read on a computer that executes the computer program corresponding to each of the functional units, thereby performing the function of the functional unit; wherein the functional units may be implemented by hardware (Fig. 2; [0052]). Because this/these claim limitation(s) is/are being interpreted under 35 U.S.C. 112(f) or pre-AIA 35 U.S.C. 112, sixth paragraph, it/they is/are being interpreted to cover the corresponding structure described in the specification as performing the claimed function, and equivalents thereof. If applicant does not intend to have this/these limitation(s) interpreted under 35 U.S.C. 112(f) or pre-AIA 35 U.S.C. 112, sixth paragraph, applicant may: (1) amend the claim limitation(s) to avoid it/them being interpreted under 35 U.S.C. 112(f) or pre-AIA 35 U.S.C. 112, sixth paragraph (e.g., by reciting sufficient structure to perform the claimed function); or (2) present a sufficient showing that the claim limitation(s) recite(s) sufficient structure to perform the claimed function so as to avoid it/them being interpreted under 35 U.S.C. 112(f) or pre-AIA 35 U.S.C. 112, sixth paragraph. Claim Rejections - 35 USC § 103 In the event the determination of the status of the application as subject to AIA 35 U.S.C. 102 and 103 (or as subject to pre-AIA 35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA to pre-AIA) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status. The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action: A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made. The factual inquiries for establishing a background for determining obviousness under 35 U.S.C. 103 are summarized as follows: 1. Determining the scope and contents of the prior art. 2. Ascertaining the differences between the prior art and the claims at issue. 3. Resolving the level of ordinary skill in the pertinent art. 4. Considering objective evidence present in the application indicating obviousness or nonobviousness. Claim(s) 1- 10 and 12-25 are rejected under 35 U.S.C. 103 as being unpatentable over Marino et al., US 2017/0278289 A1 (Marino), and further in view of Quinton et al., US 2022/0327811 A1 (Quinton). Regarding claim 1 , Marino teaches an information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) , comprising: a first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) configured to generate a synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and a second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) configured to generate learning data (machine learning system for generating training set) ([0077]) , the learning data including a label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating an object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 2 , Marino teaches wherein the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) acquires an image having a texture as the second image (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) , and the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) generates a synthesized image in which the second image is synthesized in the closed region in the first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) . Regarding claim 3 , Marino teaches wherein the first generation unit (content integration system 100) (Fig. 1; [0116]) generates a closed region using a geometric figure (generating a closed region using a bounding box; either a 2D or 3D bounding box) (Figs. 3C, 3O, and 3S; [0117] and [0119-0120]) , sets the generated closed region on the first image (generating the closed region on the target digital content) ([0116]) , and generates a synthesized image in which the second image is synthesized in the closed region (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116-0117] and [0119-0120]) . Regarding claim 4 , Marino teaches wherein the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) generates a synthesized image in which the second image is synthesized in a two-dimensional projection region (wherein the source digital content (second image) can be placed/synthesized into the target digital content (first image); wherein the first image area (host region) is a 2D surface such as a wall) ([0343]) in which a virtual object having a three-dimensional shape is projected on the first image (wherein the synthesized image can include the source digital content (second image) is a 3D shape/object projected on the host region of the target digital content (first image)) ([0343-344]) . Regarding claim 5 , Marino teaches wherein the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) generates a synthesized image in which the second image is synthesized in a closed region set in the first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) in response to an operation by a user (wherein a user can select the area as the host region within the target digital content (first image)) (Fig. 17B; [0068-0069] and [0279-0280]) . Regarding claim 6 , Marino teaches wherein the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) generates a synthesized image in which the second image is synthesized in a closed region (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3N-3Q; [0116] and [0119]) surrounding a contour of an object in the first image (wherein the bounding box surrounds the contour of the detected marker) (Figs. 3N-3Q; [0119]) . Regarding claim 7 , Marino teaches wherein the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) generates a synthesized image in which the second image is synthesized in each closed region (wherein one or multiple host regions (with bounding boxes) can be identified) ([0066-0067]) in the first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) . Regarding claim 8 , Marino teaches wherein the first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) generates a synthesized image in which a plurality of the second images are synthesized in the closed region in the first image (wherein the content integration module 120 can overlay or place a plurality of source digital content (second images) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3M; [0116-0118]) . Regarding claim 9 , Marino teaches wherein the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) generates learning data (machine learning system for generating training set) ([0077]) , the learning data includes the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and a texture label (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) , and the texture label indicates a region having a texture in the closed region in the synthesized image (wherein the texture label indicates the texture within the closed region (bounding box) for the synthesized image) ([0088], [0095-0096], [0098], [0248], and [0344-0348]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 10 , Marino teaches comprising an acquisition unit configured to acquire the second image (wherein the content integration system 100 can receive the source digital content (second image)) (Fig. 1; [0116]) , the second image being formed by cutting out a portion including a texture pattern (cropping the digital content including the texture in the content) ([0077] and [0318]) in a shape same as a shape of the closed region (wherein the source digital content can match the shape of the associated host region) ([0436-0437]) from a third image including the texture pattern (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088], [0114-0115], and [0344-0348]) . Marino also teaches wherein this gives the eventual placement of the source digital content a more immersed and realistic feel, improving viewer experience ([0343]) . Regarding claim 12 , Marino teaches a learning apparatus (apparatus with a neural network machine learning model) ([0012]) , comprising a learning unit (machine learning model) ([0066-0067]) configured to perform learning of a detection unit that detects an object region from an input image included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated by a second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) of an information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) and a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , wherein the information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) includes: a first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) configured to generate the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) configured to generate the learning data (machine learning system for generating training set) ([0077]) , the learning data including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 13 , Marino teaches an image recognition apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) , comprising a detection unit configured to detect an object region (detecting a host region within the image) ([0066-0067] and [0075-0077]) from an input image using a detection unit learned by a learning apparatus (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) that includes learning unit (machine learning model) ([0066-0067]) , the learning unit (machine learning model) ([0066-0067]) performing learning of the detection unit that detects the object region from the input image included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated by a second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) of an information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) and a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , wherein the information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) includes: a first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) configured to generate the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) configured to generate the learning data (machine learning system for generating training set) ([0077]) , the learning data including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 14 , Marino teaches a learning apparatus (apparatus with a neural network machine learning model) ([0012]) , comprising a learning unit (machine learning model) ([0066-0067]) configured to perform learning of a first detection unit and a second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated by a second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) of an information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) , a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and a texture label included in the learning data (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) , the first detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting an object region from an input image (wherein the learning data includes learning a host region within the image) ([0066-0067] and [0075-0077]) , the second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting a region having a texture from the input image (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) , wherein the information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) includes: a first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) configured to generate the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) configured to generate the learning data (machine learning system for generating training set) ([0077]) , the learning data including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) , wherein the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) generates the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and the texture label (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) indicating a region having the texture in the closed region in the synthesized image (wherein the texture label indicates the texture within the closed region (bounding box) for the synthesized image) ([0088], [0095-0096], [0098], [0248], and [0344-0348]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 15 , Marino teaches an image recognition apparatus (image recognition apparatus) ([0018]) , comprising a formation unit configured to form a new object region (forming a host region by forming a bounding box) (Figs. 3N and 3O; [0119]) using an object region detected from an input image (detecting a host region within the image) ([0066-0067] and [0075-0077]) using a first detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) learned (wherein the learning data includes learning a host region within the image) ([0066-0067] and [0075-0077]) by a learning apparatus (apparatus with a neural network machine learning model) ([0012]) and a texture region detected from the input image (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) using a second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) learned by the learning apparatus (apparatus with a neural network machine learning model) ([0012]) , the learning apparatus (apparatus with a neural network machine learning model) ([0012]) including a learning unit (machine learning model) ([0066-0067]) configured to perform learning of the first detection unit and the second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated by a second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) of an information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) , a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and a texture label included in the learning data (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) , the first detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting the object region from the input image (wherein the learning data includes learning a host region within the image) ([0066-0067] and [0075-0077]) , the second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting a region having a texture from the input image (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) , wherein the information processing apparatus (apparatus and system for integrating source digital content with target digital content) ([0002-0003]) includes: a first generation unit (wherein content integration system 100 can use a content integration module 120) (Fig. 1; [0116]) configured to generate the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) configured to generate the learning data (machine learning system for generating training set) ([0077]) , the learning data including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) , wherein the second generation unit (host region identification module 110) (Fig. 1; [0076-0077]) generates the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and the texture label (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) indicating a region having the texture in the closed region in the synthesized image (wherein the texture label indicates the texture within the closed region (bounding box) for the synthesized image) ([0088], [0095-0096], [0098], [0248], and [0344-0348]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 16 , Marino teaches an information processing method performed by an information processing apparatus (apparatus , system s, and methods for integrating source digital content with target digital content) ([0002-000 4 ]) , the method comprising: generating a synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and generating learning data (machine learning system for generating training set) ([0077]) including a label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating an object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 17 , Marino teaches a learning method performed by a learning apparatus (apparatus with a neural network machine learning model) ([0012]) , comprising performing learning (machine learning model) ([0066-0067]) of a detection unit that detects an object region from an input image included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated in an information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) and a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , wherein the information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) includes: generating the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and generating the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 18 , Marino teaches a n image recognition method performed by an image recognition apparatus (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) , comprising detecting an object region (detecting a host region within the image) ([0066-0067] and [0075-0077]) from an input image using a detection unit learned by a learning method (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated in an information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) and a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the learning method performing learning of the detection unit that detects the object region from the input image (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) , wherein the information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) includes: generating the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and generating the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 19 , Marino teaches a learning method performed by a learning apparatus (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) , comprising performing learning of a first detection unit and a second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated in an information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) , a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and a texture label included in the learning data (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) , the first detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting an object region from an input image (wherein the learning data includes learning a host region within the image) ([0066-0067] and [0075-0077]) , the second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting a region having a texture from the input image (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) , wherein the information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) includes: generating the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and generating the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) , wherein the generating generates the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and the texture label (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) indicating a region having the texture in the closed region in the synthesized image (wherein the texture label indicates the texture within the closed region (bounding box) for the synthesized image) ([0088], [0095-0096], [0098], [0248], and [0344-0348]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to modify Marino to include the synthesized/composite image as part of the learning data since it improves the efficiency of generating training data and the training process (Quinton; [0011] and [0058]) . Regarding claim 20 , Marino teaches a n image recognition method performed by an image recognition apparatus (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) , comprising forming a new object region (forming a host region by forming a bounding box) (Figs. 3N and 3O; [0119]) using an object region detected from an input image (detecting a host region within the image) ([0066-0067] and [0075-0077]) using a first detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) learned by a learning method and a texture region detected from the input image using a second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) learned (wherein the learning data includes learning a host region within the image) ([0066-0067] and [0075-0077]) by the learning method (a neural network machine learning model) ([0012]) , the learning method performing learning of the first detection unit and the second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) included in learning data (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) generated in an information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) , a label included in the learning data (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and a texture label included in the learning data (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) , the first detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting the object region from the input image (wherein the learning data includes learning a host region within the image) ([0066-0067] and [0075-0077]) , the second detection unit (wherein the host region identification module 110 includes a plurality of host region identification sub-modules) (Fig. 1; [0096]) detecting a region having a texture from the input image (wherein the learning data includes learning a host region within the image, such as based on texture) ([0066-0067] and [0075-0077]) , wherein the information processing method (apparatus, systems, and methods for integrating source digital content with target digital content) ([0002-0004]) includes: generating the synthesized image in which a second image is synthesized in a closed region in a first image (wherein the content integration module 120 can overlay or place source digital content (second image) onto the detected host region (closed region) of the target digital content (first image)) (Figs. 1 and 3A-3U; [0116]) ; and generating the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , the label indicating the object region including a region corresponding to the closed region in the synthesized image (the label indicating a region for the closed region for where the second image will be placed to generate the synthesized image) (Figs. 3C, 3N, 3O, and 3S; [0077], [0117], and [0119-0120]) , wherein the generating generates the learning data (machine learning system for generating training set) ([0077]) including the label (the learning data including a location for the second image; such as a bounding box) (Figs. 3C, 3N, 3O, and 3S; [0077]) , and the texture label (acquiring texture of the source digital content so as to be able to recreate the luminance (and thus the texture and luminance changes) of the target digital content in the source digital content) ([0088] and [0344]) indicating a region having the texture in the closed region in the synthesized image (wherein the texture label indicates the texture within the closed region (bounding box) for the synthesized image) ([0088], [0095-0096], [0098], [0248], and [0344-0348]) . However, Marino does not explicitly teach that the “synthesized image” is used within the learning data. Quinton teaches systems and methods for generating composite based data for use in machine learning systems (Abstract) ; wherein generating composite data comprising the desired label of a response entry and image data corresponding to the fragment of the composite image (Abstract) ; and wherein composite images can be used as composite training data ([0067]) . It would have been obvious to one of ordinary skill in the art before the effective filin
Read full office action
Prosecution Timeline

Jan 20, 2023
Application Filed
Mar 21, 2026
Non-Final Rejection — §103 (current)
Precedent Cases

Applications granted by this same examiner with similar technology

18/160,186
Patent 12602906
IMAGE RECOGNITION APPARATUS
2y 5m to grant Granted Apr 14, 2026
17/584,140
Patent 12579596
MANAGING ARTIFICIAL-INTELLIGENCE DERIVED IMAGE ATTRIBUTES
2y 5m to grant Granted Mar 17, 2026
18/533,652
Patent 12579634
REAL-TIME PROCESS DEFECT DETECTION AUTOMATION SYSTEM AND METHOD USING MACHINE LEARNING MODEL
2y 5m to grant Granted Mar 17, 2026
18/506,681
Patent 12573225
METHODS AND SYSTEMS OF FIELD DETECTION IN A DOCUMENT
2y 5m to grant Granted Mar 10, 2026
17/838,131
Patent 12551101
SYSTEM AND METHOD FOR DIGITAL MEASUREMENTS OF SUBJECTS
2y 5m to grant Granted Feb 17, 2026
Study what changed to get past this examiner. Based on 5 most recent grants.
AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.
Prosecution Projections

1-2
Expected OA Rounds
67%
Grant Probability
87%
With Interview (+20.1%)
3y 4m
Median Time to Grant
Low
PTA Risk
Based on 606 resolved cases by this examiner. Grant probability derived from career allow rate.