Last updated: May 29, 2026

Application No. 18/467,781

USER INTERFACE FOR AI-GUIDED CONTENT GENERATION

Final Rejection §103

Filed

Sep 15, 2023

Priority

Mar 21, 2023 — provisional 63/491,321 +1 more

Examiner

MUELLER, PAUL JOSEPH

Art Unit

2657

Tech Center

2600 — Communications

Assignee

Shopify Inc.

OA Round

4 (Final)

Interview Optional

— +33.7% interview lift. Interview already conducted in this application's prosecution history. This examiner has a 76% grant rate with +33.7% interview lift. Since an interview has already been tried, recommend written response with narrowed claims based on precedent claim evolution patterns.

Based on 130 resolved cases, 2023–2026

Examiner Intelligence

MUELLER, PAUL JOSEPH View full profile →

Grants 76% — above average

Career Allowance Rate

99 granted / 130 resolved

+14.2% vs TC avg

Strong +34% interview lift

Without

With

+33.7%

Interview Lift

resolved cases with interview

Typical timeline

2y 10m

Avg Prosecution

18 currently pending

Career history

158

Total Applications

across all art units

Statute-Specific Performance

§101

3.1%

-36.9% vs TC avg

§103

93.2%

+53.2% vs TC avg

§102

1.1%

-38.9% vs TC avg

§112

1.1%

-38.9% vs TC avg

Black line = Tech Center average estimate • Based on career data from 130 resolved cases

Office Action

§103

DETAILED ACTION
Introduction
This office action is in response to Applicant’s amendment filed on March 11, 2026. 
No claims have been amended. Claims 2 and 13 have been previously cancelled. Claims 21 and 22 have been previously added. Claims 1, 3-12 and 14-22 are pending in the application. As such, claims 1, 3-12 and 14-22 have been examined. 

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Drawings
The drawings were received on September 15, 2023.  These drawings have been accepted and considered by the Examiner.

Response to Amendments and Arguments
No amendments to the claims have been made.
In view of the arguments, the rejections to claims 1, 3-12 and 14-22 under 35 U.S.C. 103 have been maintained.
Applicant’s arguments regarding the prior art rejections under 35 U.S.C 103, received on March 11, 2026, have been fully considered.
Applicant argues: On pages 1-2, "Benedetto is concerned with selection among prompt-language alternatives, not selection from model-generated outputs."
Examiner response: Examiner is not intending to assert that Benedetto teaches the "user selection" that is from the generated output.  Examiner is asserting a PHOSITA could combine the "user selection" already obtained by Ciminelli into the process of Benedetto by substituting the "user selection" already obtained by Ciminelli for the "second text prompt" of Benedetto (Fig. 5B, callout 560).
Without modifying the original mapping maintained below, a more detailed mapping and explanation is provided here, for purposes of clarification.
Ciminelli, as modified above, teaches the user selection, which is from the generated output.
Ciminelli, as modified above, does not teach, however Benedetto teaches
modifying the first text prompt based on the [user selection] to obtain a second text prompt (Benedetto in [0065, Fig 5A, Fig 5B] teaches using text prompt aggregator which can be used to aggregate the first text prompt and the second text prompt (user selection) to generate the aggregated user prompt (second text prompt)).


    PNG
    media_image1.png
    726
    449
    media_image1.png
    Greyscale

Examiner’s position is Benedetto teaches generating a first text prompt, receiving a second text prompt from the user, and then generating a third aggregate prompt. Ciminelli teaches the user selecting a portion of the output.  It is the combination of the references which renders the claimed element obvious.
Applicant’s arguments with respect to claims 1, 3-12 and 14-22 have been considered, and are not considered to be persuasive. Therefore, the rejections of claims 1, 3-12 and 14-22 have been maintained.

Claim Rejections - 35 USC § 103
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA  to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 1-3, 9-10, 12-14, 18 and 20 are rejected under 35 U.S.C. 103 as being unpatentable over Ciminelli et al. (US Patent Pub. No. 20240086051 A1, as supported by provisional application 63/444,841 filed 2/10/2023), hereinafter Ciminelli, in view of Sadr et al. (US Patent No. 11941678 B2), hereinafter Sadr, in view of Benedetto et al. (US Patent Pub. No. 20240193351 A1), hereinafter Benedetto.

Regarding claims 1, 12 and 20, Ciminelli teaches a computer-implemented method, a computing system, and a non-transitory processor-readable medium (Ciminelli in [0044, Fig. 1B] teaches using a personal computer to implement the method), 
[claim 12 only] a processor; and a memory coupled to the processor, the memory storing computer-executable instructions that, when executed by the processor, are to cause the processor to (Ciminelli in [0064] teaches using a processor, memory, and instructions which are non-transitory):
[claim 20 only] non-transitory processor-readable medium storing processor-executable instructions that, when executed by a processor, are to cause the processor to (Ciminelli in [0064] teaches using a processor, memory, and instructions which are non-transitory):
comprising:
obtaining [a plurality of outputs] of a generative model based on input of a first text prompt (Ciminelli in [0085] teaches using a generative model and generating a plurality of alternative outputs, and in [0005] teaches receiving textual input in a natural language, and in [0069] teaches the textual input is referred to as a prompt);
presenting [the plurality of outputs] via a user interface (Ciminelli in [0085] teaches presenting the plurality of alternative drafts of the output, in the same display screen, in different display screens, simultaneously, one after the other);
receiving, via the user interface, user selection of a desired portion of the [plurality of outputs, wherein the plurality of outputs comprise multiple different outputs generated via the generative model based on a same text prompt] (Ciminelli in [0006] teaches a selection of a portion of the output may be received from the user);
and
providing the second text prompt as input to the generative model for obtaining a second output (Ciminelli in [0006] teaches using the selected portion to indicate a change desired, implementing the desired change, and generating a new output for the desired change, and in [0008] teaches using a second textual input to indicate the desired change).
Ciminelli teaches the generative model which produces generative output.
Ciminelli does not teach, however Sadr teaches
plurality of outputs, wherein the plurality of outputs comprise multiple different outputs generated via [the generative model] based on a same text prompt (Sadr in [col 3 ln 40-67, Fig. 2, Fig. 3M] teaches using a first search query (prompt) to generate a plurality of first search results (responses), providing the plurality of results to the user in a display, obtaining a text input and an image selection form the user, generating a multi-modal prompt (second prompt) for input into an image-generation model to generate a model-generated image, and providing the second search results to the user on a display).

    PNG
    media_image2.png
    509
    884
    media_image2.png
    Greyscale


    PNG
    media_image3.png
    535
    887
    media_image3.png
    Greyscale

Sadr is considered to be analogous to the claimed invention because it is in the same field of image-generation models. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli further in view of Sadr to allow for using a first search query (prompt) to generate a plurality of first search results (responses). Motivation to do so would provide for using a plurality of descriptor user interface elements which allow for more detailed prompt generation (Sadr [col 14 ln 10-37]).

Ciminelli, as modified above, does not teach, however Benedetto teaches
modifying the first text prompt based on the user selection to obtain a second text prompt (Benedetto in [0065, Fig 5A, Fig 5B] teaches using text prompt aggregator which can be used to aggregate the first text prompt and the second text prompt (user selection) to generate the aggregated user prompt (second text prompt)).

Benedetto is considered to be analogous to the claimed invention because it is in the same field of image-generation models. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Benedetto to allow for using a text prompt aggregator to aggregate the first text prompt and the second text prompt to generate the aggregated user prompt. Motivation to do so would provide for a user prompt to be refined by taking into consideration additional keywords (Benedetto [0038]).

Regarding claims 3 and 14, Ciminelli, as modified above, teaches the method and computing system of claims 1 and 12.
Ciminelli further teaches
wherein the generative model comprises one of a text-to-image model or a large language model (LLM) (Ciminelli in [0069] teaches using a Large Language Model (LLM) as a generative language model).

Regarding claims 9 and 18, Ciminelli, as modified above, teaches the method and computing system of claims 1 and 12.
Ciminelli further teaches
further comprising 
receiving, via the user interface, input of user edits of the at least one output (Ciminelli in [0006] teaches a user selects a portion of the first output for modification)
and 
wherein the second text prompt is obtained by modifying the first text prompt based on the user selection and the user edits (Ciminelli in [0006] teaches a user selects a portion of the first output for modification, and the desired changes are analyzed to determine the desired changes, then the system generates a new second output incorporating the desired changes).

Regarding claim 10, Ciminelli, as modified above, teaches the method of claim 9.
Ciminelli further teaches
wherein the user edits comprise at least one of: 
deletion of a portion of an output; 
replacement of a portion of an output (Ciminelli in [0203] teaches at least part of the base version of the design may be replaced and/or modified based on the mathematical object to generate the new version of the design); 
or 
addition of text or image (Ciminelli in [0029] teaches user inputs may be text fields and/or image carousels).

Claims 4 and 15 are rejected under 35 U.S.C. 103 as being unpatentable over Ciminelli, in view of Sadr, in view of Benedetto, in view of Mathewson et al. (US Patent Pub. No. 20240184982 A1, as supported by provisional application 63/429,532 filed 12/1/2022), hereinafter Mathewson.

Regarding claims 4 and 15, Ciminelli, as modified above, teaches the method and computing system of claims 1 and 12.
Ciminelli, as modified above, does not teach, however Mathewson teaches
wherein receiving the user selection of a desired portion comprises:
performing text processing of a text output for obtaining a list of one or more tokens (Mathewson in [0032] teaches using a language model neural network to generate a sequence of tokens);
presenting the one or more tokens via the user interface (Mathewson in [0083] teaches using a user interface for presenting the current output sequence for editing); 
and
receiving selection of at least one of the one or more tokens (Mathewson in [0083] teaches presenting the current output sequence for editing, and allowing the user to add, modify, or delete text tokens).
Mathewson is considered to be analogous to the claimed invention because it is in the same field of LLMs. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Mathewson to allow for the user to add, modify, or delete text tokens. Motivation to do so would allow for performing generation steps in an interactive manner which allows the user to refine the final content of the textual work without requiring the user to compose the actual textual work (Mathewson [0052]).

Claims 5 and 16 are rejected under 35 U.S.C. 103 as being unpatentable over Ciminelli, in view of Sadr, in view of Benedetto, in view of Cohen et al. (US Patent Pub. No. 20240169502 A1), hereinafter Cohen.

Regarding claims 5 and 16, Ciminelli, as modified above, teaches the method and computing system of claims 1 and 12.
Ciminelli, as modified above, does not teach, however Cohen teaches
wherein receiving the user selection of a desired portion comprises:
performing object detection of an image output for identifying one or more objects (Cohen in [0085, Fig. 2] teaches identification of various objects in an image);
graphically representing the one or more objects via the user interface (Cohen in [0205, Fig. 8C] teaches selecting an object and moving it to a different position in the image (moving the graphical representation); 
and
receiving selection of at least one of the one or more objects (Cohen in [0091, Fig. 2] teaches detecting a user selection of an object in an image).
Cohen is considered to be analogous to the claimed invention because it is in the same field of digital image object selection. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Cohen to allow for selecting an object and moving it to a different position in an image. Motivation to do so would allow for flexible and intuitive editing of digital images while efficiently reducing the user interactions typically required to make such edits (Cohen [0002]).

Claims 6-8, 17 and 21-22 are rejected under 35 U.S.C. 103 as being unpatentable over Ciminelli, in view of Sadr, in view of Benedetto, in view of Quek et al. (US Patent Pub. No. 20090015869 A1), hereinafter Quek.

Regarding claims 6 and 17, Ciminelli, as modified above, teaches the method and computing system of claims 1 and 12.
Ciminelli, as modified above, teaches in claim 1 the user interface, and graphically representing the user selection.
Ciminelli, as modified above, does not teach, however Quek teaches
further comprising 
displaying, via the user interface, a sandbox region for graphically representing the user selection (Quek in [0085] teaches using a region for displaying multiple images which have been dragged and dropped there), 
wherein the sandbox region is dynamically updated based on selections of desired portions across multiple different outputs (Quek in [0085] teaches using a region for displaying multiple images which have been dragged and dropped there, and also teaches that the style of the interface can be adjusted, in addition to the addition of new images (this maps to dynamic)).
Quek is considered to be analogous to the claimed invention because it is in the same field of digital image object selection. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Quek to allow for selecting an object and moving it to a different interface. Motivation to do so would allow for intuitive visual abstractions and mechanisms for efficiently and quickly creating image collages (Quek [0027]).

Regarding claim 7, Ciminelli, as modified above, teaches the method of claim 6.
Ciminelli, as modified above, does not teach, however Quek teaches
further comprising:
receiving, via the user interface, input for changing a property of a selected desired portion of the at least one output in the sandbox region (Quek in [0087] teaches the user can use a computer I/O device to adjust the positions, the orientations, or the sizes of the images); 
and
updating the user interface to represent the inputted change of the property (Quek in [0087] teaches user can also add a border to the image, or enhance the colors of the image by change contrast, color saturation, or tones (e.g. from color to sepia or black-and-white) [these changes correspond to updating]).
Quek is considered to be analogous to the claimed invention because it is in the same field of digital image object selection. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, as modified above, further in view of Quek to allow for adjust the positions, the orientations, or the sizes of the images. Motivation to do so would allow for intuitive visual abstractions and mechanisms for efficiently and quickly creating image collages (Quek [0027]).

Regarding claim 8, Ciminelli, as modified above, teaches the method of claim 7.
Ciminelli, as modified above, does not teach, however Quek teaches
wherein the property of the selected desired portion comprises one of 
location (Quek in [0087] teaches the user can use a computer I/O device to adjust the positions, the orientations, or the sizes of the images), 
scale (Quek in [0087] teaches the user can use a computer I/O device to adjust the positions, the orientations, or the sizes of the images), 
color (Quek in [0087] teaches user can also add a border to the image, or enhance the colors of the image by change contrast, color saturation, or tones (e.g. from color to sepia or black-and-white), 
or 
language.
Quek is considered to be analogous to the claimed invention because it is in the same field of digital image object selection. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Quek to allow for adjust the positions, the colors, or the sizes of the images. Motivation to do so would allow for intuitive visual abstractions and mechanisms for efficiently and quickly creating image collages (Quek [0027]).

Regarding claims 21 and 22, Ciminelli, as modified above, teaches the method and computing system of claims 6 and 17.
Ciminelli, as modified above, teaches in claims 6 and 17 above, the sandbox region and using the sandbox region for displaying multiple images which have been dragged and dropped there.
Ciminelli, as modified above, does not teach, however Sadr teaches
wherein the [sandbox region] is progressively populated responsive to selections of preferred content portions across the multiple different outputs that are received via the user interface (Sadr in [col 3 ln 40-67] teaches using a first search query (prompt) to generate a plurality of first search results (responses), providing the plurality of results to the user in a display, obtaining a text input and an image selection form the user, generating a multi-modal prompt (second prompt) for input into an image-generation model to generate a model-generated image, and providing the second search results to the user on a display, and in [col 3 ln 37-53] teaches FIG. 10 depicts an illustration of an example user collection interface according to example embodiments of the present disclosure. For example, the user collection interface can provide a landing experience that provides various inspiration bucket and personalization methods. In particular, a “recommended for you” panel can be provided in the interface that may be determined based on previous creations, based on preferences, and/or based on historical data (e.g., purchase history, search history, and/or browsing history). The user collection interface can include a dream closet that displays a plurality of previous model-generated datasets. The user may interact with a preferences interface to select varying preferences that can be utilized for further “recommended for you” suggestions, which can include automated prompt generation and processing for automatically generated model-generated datasets).

    PNG
    media_image4.png
    642
    895
    media_image4.png
    Greyscale

Sadr is considered to be analogous to the claimed invention because it is in the same field of image-generation models. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Sadr to allow for a user to make multiple selections for further processing and analysis. Motivation to do so would provide for using a plurality of descriptor user interface elements which allow for more detailed prompt generation (Sadr [col 14 ln 10-37]).

Claims 11 and 19 are rejected under 35 U.S.C. 103 as being unpatentable over Ciminelli, in view of Sadr, in view of Benedetto, in view of Shlens et al. (US Patent No. 10535164 B2), hereinafter Shlens.

Regarding claims 11 and 19, Ciminelli, as modified above, teaches the method and computing system of claims 1 and 12.
Ciminelli, as modified above, teaches in claim 1 the second text prompt, and providing it as input to the generative model for obtaining the second output.
Ciminelli, as modified above, does not teach, however Shlens teaches
further comprising:
receiving user input of an adherence weight value representing a desired level of adherence to the user selection (Shlens in [col 3 ln 10-17] teaches allowing a user to select a combination of multiple image styles and to specify a weight that should be applied to each), 
wherein [the second text prompt and] the adherence weight value are provided as input [to the generative model for obtaining the second output] (Shlens in [col 3 ln 10-17] teaches allowing a user to select a combination of multiple image styles and to specify a weight that should be applied to each).
Shlens is considered to be analogous to the claimed invention because it is in the same field of digital image modification. Therefore, it would have been obvious to someone of ordinary skill in the art before the effective filing date of the claimed invention to have modified Ciminelli, as modified above, further in view of Shlens to allow for specify weights that should be applied to each modification desired. Motivation to do so would allow for an output image which can be provided to a user more efficiently i.e. in a shorter amount of time, providing other related advantages such as a reduced screen-on time, with further benefits in terms of the power consumption of the device (Shlens [col 2 ln 12-26]).

Conclusion
THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.

Any inquiry concerning this communication or earlier communications from the examiner should be directed to PAUL J. MUELLER whose telephone number is (571)272-1875. The examiner can normally be reached M-F 9:00am-5:00pm (Eastern).
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Daniel C. Washburn can be reached at 571-272-5551. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.

PAUL MUELLER
Examiner
Art Unit 2657



/PAUL J. MUELLER/Examiner, Art Unit 2657                                                                                                                                                                                                        

/DANIEL C WASHBURN/Supervisory Patent Examiner, Art Unit 2657

Read full office action

Prosecution Timeline

Show 3 earlier events

Oct 07, 2025

Final Rejection mailed — §103

Dec 08, 2025

Response after Non-Final Action

Dec 18, 2025

Non-Final Rejection mailed — §103

Jan 30, 2026

Interview Requested

Feb 05, 2026

Applicant Interview (Telephonic)

Feb 05, 2026

Examiner Interview Summary

Mar 11, 2026

Response Filed

Apr 13, 2026

Final Rejection mailed — §103 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

18/336,380

Patent 12632673

UTILIZING EMBEDDING-BASED CLAIM-RELATION GRAPHS FOR EFFICIENT SYNTOPICAL READING OF CONTENT COLLECTIONS

2y 11m to grant Granted May 19, 2026

18/375,794

Patent 12632648

GENERATIVE LARGE LANGUAGE MODEL (LLM) DECENTRALIZED NETWORK

2y 7m to grant Granted May 19, 2026

18/477,680

Patent 12632651

AUTOMATIC LANGUAGE MODEL (LM) INPUT OPTIMIZATION USING TEXTUAL GRADIENTS

2y 7m to grant Granted May 19, 2026

18/544,647

Patent 12633305

END-TO-END SPEECH DIARIZATION VIA ITERATIVE SPEAKER EMBEDDING

2y 5m to grant Granted May 19, 2026

17/739,908

Patent 12614554

ERROR CORRECTION OVERWRITE FOR AUDIO ARTIFACT REDUCTION

3y 11m to grant Granted Apr 28, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

5-6

Expected OA Rounds

76%

Grant Probability

99%

With Interview (+33.7%)

2y 10m (~1m remaining)

Median Time to Grant

High

PTA Risk

Based on 130 resolved cases by this examiner. Grant probability derived from career allowance rate.