Prosecution Insights
Last updated: April 19, 2026
Application No. 18/832,750

IMAGE PROCESSING METHOD AND APPARATUS, AND DEVICE AND STORAGE MEDIUM

Non-Final OA §101§103
Filed
Jul 24, 2024
Examiner
HOANG, PHI
Art Unit
2619
Tech Center
2600 — Communications
Assignee
BEIJING ZITIAO NETWORK TECHNOLOGY CO., LTD.
OA Round
1 (Non-Final)
82%
Grant Probability
Favorable
1-2
OA Rounds
2y 8m
To Grant
98%
With Interview

Examiner Intelligence

Grants 82% — above average
82%
Career Allow Rate
756 granted / 928 resolved
+19.5% vs TC avg
Strong +17% interview lift
Without
With
+17.0%
Interview Lift
resolved cases with interview
Typical timeline
2y 8m
Avg Prosecution
25 currently pending
Career history
953
Total Applications
across all art units

Statute-Specific Performance

§101
10.5%
-29.5% vs TC avg
§103
53.0%
+13.0% vs TC avg
§102
11.9%
-28.1% vs TC avg
§112
13.4%
-26.6% vs TC avg
Black line = Tech Center average estimate • Based on career data from 928 resolved cases

Office Action

§101 §103
DETAILED ACTION Notice of Pre-AIA or AIA Status The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA . Claim Rejections - 35 USC § 101 35 U.S.C. 101 reads as follows: Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and requirements of this title. Claim 14 is rejected under 35 U.S.C. 101 because the claimed invention is directed to non-statutory subject matter. The claims describe a computer-readable storage medium storing instructions. However, the specification does not clearly define the nature of the “computer-readable storage medium" to exclude media such as a wireless signal or carrier wave; therefore, the “computer-readable storage medium” can be interpreted to include non-statutory media such as the wireless signal or carrier wave. The Examiner recommends replacing “computer-readable storage medium” with “non-transitory computer-readable storage medium”. Claim Rejections - 35 USC § 103 In the event the determination of the status of the application as subject to AIA 35 U.S.C. 102 and 103 (or as subject to pre-AIA 35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status. The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action: A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made. Claim(s) 1, 3-6, 11, 13, 14, and 16-19 is/are rejected under 35 U.S.C. 103 as being unpatentable over Barlier et al. (US 2021/0264656 A1). Regarding claim 1, Barlier discloses an image processing method, comprising: in response to a received service execution instruction (Paragraph 0287, user selection of a virtual avatar template representation), identifying a target object in a video stream captured in real time, and determining position information of the target object in the video stream; (Paragraphs 0668-0669, live camera preview of a user that can be determined to be in a field-of-view the camera different from a background for transposing the virtual avatar) displaying a virtual model on the target object in the video stream according to the position information (Figure 26F and paragraph 0669, transposing of the avatar onto the user) and playing a target audio on loop, and controlling the virtual model to display a corresponding animation expression according to the target audio (Paragraph 0294, Barlier does not clearly disclose playing audio data on loop; however, Barlier discloses that recorded audio data can be played during the looping of a display of an animation of the avatar. It would have been obvious that the recorded audio also loops because the animation of the avatar is displayed in a loop that also plays the recorded audio at the same time). Regarding claim 3, Barlier discloses wherein when a plurality of target objects is identified, the displaying the virtual model on the target object in the video stream according to the position information comprises: determining a to-be-mounted object from the plurality of target objects; (Paragraph 0668, categorizing the subject from background objects) determining a target virtual model matched with the to-be-mounted object from a preset virtual model set according to a first size of the to-be-mounted object in the video stream; (Paragraph 0287, one avatar can be selected from a number of virtual avatar templates and sized based on the user in the camera view, paragraph 0347) wherein the preset virtual model set comprises a plurality of virtual models with different sizes; (Paragraphs 0287 and 0347, virtual avatar templates where the avatar that is displayed is based on the size of the user’s head in the camera view) and displaying the target virtual model correspondingly on the to-be-mounted object in the video stream according to the position information of the to-be-mounted object (Figure 26F and paragraph 0669, transposing of the avatar onto the user). Regarding claim 4, Barlier discloses positioning the to-be-mounted object in real time to determine whether a position of the to-be-mounted object in the video stream has changed; (Figure 26F and paragraph 0669, avatar transposed on the user’s head) and in response to a change of the position of the to-be-mounted object in the video stream, displaying the target virtual model correspondingly on the to-be-mounted object in the video stream according to changed position information (Figure 26G and paragraph 0669, a movement of the user’s head changes the movement of the avatar). Regarding claim 5, Barlier discloses in response to the change of the position of the to-be-mounted object in the video stream, acquiring a second size of the to-be-mounted object in the video stream, the second size being different from the first size; (Paragraph 0347, detecting changes in size of the user’s head) and adjusting the target virtual model by scaling according to the second size (Paragraph 0347, changing the size of the avatar based on the changes to the size of the user’s head). Regarding claim 6, Barlier discloses wherein the adjusting the target virtual model by scaling according to the second size comprises: acquiring a size adjustment operation instruction for the target virtual model; and adjusting the target virtual model by scaling according to the size adjustment operation instruction (Paragraph 0347, the size of the avatar changes based on detected changes in size of the user’s head). Regarding claim 11, Barlier discloses displaying a service control and a shooting control in an interface of the video stream; (Figure 26F, list of avatars than can be transposed on the user’s head as well as an option of no avatar) wherein the service control is configured to trigger the service execution instruction, and the shooting control is configured to end a flow corresponding to the image processing method (Figure 26F, selected robot avatar can be transposed on the user’s head where none would have no avatar transposed on the user’s head). Regarding claims 13 and 14, similar reasoning as discussed in claim 1 is applied. Regarding claim 16, similar reasoning as discussed in claim 3 is applied. Regarding claim 17, similar reasoning as discussed in claim 4 is applied. Regarding claim 18, similar reasoning as discussed in claim 5 is applied. Regarding claim 19, similar reasoning as discussed in claim 6 is applied. Claim(s) 2 and 15 is/are rejected under 35 U.S.C. 103 as being unpatentable over Barlier et al. (US 2021/0264656 A1) in view of Goel et al. (US 2019/0043239 A1). Regarding claim 2, Barlier discloses all limitations as discussed in claim 1. Barlier does not clearly disclose wherein the target audio comprises a plurality of audio tracks, different audio tracks are configured to control different virtual models, and animation expressions of the virtual models to which different audio tracks correspond are different; the controlling the virtual model to display the corresponding animation expression according to the target audio comprises: controlling the virtual model corresponding to the audio track to display the corresponding animation expression according to currently played audio track data. Goel discloses an audio data storage storing one more audio tracks with associated emotional responses (Paragraphs 0040-0041) that are used to control a visual representation of an avatar (Paragraphs 0046-0047). Goel’s technique of controlling a visual representation of avatars using emotional responses associated with audio tracks would have been recognized by one of ordinary skill in the art to be applicable to the animation of an avatar of Barlier and the results would have been predictable in the animation of avatars using emotional responses associated with audio tracks. Therefore, the claimed subject matter would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention. Regarding claim 15, similar reasoning as discussed in claim 2 is applied. Claim(s) 10 is/are rejected under 35 U.S.C. 103 as being unpatentable over Barlier et al. (US 2021/0264656 A1) in view of Quek et al. (US 2021/0076107 A1). Regarding claim 10, Barlier discloses all limitations as discussed in claim 1. Barlier does not clearly disclose synchronously displaying, in an interface of the video stream, subtitle information corresponding to the target audio. Quek discloses playing of a video and audio feed that is displayed with synchronized subtitles (Paragraph 0046). Quek’s technique of playing of a video and audio feed that is displayed with synchronized subtitles would have been recognized by one of ordinary skill in the art to be applicable to the display of an animated avatar with recorded audio data of Barlier and the results would have been predictable in the playing of video of an animated avatar with recorded audio data that is displayed with synchronized subtitles. Therefore, the claimed subject matter would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention. Allowable Subject Matter Claims 7-9 and 20-21 are objected to as being dependent upon a rejected base claim, but would be allowable if rewritten in independent form including all of the limitations of the base claim and any intervening claims. Regarding claim 7, the prior art does not clearly disclose the image processing method according to claim 1, wherein the displaying the virtual model on the target object in the video stream according to the position information comprises: acquiring a background material ball corresponding to the video stream, an object material ball corresponding to the virtual model and a mask map of the virtual model, the mask map comprising facial feature information and local skin color information of the virtual model; fusing the background material ball and the object material ball according to the mask map to obtain a fused virtual model; performing an edge feathering process on the fused virtual model to obtain a feathered virtual model; and displaying the feathered virtual model on the target object in the video stream according to the position information. Regarding claim 20, similar reasoning as discussed in claim 7 is applied. Conclusion The prior art made of record and not relied upon is considered pertinent to applicant's disclosure. Moon et al. (US 2023/0117639 A1) discloses determining background material properties in an image. Any inquiry concerning this communication or earlier communications from the examiner should be directed to PHI HOANG whose telephone number is (571)270-3417. The examiner can normally be reached Mon-Fri 8:00-5:00. Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, JASON CHAN can be reached at (571)272-3022. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300. Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. /PHI HOANG/Primary Examiner, Art Unit 2619
Read full office action

Prosecution Timeline

Jul 24, 2024
Application Filed
Jan 09, 2026
Non-Final Rejection — §101, §103 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

Patent 12602889
METHOD AND SYSTEM OF RENDERING A 3D IMAGE FOR AUTOMATED FACIAL MORPHING
2y 5m to grant Granted Apr 14, 2026
Patent 12592010
NEURAL NETWORK-BASED IMAGE LIGHTING
2y 5m to grant Granted Mar 31, 2026
Patent 12579624
DISPLAY DEVICE AND OPERATING DRIVING THEREOF
2y 5m to grant Granted Mar 17, 2026
Patent 12561885
METHOD, SYSTEM, AND MEDIUM FOR ARTIFICIAL INTELLIGENCE-BASED COMPLETION OF A 3D IMAGE DURING ELECTRONIC COMMUNICATION
2y 5m to grant Granted Feb 24, 2026
Patent 12561866
CONTENT-SPECIFIC-PRESET EDITS FOR DIGITAL IMAGES
2y 5m to grant Granted Feb 24, 2026
Study what changed to get past this examiner. Based on 5 most recent grants.

AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.
Powered by AI — typically takes 5-10 seconds

Prosecution Projections

1-2
Expected OA Rounds
82%
Grant Probability
98%
With Interview (+17.0%)
2y 8m
Median Time to Grant
Low
PTA Risk
Based on 928 resolved cases by this examiner. Grant probability derived from career allow rate.

Sign in with your work email

Enter your email to receive a magic link. No password needed.

Personal email addresses (Gmail, Yahoo, etc.) are not accepted.

Free tier: 3 strategy analyses per month