Prosecution Insights
Last updated: April 19, 2026
Application No. 18/197,327

Physical World Driven Environmental Themes for Avatars in Virtual/Augmented Reality Systems

Status: Final Rejection (§103)
Filed: May 15, 2023
Examiner: TRUONG, KARL DUC
Art Unit: 2614
Tech Center: 2600 — Communications
Assignee: International Business Machines Corporation
OA Round: 4 (Final)
Grant Probability: 52% (Moderate)
OA Rounds: 5-6
To Grant: 2y 7m
With Interview: 83%

Examiner Intelligence

This examiner grants 52% of resolved cases.

Career Allow Rate: 52% (15 granted / 29 resolved; -10.3% vs TC avg)
Interview Lift: +31.0% among resolved cases with interview (strong)
Avg Prosecution: 2y 7m (typical timeline); 45 applications currently pending
Career History: 74 total applications across all art units

Statute-Specific Performance

§101: 3.2% (-36.8% vs TC avg)
§103: 85.3% (+45.3% vs TC avg)
§102: 9.5% (-30.5% vs TC avg)
§112: 2.1% (-37.9% vs TC avg)
TC averages are estimates • Based on career data from 29 resolved cases

Office Action

§103
DETAILED ACTION

Notice of Pre-AIA or AIA Status

The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA.

Response to Amendment

This action is in response to the amendment filed on January 30, 2026. Claims 1, 11, and 20 have been amended. Claims 1-20 remain rejected in the application.

Response to Arguments

Applicant's arguments with respect to Claims 1, 11, and 20, filed on January 30, 2026, concerning the rejection under 35 U.S.C. § 103, namely that the prior art does not teach "each location within the computer generated virtual environment is associated with respective metadata specifying an intended emotion for that location," have been fully considered but are not persuasive.

In response to applicant's argument that the prior art does not teach "each location within the computer generated virtual environment is associated with respective metadata specifying an intended emotion for that location" as recited in Claim 1, these limitations are taught by Sumant. In particular, Sumant teaches the following:

Paragraph [0115]: discloses determining "whether modifications to the video game 112 <read on computer generated virtual environment> cause an emotional state <read on intended emotion> or predicted emotional state of the user to move in a particular direction," where "based on the particular direction of the change in emotional state or predicted emotional state, the game configuration system 134 may determine whether to make additional changes to the video game 112";

Paragraph [0115]: further discloses adapting portions of video game 112 based on the user's current emotional state, where if the current emotional state does not match the predicted emotional state (i.e., not achieving a desired emotional state), then changes to the appropriate portions <read on location> of video game 112 are increased; and

Paragraph [0114]: discloses that particular configurations of video game 112 associated with certain emotional states, such as a negative emotional state, are stored at the game configuration repository 144, where the process of determining a predicted emotional state in a game environment can apply to other factors, such as the number of players. In addition, in order for the system to keep track of desirable game modifications, one skilled in the art would use metadata with a flag indicator to modify aspects of certain areas in a virtual environment, such as the type of music that plays to achieve the desired emotion. Furthermore, the stored video game configurations in the repository are being interpreted as respective metadata that specifies an intended emotion for each location.

Therefore, applicant's remark cannot be considered persuasive. Regarding arguments to Claims 2-10 and 12-19, they depend directly or indirectly on independent Claims 1, 11, and 20, respectively. Applicant does not argue anything other than independent Claims 1, 11, and 20. The limitations in those claims, in combination, were previously established as explained.

Claim Rejections - 35 USC § 103

The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:

A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 1-3, 6-13, and 16-20 are rejected under 35 U.S.C. 103 as being unpatentable over Baughman et al. (US 20180217808 A1, previously cited), hereinafter referenced as Baughman, in view of Sumant et al. (US 20200206631 A1, previously cited), hereinafter referenced as Sumant.

Regarding Claim 1, Baughman discloses a method, in a data processing system, for personalizing a computer generated virtual environment (Baughman, [0021]: teaches "a method for modifying a user response to visual and/or audio experience by augmenting the visual and/or audio experience of a user with alternative sounds determined to modify the user response by applying machine learning techniques, based on user feedback <read on personalizing a computer generated virtual environment>"), the method comprising:

collecting, from one or more sensors associated with a user, emotion data representing physiological conditions of the user in response to stimuli (Baughman, [0049]: teaches a machine learning program 300 comparing collective biometric data <read on physiological data of user> of the user's response to a pre-existing repository of biometric data associated with emotional and state-of-mind responses; [0048]: teaches the biometric feedback data coming from a biometric sensor 140; [0046]: teaches "in response to a given stimulus, the menu of populated responses can include a range of emotions <read on emotion data>, or states of mind <read on physiological conditions>, including: fear, anger, sadness, joy, disgust, trust, anticipation, surprise, shame, pity, indignation, envy, and love");

collecting, from one or more data source computing systems, stimuli context data and correlating the stimuli context data with the emotion data (Baughman, [0033]: teaches a repository 130 being a database, where it "includes stored data associating <read on correlating stimuli context data> the effect of one stimulus/response association based on machine learning");

[[training, via a machine learning training process, one or more machine learning computer models based on the emotion data and correlated stimuli context data to thereby generate one or more trained machine learning computer models that are trained to predict an emotion of the user from patterns of input data;]]

receiving runtime emotion data from the one or more sensors associated with the user (Baughman, [0042]: teaches camera 240 being used "to determine a user's response to a given stimulus by associating facial expressions to the relevant emotion <read on runtime emotion data>");

receiving runtime stimuli context data from a virtual environment provider computing system for the computer generated virtual environment (Baughman, [0033]: teaches a repository 130 being a database that can be updated in real-time, where it "includes stored data associating the effect of one stimulus/response association <read on runtime stimuli context data> based on machine learning"; [0040]: teaches an augmented display 235 of an AR headset 115 <read on environment provider computing system> displaying a "realistic representation of a user's surrounding environment <read on computer generated virtual environment>, replacing the real-world view"); and

[[generating, by the one or more trained machine learning computer models, a predicted emotion of the user based on the runtime emotion data and runtime stimuli context data which are input to the one or more trained machine learning computer models; and]]

[[based on a determination that an intended emotion of the user is different from the predicted emotion of the user, modifying a virtual environment theme of the computer generated virtual environment to elicit the intended emotion from the user, wherein]]

[[each location within the computer generated virtual environment is associated with respective metadata specifying an intended emotion for that location, and wherein]]

[[the intended emotion of the user is determined based at least in part on metadata associated with a location corresponding to the user within the computer generated virtual environment.]]

However, Baughman does not expressly disclose training, via a machine learning training process, one or more machine learning computer models based on the emotion data and correlated stimuli context data to thereby generate one or more trained machine learning computer models that are trained to predict an emotion of the user from patterns of input data; generating, by the one or more trained machine learning computer models, a predicted emotion of the user based on the runtime emotion data and runtime stimuli context data which are input to the one or more trained machine learning computer models; and based on a determination that an intended emotion of the user is different from the predicted emotion of the user, modifying a virtual environment theme of the computer generated virtual environment to elicit the intended emotion from the user, wherein each location within the computer generated virtual environment is associated with respective metadata specifying an intended emotion for that location, and wherein the intended emotion of the user is determined based at least in part on metadata associated with a location corresponding to the user within the computer generated virtual environment.
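Before turning to the secondary reference, it may help to see the training-and-prediction step that Baughman allegedly lacks sketched in a few lines. This is a hypothetical illustration only; the feature layout, the emotion labels, and the use of scikit-learn are assumptions of this sketch, not drawn from the application or the cited references.

```python
# Hypothetical sketch of the claimed training step: fit a model on collected
# emotion data and correlated stimuli context so it can later predict the
# user's emotion from patterns of input data. Features and labels are
# illustrative assumptions, not taken from Baughman or Sumant.
from sklearn.tree import DecisionTreeClassifier

# One row per collected sample: [heart_rate_bpm, skin_conductance, loud_sound_present]
X_train = [[72, 0.2, 0], [118, 0.9, 1], [80, 0.3, 0], [125, 0.8, 1]]
y_train = ["calm", "fear", "calm", "fear"]  # emotion observed for each sample

model = DecisionTreeClassifier().fit(X_train, y_train)

# Runtime: fresh sensor readings plus stimuli context are fed to the trained model.
print(model.predict([[110, 0.7, 1]]))  # -> ['fear'] (the predicted emotion)
```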
Sumant discloses training, via a machine learning training process, one or more machine learning computer models based on the emotion data and correlated stimuli context data to thereby generate one or more trained machine learning computer models that are trained to predict an emotion of the user from patterns of input data (Sumant, [0065]: teaches a model generation system 146 that includes a model generation rule set 170 for generating prediction model 160 <read on generate trained machine learning computer model> to predict the emotional state of the user, where "the prediction model 160 and/or the respective parameters 162 of the prediction models 160 may be derived during a training process based on particular input data, such as the historical data 152 <read on patterns of input data>, feedback data 154, and control data 156, and defined output criteria, which may be included with the control data 156, used for training purposes"; [0070]: teaches a retention analysis system 140 receiving input data 172 and applying said input data 172 to prediction model(s) 160, where the input data 172 includes "one or more pieces of sensory data associated with a user who is playing the video game 112"; [0077]: teaches sensory data including sensor data associated with user emotion/emotional state <read on emotion data>, which includes associated audio data, visual/image data, physiological data, etc. <read on correlated stimuli context data> that would trigger the emotional state);

generating, by the one or more trained machine learning computer models, a predicted emotion of the user based on the runtime emotion data and runtime stimuli context data which are input to the one or more trained machine learning computer models (Sumant, [0098]: teaches an emotion analysis system 126 of a retention analysis system 140 using a prediction model <read on machine learning model> to "determine a predicted emotional state <read on generating predicted emotion> for the user based at least in part on the set of sensory <read on runtime stimuli context data> and biometric data <read on runtime emotion data> received or otherwise obtained at the block 402"; [0065]: teaches "the prediction model 160 and/or the respective parameters 162 of the prediction models 160 may be derived during a training process based on particular input data, such as the historical data 152, feedback data 154, and control data 156, and defined output criteria, which may be included with the control data 156, used for training purposes"); and

based on a determination that an intended emotion of the user is different from the predicted emotion of the user, modifying a virtual environment theme of the computer generated virtual environment to elicit the intended emotion from the user (Sumant, [0102]: teaches decision block 408, where if it determines that "the desired emotional state <read on intended emotion> does not match the predicted emotional state, the process 400 proceeds to the block 410," which selects a video game configuration that is associated with the desired emotional state for the user, such as reducing the lighting in the game <read on modifying virtual theme>),

wherein each location within the computer generated virtual environment is associated with respective metadata specifying an intended emotion for that location (Sumant, [0115]: teaches determining "whether modifications to the video game 112 <read on computer generated virtual environment> cause an emotional state <read on intended emotion> or predicted emotional state of the user to move in a particular direction," where "based on the particular direction of the change in emotional state or predicted emotional state, the game configuration system 134 may determine whether to make additional changes to the video game 112"; [0115]: further teaches adapting portions of video game 112 based on the user's current emotional state, where if the current emotional state does not match the predicted emotional state (i.e., not achieving a desired emotional state), then changes to the appropriate portions <read on location> of video game 112 are increased; [0114]: teaches that particular configurations of video game 112 associated with certain emotional states, such as a negative emotional state, are stored at the game configuration repository 144; Note: it should be noted that the process of determining a predicted emotional state in a game environment can apply to other factors, such as the number of players; in addition, in order for the system to keep track of desirable game modifications, one skilled in the art would use metadata with a flag indicator to modify aspects of certain areas in a virtual environment, such as the type of music that plays to achieve the desired emotion; furthermore, the stored video game configurations in the repository are being interpreted as respective metadata that specifies an intended emotion for each location), and

wherein the intended emotion of the user is determined based at least in part on metadata associated with a location corresponding to the user within the computer generated virtual environment (Sumant, [0086]: teaches sensors 116 being included as part of a VR/AR system <read on user within computer generated virtual environment>; [0092]: teaches modifying a state or configuration <read on metadata> of video game 112, such as music, sound effects, lighting <read on location>, etc.; [0096]: teaches process 400, which describes modifying a video game state/configuration based at least in part on a desired emotional state <read on intended emotion> of a user playing video game 112).

Sumant is analogous art with respect to Baughman because they are from the same field of endeavor, namely utilizing neural networks to elicit emotional responses of users via stimuli/triggers. Before the effective filing date of the claimed invention, it would have been obvious to a person of ordinary skill in the art to implement a state modification process based on the elicited emotion of the user, an emotion analysis system, and a retention analysis system as taught by Sumant into the teaching of Baughman. The suggestion for doing so would allow for real-time dynamic scene modification that ensures the user stays engaged with the virtual content, thereby improving the overall user experience. Therefore, it would have been obvious to combine Sumant with Baughman.
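The dispute turns on treating Sumant's stored game configurations as per-location metadata naming an intended emotion. A minimal sketch of that interpretation follows; the schema, the location names, and the themes are hypothetical, not drawn from Sumant.

```python
# Minimal sketch of the disputed limitation: each location in the virtual
# environment carries metadata naming the emotion it is meant to elicit,
# and a mismatch with the model's predicted emotion triggers a theme change.
# The schema and all names are hypothetical, not drawn from Sumant.
from typing import Optional

LOCATION_METADATA = {
    "forest_clearing": {"intended_emotion": "calm", "theme": "soft_ambient"},
    "boss_arena":      {"intended_emotion": "fear", "theme": "dark_storm"},
}

def maybe_modify_theme(user_location: str, predicted_emotion: str) -> Optional[str]:
    """Return a replacement theme when the predicted emotion differs from
    the intended emotion for the user's current location."""
    intended = LOCATION_METADATA[user_location]["intended_emotion"]
    if predicted_emotion != intended:
        # e.g. dim the lighting or swap the music to steer toward the intended emotion
        return LOCATION_METADATA[user_location]["theme"]
    return None  # the user already feels what the location intends

print(maybe_modify_theme("boss_arena", "calm"))  # -> 'dark_storm'
```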
Regarding Claim 11, it recites limitations that are similar in scope to Claim 1, but in a non-transitory computer-readable medium. As shown in the rejection, the combination of Baughman and Sumant discloses the limitations of Claim 1. Additionally, Baughman discloses a non-transitory computer-readable medium storing a set of instructions for wireless communication (Baughman, [0067]: teaches memory 506 <read on non-transitory computer-readable medium> and persistent storage 508 being computer-readable storage media, where memory 506 "can include any suitable volatile or non-volatile computer-readable storage media"; [0068]: teaches memory 506 being computer-readable storage media that is capable of storing program instructions or digital information), the set of instructions comprising: one or more instructions that, when executed by one or more processors of a device, cause the device to (Baughman, [0068]: teaches a machine learning program and sound association program 400 being stored "in persistent storage 508 for execution by one or more of the respective computer processors 504 via one or more memories of memory 506"):… Thus, Claim 11 is met by Baughman according to the mapping presented in the rejection of Claim 1, given that the method corresponds to a non-transitory computer-readable medium.

Regarding Claim 20, it recites limitations that are similar in scope to Claim 1, but in an apparatus. As shown in the rejection, the combination of Baughman and Sumant discloses the limitations of Claim 1. Additionally, Baughman discloses an apparatus (Baughman, [0079]: teaches an apparatus) comprising: one or more processors (Baughman, [0079]: teaches a processor of a general purpose computer); and one or more memory devices coupled to the one or more processors (Baughman, [0068]: teaches a machine learning program and sound association program 400 being stored "in persistent storage 508 <read on memory devices> for execution by one or more of the respective computer processors 504 via one or more memories of memory 506"), wherein the one or more processors are configured to (Baughman, [0079]: teaches the processor of a general purpose computer):… Thus, Claim 20 is met by Baughman according to the mapping presented in the rejection of Claim 1, given that the method corresponds to an apparatus.

Regarding Claims 2 and 12, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. Baughman does not expressly disclose the limitations of Claims 2 and 12; however, Sumant discloses modifying the virtual environment theme of the computer generated virtual environment based on the predicted emotion of the user (Sumant, [0102]: teaches decision block 408, where if it determines that "the desired emotional state does not match the predicted emotional state, the process 400 proceeds to the block 410," which selects a video game configuration that is associated with the desired emotional state for the user, such as reducing the lighting in the game <read on modifying virtual theme>, to match the predicted emotional state with the desired emotional state). Sumant is analogous art with respect to Baughman because they are from the same field of endeavor, namely utilizing neural networks to elicit emotional responses of users via stimuli/triggers. Before the effective filing date of the claimed invention, it would have been obvious to a person of ordinary skill in the art to implement a state modification process based on the elicited emotion of the user, an emotion analysis system, and a retention analysis system as taught by Sumant into the teaching of Baughman. The suggestion for doing so would allow for real-time dynamic scene modification that ensures the user stays engaged with the virtual content, thereby improving the overall user experience. Therefore, it would have been obvious to combine Sumant with Baughman.

Regarding Claims 3 and 13, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 2 and 12, respectively. Baughman does not expressly disclose the limitations of Claims 3 and 13; however, Sumant discloses wherein modifying the virtual environment theme of the computer generated virtual environment based on the predicted emotion of the user comprises modifying the virtual environment theme to elicit the predicted emotion of the user from the user (Sumant, [0102]: teaches decision block 408, where if it determines that "the desired emotional state does not match the predicted emotional state, the process 400 proceeds to the block 410," which selects a video game configuration that is associated with the desired emotional state for the user, such as reducing the lighting in the game <read on modifying virtual theme>, to match the predicted emotional state <read on elicit predicted emotion> with the desired emotional state). Sumant is analogous art with respect to Baughman because they are from the same field of endeavor, namely utilizing neural networks to elicit emotional responses of users via stimuli/triggers. Before the effective filing date of the claimed invention, it would have been obvious to a person of ordinary skill in the art to implement a state modification process based on the elicited emotion of the user, an emotion analysis system, and a retention analysis system as taught by Sumant into the teaching of Baughman. The suggestion for doing so would allow for real-time dynamic scene modification that ensures the user stays engaged with the virtual content, thereby improving the overall user experience. Therefore, it would have been obvious to combine Sumant with Baughman.
Regarding Claims 6 and 16, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. Additionally, Baughman further discloses wherein the emotion data is collected from the one or more sensors associated with the user in response to stimuli present in a physical environment (Baughman, [0035]: teaches the biometric sensor 140 being a collection of sensors that "are arranged in an area in which a user is presented a stimulus, and the user's response in terms of the biometric measurements are received and associated with the particular stimulus"; [0042]: teaches camera 240 being used "to determine a user's response to a given stimulus by associating facial expressions to the relevant emotion <read on emotion data>"), and wherein the stimuli context data represents the stimuli present in the physical environment, such that the one or more machine learning computer models learn associations of input patterns of emotional responses of the user to stimuli in the physical environment (Baughman, [0034]: teaches a machine learning program learning a user's response to a particular sound within a distributed computer processing environment 100; [0033]: teaches the machine learning program 300 learning the user responses to various sounds and objects, where it associates one stimulus/response to another stimulus/response to determine the user's behavioral response based on the two stimuli <read on input patterns of emotional responses>), and applies the learning to virtual stimuli in the computer generated virtual environment (Baughman, [0062]: teaches the sound association program modifying a sound <read on apply learning to virtual stimuli in computer generated virtual environment> transmitted to a user's headphone to obtain the user's response).

Regarding Claims 7 and 17, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. Additionally, Baughman further discloses the emotion data is collected from the one or more sensors associated with the user in response to virtual stimuli present in a virtual world environment (Baughman, [0045]: teaches the machine learning program 300 presenting the user a stimulus <read on presented virtual stimuli> and receiving the user's response <read on emotion data> as shown in FIG. 3A), and wherein the stimuli context data represents the virtual stimuli present in the virtual world environment, such that the one or more machine learning computer models learn associations of input patterns of emotional responses of the user to virtual stimuli in the virtual world environment (Baughman, [0049]: teaches a machine learning program 300 comparing collective biometric data of the user's response to a pre-existing repository of biometric data associated with emotional and state-of-mind responses, where it "selects the response that most closely matches the biometric data of the user's response, and associates the matched response <read on associations of input patterns of emotional responses> from the pre-existing repository, with the stimulus as the type of response"), and applies the learning to virtual stimuli in the computer generated virtual environment (Baughman, [0062]: teaches the sound association program modifying a sound <read on apply learning to virtual stimuli in computer generated virtual environment> transmitted to a user's headphone to obtain the user's response).

Regarding Claims 8 and 18, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. Additionally, Baughman further discloses wherein the emotion data comprises at least one of brain wave pattern data, heart rate pattern data, perspiration level data, eye dilation data, breathing rate data, facial expression data, body temperature data, or blood pressure data (Baughman, [0035]: teaches "biometric information <read on emotion data> being measured by biometric sensor 140", which includes: "heartrate, breathing rate, voice tonality, skin conductivity levels, temperature, eye dilation, facial expression, iris and retinal recognition, audio response of the user, gait, and vein recognition").

Regarding Claims 9 and 19, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. Additionally, Baughman further discloses wherein the one or more sensors comprise at least one of a user wearable sensor or a sensor physically positioned in a physical environment occupied by the user, to monitor the user while the user occupies the physical environment (Baughman, [0031]: teaches the AR Headset 115 <read on user wearable sensor> and the mobile device 120 communicating with each other to provide an AR experience to the user, such as "if a user is in a living room environment and sees a chair while wearing AR Headset 115 <read on monitoring the user in the physical environment>, mobile device 120 will prompt the user to select at least one option relevant to the relationship between the user and the chair").
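Claims 8/18 and 9/19 together enumerate the physiological signals and the sensor placements at issue. Purely as an illustration of how such emotion data might be recorded, a hypothetical record type could look like the following; every field name here is an assumption of this sketch, not language from the claims or the references.

```python
# Hypothetical record for the emotion data enumerated in claims 8 and 18,
# tagged with the sensor placement distinguished in claims 9 and 19.
# All field names are illustrative assumptions.
from dataclasses import dataclass
from typing import Optional

@dataclass
class EmotionSample:
    brain_wave_pattern: Optional[list] = None
    heart_rate_bpm: Optional[float] = None
    perspiration_level: Optional[float] = None
    eye_dilation_mm: Optional[float] = None
    breathing_rate_bpm: Optional[float] = None
    facial_expression: Optional[str] = None       # e.g. "smile", "frown"
    body_temperature_c: Optional[float] = None
    blood_pressure_mmhg: Optional[tuple] = None   # (systolic, diastolic)
    sensor_placement: str = "wearable"            # "wearable" or "in_room"

sample = EmotionSample(heart_rate_bpm=115.0, facial_expression="frown")
print(sample)
```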
Regarding Claim 10, the combination of Baughman and Sumant discloses the method of Claim 1. Additionally, Baughman further discloses physical environment stimuli context data, collected from the one or more sensors, specifying objects, entities, or physical conditions of a physical environment in which the user occupies at a substantially same time as the emotion data is collected; social networking stimuli context data, from one or more social networking computing systems, specifying relationships between the user and other entities present in the physical environment in which the user occupies at a substantially same time as the emotion data is collected; event stimuli context data, from one or more event data source computing systems, specifying events occurring in the physical environment in which the user occupies at a substantially same time as the emotion data is collected; or location stimuli context data, from one or more location services computing systems, specifying characteristics of a physical location of the user corresponding to the physical environment in which the user occupies at a substantially same time as the emotion data is collected (Baughman, [0013]: teaches a stimulus/given stimulus being referred to "a natural or non-natural input sound that is associated with a given physical object <read on physical environment stimuli context data specifying objects in physical environment>, or a digital or non-digital representation of a physical object"; [0014]: teaches a response/user response being referred to "the emotional or psychological state of mind that a user is experiencing upon being exposed to a stimulus"; [0035]: teaches the biometric sensor 140 being a collection of sensors that "are arranged in an area <read on physical location of user> in which a user is presented a stimulus, and the user's response in terms of the biometric measurements are received <read on simultaneous emotion data collection> and associated with the particular stimulus").

Claims 4-5 and 14-15 are rejected under 35 U.S.C. 103 as being unpatentable over Baughman et al. (US 20180217808 A1, previously cited), hereinafter referenced as Baughman, in view of Sumant et al. (US 20200206631 A1, previously cited), hereinafter referenced as Sumant, as applied to Claims 1 and 11 above, respectively, and further in view of Cowen et al. (US 20230096485 A1, previously cited), hereinafter referenced as Cowen.

Regarding Claims 4 and 14, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. Additionally, Baughman further discloses wherein the correlated stimuli context data comprises at least one of visual stimuli, auditory stimuli, environmental stimuli, social stimuli, object stimuli, or contextual stimuli as specified in a stimuli ontology data structure (Baughman, [0021]: teaches enabling "augmentation of sound detected by a user in response to exposure of the user to a stimulus of detected sounds <read on auditory stimuli>, objects, or both"; [0023]: teaches machine learning algorithms defining characteristics <read on stimuli ontology data structure> of the types of sounds associated with an object), and wherein [[the stimuli ontology data structure is input to the one or more machine learning computer models to train the one or more machine learning computer models to predict the emotion of the user from patterns of input data.]]

However, the combination of Baughman and Sumant does not expressly disclose the stimuli ontology data structure is input to the one or more machine learning computer models to train the one or more machine learning computer models to predict the emotion of the user from patterns of input data.

Cowen discloses the stimuli ontology data structure is input to the one or more machine learning computer models to train the one or more machine learning computer models to predict the emotion of the user from patterns of input data (Cowen, [0098]: teaches "a model can be trained to predict emotion(s) based on a recording of a user and the system can create/modify virtual characters, or aspects of an AR/VR environment <read on modifying a virtual environment theme>, based on the predicted emotion(s)"; [0099]: teaches "the training data can comprise a plurality of images", where "each training image can be an image that is presented to a participant (e.g., the stimuli <read on stimuli ontology data structure>) and is labeled with the participant's ratings of emotion").

Cowen is analogous art with respect to Baughman, in view of Sumant, because they are from the same field of endeavor, namely associating stimuli with user emotions. Before the effective filing date of the claimed invention, it would have been obvious to a person of ordinary skill in the art to train a machine learning algorithm to predict an emotional rating of a user based on stimulus input and image/video training data as taught by Cowen into the teaching of Baughman, in view of Sumant. The suggestion for doing so would allow the machine learning algorithm to assess how specific people, such as therapy patients, would react to certain stimuli, thereby offering ways to aid people in achieving their goals, such as calming down from anxiety. Therefore, it would have been obvious to combine Cowen with Baughman, in view of Sumant.
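Claims 4/14 recite a stimuli ontology data structure fed into model training. As a purely illustrative sketch (the categories and vocabulary below are assumptions, not from the record), such an ontology might constrain the feature vocabulary used to encode stimuli context:

```python
# Sketch of a stimuli ontology used to build training features; the
# categories and vocabulary are hypothetical, not from the application.
STIMULI_ONTOLOGY = {
    "auditory":      ["thunder", "music", "footsteps"],
    "visual":        ["lightning", "fog"],
    "environmental": ["rain", "darkness"],
}

def encode_stimuli(observed: list) -> list:
    """One-hot encode observed stimuli against the ontology vocabulary,
    yielding a fixed-length feature vector for training or prediction."""
    vocabulary = [s for entries in STIMULI_ONTOLOGY.values() for s in entries]
    return [1 if stimulus in observed else 0 for stimulus in vocabulary]

print(encode_stimuli(["thunder", "fog"]))  # -> [1, 0, 0, 0, 1, 0, 0]
```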
Regarding Claims 5 and 15, the combination of Baughman and Sumant discloses the method and the non-transitory computer-readable medium of Claims 1 and 11, respectively. The combination of Baughman and Sumant does not expressly disclose the limitations of Claims 5 and 15; however, Cowen discloses wherein the emotion data comprises an emotional state ontology data structure specifying a plurality of predefined emotional states of the user (Cowen, [0081]: teaches predefined emotion tags <read on predefined emotional states of a user> being associated with media content <read on emotional state ontology data structure> and/or recording as ratings), and wherein portions of the emotion data are correlated with corresponding ones of predefined emotional states in the emotional state ontology data structure (Cowen, [0090]: teaches the system being trained to associate certain stimuli with emotions based on the percentage overlap in emotion tags <read on corresponding ones of predefined emotional states> in the input media content <read on emotional state ontology data structure>), and wherein the emotional state ontology data structure is input to the one or more machine learning computer models to train the one or more machine learning computer models to predict the emotion of the user from patterns of input data (Cowen, [0098]: teaches "a model can be trained to predict emotion(s) based on a recording of a user and the system can create/modify virtual characters, or aspects of an AR/VR environment <read on modifying a virtual environment theme>, based on the predicted emotion(s)"; [0099]: teaches "the training data can comprise a plurality of images", where "each training image can be an image that is presented to a participant (e.g., the stimuli) and is labeled with the participant's ratings of emotion <read on emotional state ontology data structure>").

Cowen is analogous art with respect to Baughman, in view of Sumant, because they are from the same field of endeavor, namely associating stimuli with user emotions. Before the effective filing date of the claimed invention, it would have been obvious to a person of ordinary skill in the art to implement emotion tags to the training data for the machine learning algorithm to predict emotions as taught by Cowen into the teaching of Baughman, in view of Sumant. The suggestion for doing so would allow the machine learning program to infer behavioral patterns and responses from certain environmental settings and stimuli, thereby creating a machine learning model that can accurately anticipate a user's future emotional response. Therefore, it would have been obvious to combine Cowen with Baughman, in view of Sumant.

Conclusion

The prior art made of record and not relied upon is considered pertinent to applicant's disclosure. Briggs et al. (US 20180247443 A1) discloses performing emotional analysis in a virtual reality environment; Kaushik et al. (US 20220067385 A1) discloses a machine learning engine processing video and audio from a computer simulation to identify candidate segments of the simulation for use in a video summary, which uses metadata; and Platt et al. (US 20230051795 A1) discloses mapping a user's emotion onto a virtual avatar in a virtual reality environment.

Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a). A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action.
In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.

Any inquiry concerning this communication or earlier communications from the examiner should be directed to KARL TRUONG, whose telephone number is (703) 756-5915. The examiner can normally be reached 10:30 AM - 7:30 PM. Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO-supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. If attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, Kent Chang, can be reached at (571) 272-7667. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.

Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.

/K.D.T./
Examiner, Art Unit 2614

/KENT W CHANG/
Supervisory Patent Examiner, Art Unit 2614

Prosecution Timeline

May 15, 2023: Application Filed
Mar 05, 2025: Non-Final Rejection — §103
Jun 05, 2025: Interview Requested
Jun 16, 2025: Examiner Interview Summary
Jun 16, 2025: Applicant Interview (Telephonic)
Jun 20, 2025: Response Filed
Jun 30, 2025: Final Rejection — §103
Aug 18, 2025: Interview Requested
Aug 25, 2025: Examiner Interview Summary
Aug 25, 2025: Applicant Interview (Telephonic)
Aug 27, 2025: Response after Non-Final Action
Sep 26, 2025: Request for Continued Examination
Oct 02, 2025: Response after Non-Final Action
Oct 20, 2025: Non-Final Rejection — §103
Jan 13, 2026: Interview Requested
Jan 28, 2026: Applicant Interview (Telephonic)
Jan 28, 2026: Examiner Interview Summary
Jan 30, 2026: Response Filed
Feb 23, 2026: Final Rejection — §103
Apr 12, 2026: Interview Requested

Precedent Cases

Applications granted by this same examiner with similar technology

Patent 12573149
DATA PROCESSING METHOD AND APPARATUS, DEVICE, COMPUTER-READABLE STORAGE MEDIUM, AND COMPUTER PROGRAM PRODUCT
2y 5m to grant · Granted Mar 10, 2026
Patent 12561875
ANIMATION FRAME DISPLAY METHOD AND APPARATUS, DEVICE, AND STORAGE MEDIUM
2y 5m to grant · Granted Feb 24, 2026
Patent 12494013
AUTODECODING LATENT 3D DIFFUSION MODELS
2y 5m to grant · Granted Dec 09, 2025
Patent 12456258
SYSTEMS AND METHODS FOR GENERATING A SHADOW MESH
2y 5m to grant · Granted Oct 28, 2025
Patent 12444020
FLEXIBLE IMAGE ASPECT RATIO USING MACHINE LEARNING
2y 5m to grant · Granted Oct 14, 2025
Study what changed to get past this examiner. Based on 5 most recent grants.


Prosecution Projections

Expected OA Rounds: 5-6
Grant Probability: 52%
With Interview: 83% (+31.0%)
Median Time to Grant: 2y 7m
PTA Risk: High
Based on 29 resolved cases by this examiner. Grant probability derived from career allow rate.
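As a sanity check, the headline figures follow from the career numbers above, assuming the grant probability equals the career allow rate and the interview lift is simply additive; both are assumptions about how this page computes its projections, not stated methodology.

```python
# Derivation check (assumes grant probability = career allow rate and an
# additive interview lift; both are assumptions about this page's model).
granted, resolved = 15, 29
allow_rate = granted / resolved              # 0.5172... -> displayed as 52%
interview_lift = 31.0                        # percentage points
with_interview = round(allow_rate * 100) + interview_lift
print(f"{allow_rate:.1%} -> {with_interview:.0f}% with interview")  # 51.7% -> 83%
```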
