Last updated: May 29, 2026

Application No. 17/627,847

CONVERSION DEVICE, CONVERSION METHOD, PROGRAM, AND INFORMATION RECORDING MEDIUM

Non-Final OA §101§103

Filed

Jan 18, 2022

Priority

Jul 25, 2019 — JP 2019-136728 +1 more

Examiner

LI, LIANG Y

Art Unit

2143

Tech Center

2100 — Computer Architecture & Software

Assignee

Riken

OA Round

2 (Non-Final)

Interview Optional

— +69.2% interview lift. Interview already conducted in this application's prosecution history. This examiner has a 61% grant rate with +69.2% interview lift. Since an interview has already been tried, recommend written response with narrowed claims based on precedent claim evolution patterns.

Based on 274 resolved cases, 2023–2026

Examiner Intelligence

LI, LIANG Y View full profile →

Grants 61% of resolved cases

Career Allowance Rate

168 granted / 274 resolved

+6.3% vs TC avg

Strong +69% interview lift

Without

With

+69.2%

Interview Lift

resolved cases with interview

Typical timeline

3y 3m

Avg Prosecution

17 currently pending

Career history

303

Total Applications

across all art units

Statute-Specific Performance

§101

0.7%

-39.3% vs TC avg

§103

89.6%

+49.6% vs TC avg

§102

9.4%

-30.6% vs TC avg

§112

0.2%

-39.8% vs TC avg

Black line = Tech Center average estimate • Based on career data from 274 resolved cases

Office Action

§101 §103

DETAILED ACTION
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA . This action is responsive to pending claims 1-9, 11 filed 9/12/2025.
Claim Objections
The following claim(s) are objected to for formality issues:
	In claim 1 ¶4 (a first learner ..), “configured to the one or more processors” should read “configured by the one or more processors”.
	Appropriate corrections are required.

Claim Rejections - 35 USC § 101
35 U.S.C. 101 reads as follows:
Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and requirements of this title.

Claim(s) 1-3, 5-9, 11 are rejected under 35 U.S.C. 101 because the claimed invention is directed to an abstract idea without significantly more. We analyze the claims according to the subject matter eligibility flowchart (MPEP 2106). As all claims recite statutory categories (device, method, computer media), step 1 is answered affirmatively and we proceed to step 2.

Representative claim 1 is directed to a image encoder and a partitioner that seeks to improve characteristic features of the encoder via random partitioning. This is analogous to a mental process where the mind may gain experience by exposing itself to random images in order to gain experience to better convert images into a vector. Likewise, converting images into a feature vector is a mental process analogous to using judgment and observation to determine image characteristics, and may be performed in the mind by considering, for example, presence or degree of presence of various attributes of the image, e.g., number of objects. Hence, the claims are directed to an abstract idea, that of mentally categorizing, determining the characteristics of, and classifying images.
	
	The claims portions directed to the abstract idea and the additional elements (underlined below) are analyzed below:
	For claim 1, Chai discloses: a conversion device that converts a given input vector representing an image to a feature vector of which a dimension is reduced  by a conversion model and classifies the image by the feature vector (As above, such a device may be implemented mentally), the conversion device comprising a memory that stores program code and one or more processors configured to read the program code and operate as instructed by the program code including (These additional elements recite implementation on a general purpose computer and hence does comprises mere instructions to apply the exception on a computer and hence does not constitute an integration into a practical application (2a-2). Furthermore, the use of computers is WURC in the field of classification (2b).):
	a partitioner code configured to cause the one or more processors to partition training vectors into groups randomly (partitioning randomly is a mental process, e.g., randomly deciding on batches for mental processing. See above for the analysis of the implementation on a general purpose computer via code);
	a first classifier code configured to cause the one or more processors to classify feature vectors that are obtained by converting the training vectors with the conversion model, into any one of the groups by a first classification model (Classification, such as based on image attributes via a feature vector, is a mental process), and
	a first learner code configured to the one or more processors to learn the conversion model and the first classification model by a first teacher data including the training vectors and the groups into which the training vectors are respectively partitioned (Learning patterns via training data and training vectors is a mental process), thereby sparsity of the feature vector being improved (improving sparsity, such as by reducing the number of image characteristics considered, can be performed mentally),
	wherein:
	the conversion model is implemented by an autoencoder comprising an encode part located in the former half of the autoencoder and a decode part located in the latter half of the autoencoder (Considering image characteristics of a feature vector, and considering whether such vectors correspond to the original image in order to assure correspondence, is a mental process. The additional elements recite implementation as an autoencoder, but such limitations comprise mere instructions to apply the mental process to a machine learning environment and hence does not constitute an integration into a practical application. Furthermore, the use of autoencoders is well-understood, routine, and conventional in the field of image processing),
	an input to the conversion model is fed into the encode part (receiving input is a mental process),
	the encode part produces an intermediate representation from the input (generating an encoded vector, such as by considering various image characteristics, is a mental process),
	the intermediate representation is fed into the decode part (Considering whether the generated vector of image characteristics corresponds with the image is a mental process),
	the decode part produces a result from the intermediate representation (Considering or imagining the image thereby generated is a mental process), and
	the result is treated as an output from the conversion model (Considering the result of an imagined image is a mental process),
 	the given input vector is provided as the input to the conversion model (Classifying images based on image characteristics is a mental process),
	the intermediate representation is treated as the feature vector (Considering image characteristics as a list is a mental process)), and
	the conversion model is learned by minimizing a difference between the input to the conversion model and the output from the conversion model (Considering a difference between an imagined image and an original image based on a reconstruction or reduction to a list of essential image characteristics is a mental process).

	For claim 2: the conversion device according to claim 1, wherein
	the training vectors belong to classes, respectively,
	the program code further includes (see above for analysis of implementation details on a general purpose computer):
		a second code configured to cause the one or more processors to classify a given vector into any one of the classes by a second classification model (Classification, such as of image characteristics, may be performed mentally); and
		a second learner code configured to cause the one or more processors to classify the second classification model by of a second teacher data including feature vectors that are obtained by converting the training vectors, and the classes to which the training vectors respectively belong, by the learned conversion model (Adjusting mental heuristics based on trained feature vectors, such as a listing of image characteristics, is a mental process),
	when a new input vector representing a new image is given after the second classification model is learned, the conversion device converts the new input vector to a new feature vector by the learned conversion model, and the second classifier classifies the new feature vector into any one of the classes by the learned second classification model, thereby classifying the new input vector into which a class into the new feature vector is classified (conversion into a new feature vector and subsequent classification is similar to the training process described above and may be performed mentally).

	For claim 3: the conversion device according to claim 2, wherein the feature vector has a dimension greater than the number of classes (This limitation imposed on the feature vector does not preclude performance in the mind).
  
	For claim 5: the conversion device according to claim 3, wherein the feature vector has a dimension greater than the number of the groups (This limitation imposed on the feature vector does not preclude performance in the mind).

	For claim 6: the conversion device according to claim 1, wherein the second classification model classifies the feature vector by logistic repression, ridge repression, lasso repression, support vector machine (SVM), random forest, or neural network (This merely recites the implementation of a general classification task on a machine learning model via a computer and hence does not constitute an integration into a practical application as it is a mere application and is extra solutional (2a-2). Furthermore, the use of machine learning models for classification is WURC in the field of classification (2b).).

	For claim 7: the conversion device according to claim 1, wherein probabilities that the partitioner randomly partitions the training vectors into the groups, respectively, are not equal to each other (This limitation restricting the classification does not preclude its performance in the mind).

	For claim 11: the conversion device according to claim 2, wherein the encode part is implemented by a first convolutional neural network with eight output layers, and the first classifier classifies the feature vector by a second convolutional neural network with eight output layers (This merely recites the implementation of a general classification task on a machine learning model via a computer and hence does not constitute an integration into a practical application as it is a mere application and is extra solutional (2a-2). Furthermore, the use of machine learning models, including those with varying numbers of output layers, for classification is WURC in the field of classification (2b).).
	
	Claims 8, 9 recite methods and computer media analogous to the above and are hence rejected for the same reasons.

Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.
	Claim(s) 1-3, 5-6, 8-9 are rejected under 35 U.S.C. 103 as being unpatentable over Chai ("A semi-supervised auto-encoder using label and sparse regularizations for classification", published 1/23/2019) in view of Pondenkandath ("Leveraging random label memorization for unsupervised pre-training", published 2018) in view of Okazaki (US 10803388 B2).
	For claim 1, Chai discloses: a conversion device that converts a given input vector representing an image (image classification: p.207 col.2 ¶1, §4.1 ¶1, §5 ¶2) to a feature vector of which a dimension is reduced  by a conversion model and classifies the image by the feature vector (fig.1 gives overview of the device including an input layer for accepting an input vector of a certain weight, a hidden encode layer of lower dimension, these two layers being an encoder conversion model, and finally a label layer and a classifier that classifies the image via the feature vector generated by the hidden layer):
	a partitioner code configured to cause the one or more processors to partition training vectors into groups (p.201 col.1 ¶2 (“To improve ...”), §3 ¶1 contemplates using labeled data mixed with unlabeled data in a semi-supervised technique, hence, partitioning training vectors into corresponding groups via labeling as part of data preparation for input);
	a first classifier code configured to cause the one or more processors to classify feature vectors that are obtained by converting the training vectors with the conversion model, into any one of the groups by a first classification model (fig.1: classifier, label layer), and
	a first learner code configured to the one or more processors to learn the conversion model and the first classification model by a first teacher data including the training vectors and the groups into which the training vectors are respectively partitioned (fig.1 shows the overall trainer J-LSRAE operating as learner code, the trainer making adjustments to the classification model and the conversion model via the training or teacher data including the input vectors and groups), thereby sparsity of the feature vector being improved (J-sparse would improve the sparsity of the hidden layer activations, see §3 ¶1-2, hence, improving sparsity of feature vectors, for example, those vectors containing zeros in from the input layer will be preserved or have additional zeros due to the hidden layer sparsity),
	wherein:
	the conversion model is implemented by an autoencoder comprising an encode part located in the former half of the autoencoder and a decode part located in the latter half of the autoencoder (fig.1: input layer, hidden layer, output layer, §3 ¶1),
	an input to the conversion model is fed into the encode part (fig.1: input layer, hidden layer),
	the encode part produces an intermediate representation from the input (fig.1: output of the hidden layer),
	the intermediate representation is fed into the decode part (fig.1: output of hidden layer to output layer),
	the decode part produces a result from the intermediate representation (fig.1: output of the output layer, fed into J-AE (autoencoder loss)), and
	the result is treated as an output from the conversion model (ibid),
 	the given input vector is provided as the input to the conversion model (fig.1: output of hidden layer into label layer, classifier),
	the intermediate representation is treated as the feature vector (ibid, see also §3 ¶2 (“extract more local and informative features to represent the data)), and
	the conversion model is learned by minimizing a difference between the input to the conversion model and the output from the conversion model (§3 ¶1: J-AE being the reconstruction error, see also eq.11 (p.208)).
	Chai does not disclose: wherein the partitioning into groups occurs randomly.
	Pondenkandath disclose: wherein the partitioning into groups occurs randomly (§2.4, §3-4: pre-training with random labels yielding improvement in results).
It would have been obvious before the effective filing date to a person of ordinary skill in the art to modify the device of Chai by incorporating the random labeling of Pondenkandath. Both concern the art of neural network training, and the incorporation would have, according to Pondenkandath, provide potential improvements in model performance (§4).
Chai modified by Pondenkandath does not disclose the remaining limitations.
Chai modified by Pondenkandath does not disclose the remaining limitations.
Okazaki (US 10803388 B2) discloses: a device comprising a memory that stores program code and one or more processors configured to read the program code and operate as instructed by the program code (fig.9, col.9 ¶2-5).
It would have been obvious before the effective filing date to a person of ordinary skill in the art to modify the device of Chai modified by Pondenkandath by incorporating computing hardware structure of Okazaki. Both concern the art of neural network training, and the incorporation would have, according to Okazaki, allowed for implementation on a general purpose computing devices (col.9 ¶2).

	For claim 2, Chai modified by Pondenkandath modified by Okazaki  discloses the device of claim 1, as described above. Chai further discloses: the training vectors belong to classes (p.201 col.1 ¶2 (“To improve ..”), §3 ¶1 contemplates using labeled data mixed with unlabeled data in a semi-supervised technique, hence, the training vector belonging to respective labeled classes), respectively,
	the program code further includes:
		 a second classifier code configured to cause the one or more processors to classify a given vector into one of the classes by a second classification model (fig.1: classifier, label layer); and
	a second learner that learns code configured to cause the one or more processors to classify the second classification model by a second teacher data including feature vectors that are obtained by converting the training vectors, and the classes to which the training vectors respectively belong, by the learned conversion model (fig.1: label layer, classifier, and respective learning functions (e.g., J-Label), these components being a second learner in contrast to the first learner of random labels),
	when a new input vector representing a new image is given after the second classification model is learned, the conversion device converts the new input vector to a new feature vector by the learned conversion model, and the second classifier classifies the new feature vector into any one of the classes by the learned second classification model, thereby classifying the new input vector into which a class into the new feature vector is classified (ibid: passing the new image data through the entire encode-classify network, thereby generating a classification result).  

	For claim 3, Chai modified by Pondenkandath modified by Okazaki  discloses the device of claim 2, as described above. Chai further discloses: wherein the feature vector has a dimension greater than the number of the classes (§4.1, table 1 (p.210) gives overview of dataset including number of classes, features).  

	For claim 5, Chai modified by Pondenkandath modified by Okazaki  discloses the device of claim 3, as described above. Chai modified by Pondenkandath further discloses: wherein the feature vector has a dimension greater than the number of the groups (Chai §4.1 table 1, combination of the random labeling teaching of Pondenkandath would not change these metrics1).

	For claim 6, Chai modified by Pondenkandath modified by Okazaki  discloses the device of claim 2, as described above. Chai further discloses: wherein the second classification model classifies the feature vector by logistic regression, ridge regression, lasso regression, support vector machine (SVM), random forest, or neural network (fig.1: ELM constitutes a neural network).
	Claims 8, 9 recite analogous methods and computer media and hence are rejected under the same rationale.
	Claim(s) 7 are rejected under 35 U.S.C. 103 as being unpatentable over Chai ("A semi-supervised auto-encoder using label and sparse regularizations for classification", published 1/23/2019) in view of Pondenkandath ("Leveraging random label memorization for unsupervised pre-training", published 2018) in view of Okazaki (US 10803388 B2) in view of Bekker ("Training deep neural-networks based on unreliable labels", published 2016).
	For claim 7, Chai modified by Pondenkandath modified by Okazaki  discloses the device of claim 1, as described above. Chai modified by Pondenkandath does not discloses: wherein probabilities that the partitioner randomly partitions the training vectors into the groups, respectively, are not equal to each other.
	Bekker discloses: wherein probabilities that the partitioner randomly partitions the training vectors into the groups, respectively, are not equal to each other (§3 ¶3: Bekker contemplates passage through a noisy channel, see §2 ¶1, hence, where randomness is induced based on the original distribution).
	It would have been obvious before the effective filing date to a person of ordinary skill in the art to modify the device of Chai modified by Pondenkandath by incorporating the randomized labeling teachings of Bekker. Both concern the art of exploring the effects of relevancy via randomized labels on neural networks, and the incorporation would have, according to Bekker, to produce better results in noisy channels (§1 ¶1).

	Claim(s) 11 are rejected under 35 U.S.C. 103 as being unpatentable over Chai ("A semi-supervised auto-encoder using label and sparse regularizations for classification", published 1/23/2019) in view of Pondenkandath ("Leveraging random label memorization for unsupervised pre-training", published 2018) in view of Okazaki (US 10803388 B2) in view of Yasrab ("ECRU: An encoder-decoder based convolution neural network (CNN) for road-scene understanding", published 2018).

	For claim 11, Chai modified by Pondenkandath modified by Okazaki  discloses the device of claim 2, as described above. Chai modified by Pondenkandath does not disclose the limitations of claim 11.
	Yasrab discloses: wherein the encode part is implemented by a first convolutional neural network with eight output layers, and the first classifier classifies the feature vector by a second convolutional neural network with eight output layers (fig.1 shows encoding portion, with fig.2 showing a decoder, the encoder and the decoder having eight output layers).
	It would have been obvious before the effective filing date to a person of ordinary skill in the art to modify the device of Chai modified by Pondenkandath by incorporating the CNN architecture of Yasrab. Both concern the art of neural networks for images, and the incorporation would have, according to Yasrab, allowed application to image segmentation (§1).

Response to Arguments
	In the remarks, Applicant argues:
	1. The claims are statutory based on the amendments.
	---
	Examiner respectfully disagrees; while the amended claims now recite statutory categories (step 1 of the 101 analysis), the claims are directed to an abstract idea without significantly more, as described in the rejection above.

	2. The amended limitations overcome the art of record, as the claimed invention does not employ a cost function, but rather induces sparsity by an encoder and decoder. Furthermore, the application of Al-Qatf to Yamada is improper hindsight. Furthermore, Zhang does not overcome the deficiencies of the art and hence the remaining claims are allowable.
	---
	Examiner submits that applicant’s arguments are moot in view of newly applied art.
	
Conclusion
The prior art made of record and not relied upon is considered pertinent to applicant's disclosure. Park (US 20200193269 A1) discloses a noise-augmented image decoder and classifier.
Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action.  Accordingly, THIS ACTION IS MADE FINAL.  See MPEP § 706.07(a).  Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action.  In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action.  In no event, however, will the statutory period for reply expire later than SIX MONTHS from the date of this final action.
Any inquiry concerning this communication or earlier communications from the examiner should be directed to LIANG LI whose telephone number is (303)297-4263.  The examiner can normally be reached Mon-Fri 9-12p, 3-11p MT (11-2p, 5-1a ET).
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor Jennifer Welch can be reached on (571)272-7212.  The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of an application may be obtained from Patent Center and the Private Patent Application Information Retrieval (PAIR) system. Status information for published applications may be obtained from Patent Center or Private PAIR. Status information for unpublished applications is available through Patent Center or Private PAIR to authorized users only. Should you have questions about access to Patent Center or the Private PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free).
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. The examiner is available for interviews Mon-Fri 6-11a, 2-7p MT (8-1p, 4-9p ET).

/LIANG LI/
Primary examiner AU 2143

Read full office action

Prosecution Timeline

Jan 18, 2022

Application Filed

Jan 18, 2022

Response after Non-Final Action

Jun 18, 2025

Non-Final Rejection mailed — §101, §103

Sep 12, 2025

Response Filed

Nov 05, 2025

Final Rejection mailed — §101, §103

Feb 04, 2026

Response after Non-Final Action

Feb 18, 2026

Examiner Interview (Telephonic)

Feb 25, 2026

Examiner Interview Summary

Precedent Cases

Applications granted by this same examiner with similar technology

17/811,444

Patent 12625602

Method for user interaction for data manipulation in a CAE/CAD system

3y 10m to grant Granted May 12, 2026

18/535,832

Patent 12596463

METHOD AND APPARATUS FOR IMAGE-BASED NAVIGATION

2y 3m to grant Granted Apr 07, 2026

17/445,905

Patent 12585716

INTELLIGENT RECOMMENDATION METHOD AND APPARATUS, MODEL TRAINING METHOD AND APPARATUS, ELECTRONIC DEVICE, AND STORAGE MEDIUM

4y 7m to grant Granted Mar 24, 2026

17/887,322

Patent 12585375

GENERATING SNAPPING GUIDE LINES FROM OBJECTS IN A DESIGNATED REGION

3y 7m to grant Granted Mar 24, 2026

18/152,328

Patent 12580000

MULTITRACK EFFECT VISUALIZATION AND INTERACTION FOR TEXT-BASED VIDEO EDITING

3y 2m to grant Granted Mar 17, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

2-3

Expected OA Rounds

61%

Grant Probability

99%

With Interview (+69.2%)

3y 3m (~0m remaining)

Median Time to Grant

Moderate

PTA Risk

Based on 274 resolved cases by this examiner. Grant probability derived from career allowance rate.

CONVERSION DEVICE, CONVERSION METHOD, PROGRAM, AND INFORMATION RECORDING MEDIUM

Interview Optional

Examiner Intelligence

Statute-Specific Performance

Office Action

Prosecution Timeline

Precedent Cases

Applications granted by this same examiner with similar technology

Strategy Recommendation AI-generated — please review before filing

Prosecution Projections

Ready to respond to this office action?

Sign in with your work email