Prosecution Insights
Last updated: April 19, 2026
Application No. 18/696,991

LAYOUT ANALYSIS SYSTEM, LAYOUT ANALYSIS METHOD, AND PROGRAM

Non-Final OA §102
Filed
Mar 28, 2024
Examiner
LIN, JESSICA YIFANG
Art Unit
2668
Tech Center
2600 — Communications
Assignee
Rakuten Group Inc.
OA Round
1 (Non-Final)
75%
Grant Probability
Favorable
1-2
OA Rounds
2y 3m
To Grant
99%
With Interview

Examiner Intelligence

Grants 75% — above average
75%
Career Allow Rate
3 granted / 4 resolved
+13.0% vs TC avg
Strong +33% interview lift
Without
With
+33.3%
Interview Lift
resolved cases with interview
Typical timeline
2y 3m
Avg Prosecution
29 currently pending
Career history
33
Total Applications
across all art units

Statute-Specific Performance

§101
7.9%
-32.1% vs TC avg
§103
53.5%
+13.5% vs TC avg
§102
32.7%
-7.3% vs TC avg
§112
4.0%
-36.0% vs TC avg
Black line = Tech Center average estimate • Based on career data from 4 resolved cases

Office Action

§102
Notice of Pre-AIA or AIA Status The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA . Information Disclosure Statement The information disclosure statement (IDS) submitted on March 28, 2024 is in compliance with the provisions of 37 CFR 1.97. Accordingly, the information disclosure statement is being considered by the examiner. Claim Rejections - 35 USC § 102 In the event the determination of the status of the application as subject to AIA 35 U.S.C. 102 and 103 (or as subject to pre-AIA 35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the basis for the rejections under this section made in this Office action: A person shall be entitled to a patent unless – (a)(2) the claimed invention was described in a patent issued under section 151, or in an application for patent published or deemed published under section 122(b), in which the patent or application, as the case may be, names another inventor and was effectively filed before the effective filing date of the claimed invention. Claim(s) 1-14 are rejected under 35 U.S.C. 102(a)(2) as being anticipated by Semenov et. al. (US Patent 2022/0198182 A1). Regarding claim 1, Semenov et. al. discloses a layout analysis system, comprising at least one processor configured to: detect a cell of each of a plurality of scales from in a document image showing a document including a plurality of components (Figure 2, 4A, 6, 8); acquire cell information relating to the cell of each of the plurality of scales (Figure 7, [0042] a heat map is generated to correspond with elements within the document being analyzed); and analyze a layout relating to the document based on the cell information on each of the plurality of scales (Figure 5, [0049]-[0050] an internal field format evaluation 500 is used in the system which uses Byte Pair Encoding (BPE) tokens that represent an input text of detected fields on the document images). Regarding claim 2, Semenov et. al. discloses the layout analysis system according to claim 1, wherein at least one processor is configured to analyze the layout based on a learning model which has learned a for-training layout relating to a for-training document (Figures 2, 6-8). Regarding claim 3, Semenov et. al. discloses the layout analysis system according to claim 2, wherein at least one processor is configured to analyze the layout by arranging the cell information on each of the plurality of scales under a predetermined condition, inputting the arranged cell information to the learning model, and acquiring a result of analysis of the layout by the learning model (Figure 2, learning stage 220, “cell information” corresponds with document information based on markup region in pixels, [0040] and regions can have a specific geometric shape, [0037] the word selection sub unit 330 of system 300 is a submodule that uses a heuristic algorithm to analyze document text). Regarding claim 4, Semenov et. al. discloses the layout analysis system according to claim 3, wherein the learning model is a Vision Transformer-based model (Figure 5, [0049]-[0050] an internal field format evaluation 500 is used in the system which uses Byte Pair Encoding (BPE) tokens that represent an input text of detected fields on the document images). Regarding claim 5, Semenov et. al. discloses the layout analysis system according to claim 3[[ or 4]], wherein at least one processor is configured to analyze the layout by inputting, to the learning model, input data obtained by arranging a plurality of pieces of cell information on a first scale under a predetermined condition and then arranging a plurality of pieces of cell information on a second scale under a predetermined condition ([0034], Figure 2 flow diagram of the method 200 for field detection in a document, learning stage 220 is the first step, and the second step is the input document field detection stage 230, and heat maps are generated for each reference element as shown in Figure 4A). Regarding claim 6, Semenov et. al. discloses the layout analysis system according to claim 3[[ or 4]], wherein the at least one processor is configured to arrange the cell information on each of the plurality of scales in the order in input data in which a data size of each of the plurality of scales is defined such that when a scale size is smaller, the data size is larger, and to input the arranged input data to the learning model (Figure 7, [0042] A heat map is generated to represent each of the training document images, and the cell is filled with a value that equals to the fraction of the area occupied by the region for the document field contained within the cell). Regarding claim 7, Semenov et. al. discloses the layout analysis system according to claim 2,wherein when a total size of the cell information on each of the plurality of scales is less than a standard size determined for input data for the learning model, the at least one processor is configured to add padding to the input data to make up for a shortfall in the total size from the standard size, to arrange the cell information on each of the plurality of scales in the order in the padded input data, and to input the arranged input data to the learning model ([0024], The training phase may involve generating heat maps corresponding to a plurality of document image pixels with a specified cell size from a rectangular grid to identify candidate regions that satisfy a threshold condition based on a pre-defined share of pixels, [0025-0026] the extracted content of each document field can be evaluated using BPE tokens by evaluating the differences between each to produce more accurate field detection in documents based on a threshold). Regarding claim 8, Semenov et. al. discloses the layout analysis system according to claim 1,wherein at least one processor is configured to acquire, from among the plurality of scales, the cell information on a scale in which a plurality of words is a unit of the cell based on any one of the plurality of words (Figure 7, [0037] the world selection subunit 330 of system 300 is a submodule that uses a heuristic algorithm to analyze document text. The text can be analyzed for selection of words on document layout based on character types, such as letters, numerals, separators, etc.). Regarding claim 9, Semenov et. al. discloses the layout analysis system according to claim 1,wherein at least one processor is configured to detect the cell of each of the plurality of scales such that at least one of the plurality of components is included in a cell having a different scale from other cells [0046, Figure 4A heat maps are generated for a chosen reference element and the grid size is a hyperparameter with different values such as 64 x64 pixels or 32 x 32 px, etc.). Regarding claim 10, Semenov et. al. discloses the layout analysis system according to claim 1, wherein at least one processor is configured to: divide the document image into a plurality of small areas based on division positions determined in advance, and to acquire small area information relating to each of the plurality of small areas, and analyze the layout based on the cell information on each of the plurality of scales and the small area information on each of the plurality of small areas (Figure 6 shows the document markup, and Figure 4A shows a representative heatmap generated with certain color elements). Regarding claim 11, Semenov et. al. discloses the layout analysis system according to claim 1,wherein the plurality of scales includes a token level in which a token including a plurality of words is a unit of the cell and a word level in which a word is a unit of the cell ([0048] the system uses BPE tokens which refers to a numeric vector representing an input text). Regarding claim 12, Semenov et. al. discloses the layout analysis system according claim 1,wherein at least one processor is configured to detect the plurality of cells by executing optical character recognition on the document image ([0022-0025], a neural network is trained using a small number of marked-up documents as training documents, Figure 1 field detection engine 122; the selected candidate regions may then be treated as the positions of the corresponding fields by applying OCR techniques to the image fragments lying within the candidate regions). Regarding claim 13, the rejection of claim 1 is incorporated herein. Semenov et. al. also discloses a layout analysis method, comprising: detecting a cell of each of a plurality of scales from in a document image showing a document including a plurality of components; acquiring cell information relating to the cell of each of the plurality of scales; and analyzing a layout relating to the document based on the cell information on each of the plurality of scales (Figure 2, 6, [0022] an document can be an invoice with Company Name, Total, Due Date, etc. with various reference elements then represented as metadata indicating the positions and names of the document fields). Regarding claim 14, the rejection of claim 1 is incorporated herein. Semenov et. al. also discloses a non-transitory computer-readable information storage medium for storing a program for causing a computer to detect a cell of each of a plurality of scales from in a document image showing a document including a plurality of components; acquire cell information relating to the cell of each of the plurality of scales; and analyze a layout relating to the document based on the cell information on each of the plurality of scales (Figure 8 shows a computer readable medium for storing the computer readable program). Conclusion Any inquiry concerning this communication or earlier communications from the examiner should be directed to JESSICA YIFANG LIN whose telephone number is (571)272-6435. The examiner can normally be reached M-F 7:00am-6:15pm, with optional day off. Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice. If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Vu Le can be reached at 571-272-7332. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300. Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. /JESSICA YIFANG LIN/Examiner, Art Unit 2668 January 16, 2026 /VU LE/Supervisory Patent Examiner, Art Unit 2668
Read full office action

Prosecution Timeline

Mar 28, 2024
Application Filed
Jan 26, 2026
Non-Final Rejection — §102
Mar 30, 2026
Interview Requested
Apr 09, 2026
Examiner Interview Summary

Precedent Cases

Applications granted by this same examiner with similar technology

Patent 12597139
CONTROLLING AN ALERT SIGNAL FOR SPECTRAL COMPUTED TOMOGRAPHY IMAGING
2y 5m to grant Granted Apr 07, 2026
Study what changed to get past this examiner. Based on 1 most recent grants.

AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.
Powered by AI — typically takes 5-10 seconds

Prosecution Projections

1-2
Expected OA Rounds
75%
Grant Probability
99%
With Interview (+33.3%)
2y 3m
Median Time to Grant
Low
PTA Risk
Based on 4 resolved cases by this examiner. Grant probability derived from career allow rate.

Sign in with your work email

Enter your email to receive a magic link. No password needed.

Personal email addresses (Gmail, Yahoo, etc.) are not accepted.

Free tier: 3 strategy analyses per month