Last updated: May 29, 2026

Application No. 18/316,592

METHODS AND SYSTEMS FOR TRANSFORMING UNSTRUCTURED DATA TO STRUCTURED DATA

Non-Final OA §103§112

Filed

May 12, 2023

Priority

Aug 09, 2022 — continuation of 17/818,636

Examiner

MUHEBBULLAH, SAJEDA

Art Unit

2174

Tech Center

2100 — Computer Architecture & Software

Assignee

Ivalua S A S

OA Round

5 (Non-Final)

Interview Optional

— +34.8% interview lift. Interview already conducted in this application's prosecution history. This examiner has a 30% grant rate with +34.8% interview lift. Since an interview has already been tried, recommend written response with narrowed claims based on precedent claim evolution patterns.

Based on 256 resolved cases, 2023–2026

Examiner Intelligence

MUHEBBULLAH, SAJEDA View full profile →

Grants only 30% of cases

Career Allowance Rate

76 granted / 256 resolved

-25.3% vs TC avg

Strong +35% interview lift

Without

With

+34.8%

Interview Lift

resolved cases with interview

Typical timeline

4y 9m

Avg Prosecution

17 currently pending

Career history

287

Total Applications

across all art units

Statute-Specific Performance

§101

0.4%

-39.6% vs TC avg

§103

93.2%

+53.2% vs TC avg

§102

6.3%

-33.7% vs TC avg

§112

0.1%

-39.9% vs TC avg

Black line = Tech Center average estimate • Based on career data from 256 resolved cases

Office Action

§103 §112

DETAILED ACTION
This communication is responsive to RCE/Amendment filed 01/08/2026.
Claims 1 and 3-8 are pending in this application. In the Amendment, claims 1 and 7-8 are amended. 

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .


Response to Arguments
Applicant’s arguments with respect to claims amended 01/08/2026 have been considered but are moot because the new ground of rejection does not rely on any reference applied in the prior rejection of record for any teaching or matter specifically challenged in the argument.

Claim Rejections - 35 USC § 112
The rejection has been withdrawn as necessitated by the amendment.


Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claims 1 and 5-8 are rejected under 35 U.S.C. 103 as being unpatentable over Jayaraman et al. (“Jayaraman”, US 2021/0256097) in view of Priestas et al. (“Priestas”, US 2019/0005012) in view of Liang et al. (“Liang” US 2018/0060302) and further in view of Sirangimoorthy et al. (“Sirangimoorthy”, US 2021/0248153).

As per claim 1, Jayaraman teaches a computer-implemented method of transforming an unstructured set of data to a structured set of data (Jayaraman, para.13, extract data into structured format), wherein the unstructured set of data comprises a common file format document (Jayaraman, para.14, 33, input file formats i.e. DOC/PDF/RTF/TXT); the method comprising:
receiving, by at least one processor of a web service provider (Jayaraman, para.15-16, 20, document content extraction platform 100 communicatively connected via Internet), a plurality of segments of the unstructured set of data, wherein at least a portion of the plurality of segments are clauses of a document that contain text (Jayaraman, para.12, 22-23, document preprocessing of contract document that includes clauses and identifying relevant text blocks); 
generating, by the at least one processor of the web service provider, a JSON (Java Script Object Notation) file that includes locations of the segments within the common file format document (Jayaraman, para.19-20, 25, 30, 36, row level identifiers; extracted entities located and classified in JSON format);
forwarding, by the at least one processor of the web service provider to a processor of a buyer-side computer, the JSON file (Jayaraman, para.19-20, interface 104 used to transfer data from storage 112 to device 102 where data storage 112 is separately connected to platform 100; para.30, structured records viewable by user interface 104); 
adding, by the processor of the buyer-side computer, bookmarks to the common file format document (Jayaraman, para.20, 22-23, 31, 36-38, detected text blocks and table structures tagged with unique identifier to identify relevant content; mark relevant text);
forwarding, by the processor of the buyer-side computer to the at least one processor of the web service provider, the text of each segment extracted from the common file format document (Jayaraman, para.20, 25, 31, 33-34, 38, feedback updates record);
classifying, by at least one processor of the web service provider, each of the plurality segments by applying each segment to a neural network classification model (Jayaraman, para.17-18, 20, 25-26, 30, 36-38, entity classified into categories by machine learning model i.e. R-CNN; designated text blocks);
generating, by the at least one processor of the web service provider, a JSON file that includes the classifications for the plurality of segments and forwarding the JSON file to the processor of the buyer-side computer (Jayaraman, para.20, 38, text blocks with relevant entities tagged and passed on for further processing); 
extracting key terms from the text of the at least some of the plurality of segments by the at least one processor of the web service provider using an extraction model (Jayaraman, para.17, 20, 30, 38, extracting key/value pairs corresponding to entities), and
generating, by the at least one processor of the web service provider, a JSON file that includes the extracted key terms for the at least some of the plurality of segments and sending the JSON file to the processor of the buyer-side computer (Jayaraman, para.19-20, 30, 38, structured stored records store extracted key/value pairs corresponding to entities). 
However, Jayaraman does not explicitly teach the bookmarks indicating the segments and extracting, by the processor of the buyer-side computer, the text of the segment from the common file format document using the bookmarks. Priestas teaches a document extraction method that includes bookmarking to indicate the segments of a document (Priestas, para.17, 25-29, 39-41, markup tags) and extracting, by the processor of the buyer-side computer, the text of the segment from the common file format document using the bookmark (Priestas, para.40, 48, 53, text extractor 208 extracts text from markup file). It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to include Priestas’ teaching with Jayaraman’s method in order to extract data based on tagged sections.
Furthermore, the method of Jayaraman and Priestas does not explicitly teach concatenating, by the processor of the buyer-side computer, the classifications with the text to generate modified text for each of the plurality of segments. Liang teaches a text analysis method that includes concatenating, by the processor of the buyer-side computer, classifications with text to generate modified text for each of the plurality of segments (Liang, para.23, 137-139, 166, 215, entity 514 may modify text tag). It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to include Liang’s teaching with the method of Jayaraman and Priestas in order to provide user feedback.
Additionally, the method of Jayaraman, Priestas and Liang does not teach generating a first further JSON file and forwarding the first further JSON file to the processor of the buyer-side computer; sending in a second further JSON file, by the processor of the buyer-side computer to the at least one processor of the web service provider, the modified text for at least some of the plurality of segments; the extraction model selected from a plurality of extraction models based on the classification of each of the plurality of segments; generating, a third further JSON file and sending the third further JSON file to the processor of the buyer-side computer; and generating the structured set of data, by the processor of the buyer-side computer using the modified text of the classified segments and the extracted key terms in the third further JSON file. 
Sirangimoorthy teaches a method of generating structured documents that includes generating a first further JSON file and forwarding the first further JSON file to the processor of the buyer-side computer (Sirangimoorthy, para.27-33, 38, 57-58, document converter 142 generates first/second/third intermediate documents which are structured by the structured document generator 144); sending in a second JSON file (Sirangimoorthy, para.20, 23, 27-33, 36, 38, 40, 57, intermediate plaintext i.e. JSON document with location of extracted text; document converter 142 forwards intermediate document), by the processor of the buyer-side computer to the at least one processor of the web service provider (Sirangimoorthy, Fig.1, para. 20, 67 remote application server 130), the modified text for at least some of the plurality of segments (Sirangimoorthy, para.21, 28-31, second intermediate document); the extraction model selected from a plurality of extraction models based on the classification of each of the plurality of segments (Sirangimoorthy, para.26, 35, 39, 58, document analyzer extracts based on domain-specific ontology determined by key words); generating, a third further JSON file and sending the third further JSON file to the processor of the buyer-side computer (Sirangimoorthy, para.27-33, 38, 57-58, document converter 142 generates first/second/third intermediate documents which are structured by the structured document generator 144); and generating the structured set of data, by the processor of the buyer-side computer using the modified text of the classified segments and the extracted key terms in the third further JSON file (Sirangimoorthy, para.21, 28-33, 38, 41, 58, structured document generator uses intermediate documents).  It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to include Sirangimoorthy’s teaching with the method of Jayaraman, Priestas and Liang in order to extract data based on context and modify structured records.

As per claim 5, the method of Jayaraman, Priestas, Liang and Sirangimoorthy teaches the method of claim 1, comprising performing one or more of the following, by the at least one processor of the web service provider: annotating the segments; clustering a plurality of structured sets of data; generating a sematic meaning for the segments; scoring the segments and/or the structured set of data; querying the structured set of data; normalizing the data of the structured set of data; navigating the segments using the logical grouping of segments having the same classification (Sirangimoorthy, para.25, 45, navigating segments).

As per claim 6, the method of Jayaraman, Priestas, Liang and Sirangimoorthy teaches the method according to claim 1, wherein the unstructured set of data and the structured set of data correspond to a contract document (Jayaraman, para.22, contract).

Claims 7-8 are similar in scope to claim 1, and are therefore rejected under similar rationale.

Claim 3 is rejected under 35 U.S.C. 103 as being unpatentable over Jayaraman et al. (“Jayaraman”, US 2021/0256097), Priestas et al. (“Priestas”, US 2019/0005012), Liang et al. (“Liang” US 2018/0060302) and Sirangimoorthy et al. (“Sirangimoorthy”, US 2021/0248153) in view of Hurd et al. (“Hurd”, US 2021/0082062).

As per claim 3, the method of Jayaraman, Priestas, Liang and Sirangimoorthy teaches the method of claim 1, wherein the extraction model selected from the plurality of extraction models is dependent on the classification (Sirangimoorthy, p.26, 39, document analyzer extracts based on domain determined by key words), however does not teach wherein the classification model outputs, by the at least one processor of the web service provider, a confidence score for the classification and wherein the extraction model selected from the plurality of extraction models is dependent on the confidence score.  Hurd teaches a method of summarizing unstructured documents wherein the classification model outputs, by the at least one processor of the web service provider, a confidence score for the classification and wherein the extraction model selected from the plurality of extraction models is dependent on the confidence score (Hurd, para.19, 35-37, 49-50, confidence level). It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to include Hurd’s teaching with the method of Jayaraman, Priestas, Liang and Sirangimoorthy in order to determine the accuracy of extraction.

Claim 4 is rejected under 35 U.S.C. 103 as being unpatentable over Jayaraman et al. (“Jayaraman”, US 2021/0256097), Priestas et al. (“Priestas”, US 2019/0005012), Liang et al. (“Liang” US 2018/0060302) and Sirangimoorthy et al. (“Sirangimoorthy”, US 2021/0248153) in view of Fujimoto et al. (“Fujimoto”, US 2023/0010202).

As per claim 4, the method of Jayaraman, Priestas, Liang and Sirangimoorthy teaches the method according to claim 1, wherein the set of extraction models comprises an extraction model corresponding to each of a plurality of classifications (Sirangimoorthy, p.26, 39, document analyzer extracts based on domain determined by key words), however does not teach wherein a generic extraction model is selected, by the at least one processor of the web service provider, when the classification of the segment does not correspond to one of said plurality of classifications.  Fujimoto teaches a method of extracting data wherein a generic extraction model is selected dependent on the data classification (Fujimoto, para.126-127, 164-165, 187, 202, generic feature extractor/handwriting recognition model). It would have been obvious to one of ordinary skill in the art before the effective filing date of the claimed invention to include Fujimoto’s teaching with the method of Jayaraman, Priestas, Liang and Sirangimoorthy in order to extract newly acquired data.

Conclusion
The prior art made of record and not relied upon is considered pertinent to applicant's disclosure.
Newey (US 2021/0073257) teaches a method of auto-tagging bookmark sections of text.

Inquiries
Any inquiry concerning this communication or earlier communications from the examiner should be directed to SAJEDA MUHEBBULLAH whose telephone number is (571)272-4065. The examiner can normally be reached Mon-Tue/Thur-Fri 10am-8pm.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, William L Bashore can be reached on 571-272-4088. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.

/S.M./
Sajeda MuhebbullahExaminer, Art Unit 2174       


							/WILLIAM L BASHORE/                                                                                    Supervisory Patent Examiner, Art Unit 2174

Read full office action

Prosecution Timeline

Show 9 earlier events

May 22, 2025

Examiner Interview Summary

May 22, 2025

Applicant Interview (Telephonic)

Jun 20, 2025

Response Filed

Oct 08, 2025

Final Rejection mailed — §103, §112

Dec 05, 2025

Response after Non-Final Action

Jan 08, 2026

Request for Continued Examination

Jan 09, 2026

Response after Non-Final Action

Mar 04, 2026

Non-Final Rejection mailed — §103, §112 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

17/658,028

Patent 12510876

MACHINE STATE VISUALIZATION

3y 9m to grant Granted Dec 30, 2025

16/001,629

Patent 12271745

METHOD AND SYSTEM FOR RECONCILING USER INTERACTIONS

6y 10m to grant Granted Apr 08, 2025

17/354,269

Patent 12260841

MULTIPLE PRIMARY USER INTERFACES

3y 9m to grant Granted Mar 25, 2025

16/793,926

Patent 12248673

SYSTEMS AND METHODS FOR ATTRIBUTING A SCROLL EVENT IN AN INFINITE SCROLL GRAPHICAL USER INTERFACE

5y 0m to grant Granted Mar 11, 2025

16/733,962

Patent 12001661

Bound Based Contextual Zoom

4y 5m to grant Granted Jun 04, 2024

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

5-6

Expected OA Rounds

30%

Grant Probability

64%

With Interview (+34.8%)

4y 9m (~1y 8m remaining)

Median Time to Grant

High

PTA Risk

Based on 256 resolved cases by this examiner. Grant probability derived from career allowance rate.