Last updated: May 29, 2026

Application No. 18/595,701

COMPUTER-BASED SYSTEMS CONFIGURED TO DETERMINE ELEMENT-LEVEL DATA LINEAGE AND METHODS OF USE THEREOF

Final Rejection §103

Filed

Mar 05, 2024

Examiner

CURRAN, J MITCHELL

Art Unit

2169

Tech Center

2100 — Computer Architecture & Software

Assignee

Capital One Services LLC

OA Round

4 (Final)

This examiner grants 62% of cases after interview

— +35.5% interview lift. A telephonic interview to clarify the technical implementation could significantly improve the outcome.

Based on 109 resolved cases, 2023–2026

Examiner Intelligence

CURRAN, J MITCHELL View full profile →

Grants 62% of resolved cases

Career Allowance Rate

68 granted / 109 resolved

+7.4% vs TC avg

Strong +36% interview lift

Without

With

+35.5%

Interview Lift

resolved cases with interview

Typical timeline

3y 2m

Avg Prosecution

10 currently pending

Career history

125

Total Applications

across all art units

Statute-Specific Performance

§101

0.8%

-39.2% vs TC avg

§103

93.7%

+53.7% vs TC avg

§102

4.7%

-35.3% vs TC avg

Black line = Tech Center average estimate • Based on career data from 109 resolved cases

Office Action

§103

Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Detailed Action
This is a final Office Action for application 18/595,701, in response to arguments and amendments filed on 12/31/2025. Claims 1, 8 and 15 are currently amended. Claims 7, 14 and 20 are cancelled. Claims 1-6, 8-13, 15-19 and 21-23 are pending and examined below.

Response to Arguments
Applicant’s arguments, see pgs. 9-13, filed 2/2/2026, with respect to the rejection(s) of claim(s) 1-6, 8-13, 15-19 and 21-23 under 35 USC § 102 and 35 USC § 103 have been fully considered and are persuasive.  Therefore, the rejection has been withdrawn.  However, upon further consideration, a new ground(s) of rejection is made in view of Kozina et al. (US Pub. 2014/0114907).

Claim Rejections - 35 USC § 103
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA  to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claim(s) 1, 4-6, 8, 11-13, 15, 18-19 and 21-23  is/are rejected under 35 U.S.C. 103 as being unpatentable over Guan et al. (US Pat. 11,811,626) in view of Kozina et al. (US Pub. 2014/0114907).
Regarding claim 1, Guan teaches 
	A computer-implemented method comprising: retrieving, by at least one processor, an output data record comprising a plurality of first data elements each first data element representing an output data element output from at least one transformation of a plurality of transformations wherein the at least one transformation of the plurality of transformations produces the output data element from a second data element representing an input data element to the at least one transformation; (Figs. 6, 9; Col. 10 [Lines 54-63], Col. 11 [Lines 42-47], Col. 11 [Line 60] - Col. 12 [Line 28], Col. 14 [Lines 1-67] ; a first graph (Fig. 6) is generated  (i.e. output data element from at least one transformation) and can come from one or more types of media data (i.e. a related second data element representing an input element), including audio data that is converted to text before the text is then extracted in the multi-step (i.e. a plurality) process of transforming the input to produce the output)
	wherein the plurality of data transformations produces output data from input data based at least in part on a data transformation process from a data source to a data target; (Col. 11 [Lines 42-47]) audio data is converted to text before the text is then extracted in the multi-step (i.e. a plurality) process of transforming the input to produce the output)
obtaining, by the at least one processor, at least one input data record comprising a plurality of second data elements, each second data element representing input data input into the at least one transformation; (Figs. 6, 9; Col. 10 [Lines 54-63], Col. 11 [Line 60] - Col. 12 [Line 28], Col. 14 [Lines 1-67] ; a first graph (Fig. 6) is generated  (i.e. output data element from at least one transformation) and can come from (i.e. obtained) one or more types of media data (i.e. a related second data element representing an input element))
generating, by the at least one processor, a plurality of input-output pairs, each input-output pair representing a candidate pairing of a first data element of the plurality of first data elements with a second data element of the plurality of second data elements; (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated (including pairs) and refined based on more than one type of media data (i.e. first, second, third, etc. data elements), where a node represents a data element and an edge represents a correlation between data elements)
utilizing, by at least one processor, an element mapping module, to determine a correlation in a graph- database based at least in part on the first data element and the second data element of each input-output pair of the plurality of input-output pairs;  (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated (e.g. by an element mapping module) and refined based on user input for the more than one type of media data, where a node represents a data element and an edge represents a correlation between data elements)
	wherein the element mapping module is configured to: utilize at least one element-level mapping model to determine a statistical relationship between the first data element and the second data element of each input-output pair comprising a plurality of interconnected nodes and edges in a graph database; (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated (e.g. by an element mapping module) and refined based on user input for the more than one type of media data, where a node represents a data element and an edge represents a correlation (i.e. statistical relationship) between data elements)
	utilizing, by at least one processor, at least one element-level mapping model to determine a correlation between the first data element having interconnected nodes and edges in a graph database and a second element having interconnected nodes and edges in a graph database; (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated and refined based on user input, where a node represents a data element and an edge represents a correlation between (e.g. first and second) data elements)
	utilizing, based on at least one processor, a correlation measurement model to determine correlation between each of the nodes of the first data element and each of the nodes of the second element of the plurality of data elements based on the correlation measurement; (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated and refined based on user input, where a node represents a data element and an edge represents a correlation (i.e. statistical relationship) between (e.g. first and second) data elements, and semantic correlation can be determined by attention-based semantic analysis models (i.e. correlation measurement models))
	sorting, by the at least one processor, for each first data element of the plurality of first data elements, the plurality of second data elements based at least in part on the statistical relationship between the first data element and the second data element of each input-output pair;  (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated and refined based on user input, where a node represents a data element and an edge represents a correlation (i.e. sorted statistical relationship) between data elements)
	determine, for each first data element, at least one correlated second data of the plurality of second data element based at least in part on the sorting, and upon identifying that a correlation measurement for the first data element and the second data element exceeds a threshold degree of statistical significance, to determine a statistically significant correlation between each first data element and the respective at least one correlated second data element; (Col. 14 [Line 60] – Col. 15 [Line 2] a criterion (e.g. for sorting) is designated by the user to determine which elements to connect based on the strength (i.e. statistical significance) of the correlation)
and mapping, by the at least one processor, by updating at least one knowledge graph comprising the plurality of interconnected the nodes and edges in the graph database to include, for each first data element, at least one edge to the at least one correlated second data element. (Fig. 4; Col. 10 [Lines 13-21]; the graph is generated (i.e. updated) and refined based on user input, where a node represents a data element and an edge represents a correlation between data elements)
Guan does not explicitly teach
and mapping, by the at least one processor, for the output data record, a data lineage comprising each input-output pair of each transformation for the output data record
However, from the same field, Kozina teaches
and mapping, by the at least one processor, for the output data record, a data lineage comprising each input-output pair of each transformation for the output data record (Fig.3; Par. [0036] the data lineage propagation module (#215) contains a record mapping module (#205) and column record mapping data (#210) data for maintaining data lineage information)
	It would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to combine the data lineage module of Kozina into the correlation calculations of Guan. The motivation for this combination would have been to make data lineage discrepancies visible as explained in Kozina (Par. [0029]). 
	Regarding claim 5, Guan and Kozina teach claim 1 as shown above, and Guan further teaches
	The computer-implemented method according to claim 1, wherein sorting by the at least one processor of the nodes and edges determined by the correlation model is performed on nodes and edges comprising at least one transformation of the data elements. (Figs. 6-9; Col. 10 [Lines 54-63], Col. 11 [Line 60] - Col. 12 [Line 28]; a first graph (Fig. 6) is generated (i.e. output data element) and refined (i.e. transformed) based on user input)
	Regarding claim 8, while worded slightly differently, is rejected under the same rationale as claim 1. Guan further teaches a non-transitory computer memory (Fig. 1 #106) and processor (Fig. 1 #104).
Regarding claim 12, while worded slightly differently, is rejected under the same rationale as claim 5.
Regarding claim 15, while worded slightly differently, is rejected under the same rationale as claim 8.
Regarding claim 19, while worded slightly differently, is rejected under the same rationale as claim 5.
Regarding claim 21, Guan and Kozina teach claim 1 as shown above, and Guan further teaches
	The method of claim 1, further comprising querying, by the at least one processor, the at least one knowledge graph by traversing the plurality of interconnected the nodes and edges in the graph database in response to an input query for at least one searched data element. (Col. 17 [Lines 51-67] nodes with similar semantic correlations are searched (i.e. queried) and provided to the user)
Regarding claim 22, while worded slightly differently, is rejected under the same rationale as claim 21.
Regarding claim 23, while worded slightly differently, is rejected under the same rationale as claim 21.



Claim Rejections - 35 USC § 103
In the event the determination of the status of the application as subject to AIA  35 U.S.C. 102 and 103 (or as subject to pre-AIA  35 U.S.C. 102 and 103) is incorrect, any correction of the statutory basis (i.e., changing from AIA  to pre-AIA ) for the rejection will not be considered a new ground of rejection if the prior art relied upon, and the rationale supporting the rejection, would be the same under either status.  
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claim(s) 2-3, 9-10 and 16-17 is/are rejected under 35 U.S.C. 103 as being unpatentable over Guan et al. (US Pat. 11,811,626) in view of Kozina et al. (US Pub. 2014/0114907), and further in view of Goodsitt et al. (US Pat. 11,030,526).
Regarding claim(s) 2, Guan and Kozina do not explicitly teach 
The computer-implemented method according to claim 1, wherein the statistical relationship determined by the element-level mapping module is at least in part based on a Pearson correlation. 
	However, from the same field, Goodsitt teaches
	The computer-implemented method according to claim 1, wherein the statistical relationship determined by the element-level mapping module is at least in part based on a Pearson correlation. (Col. 6 [Lines 33-43] the threshold of intercorrelation of datasets is based on a Pearson correlation of 0.6)
	It would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to combine the intercorrelation of Goodsitt into the correlation calculations of Guan. The motivation for this combination would have been to improve the intercorrelations between synthetic datasets as explained in (Col. 9 [Lines 15-16]). 
	Regarding claim(s) 3, Guan and Kozina do not explicitly teach 
The computer-implemented method according to claim 1, further comprising utilizing a correlation measurement model is based on a Pearson correlation. 
	However, from the same field, Goodsitt teaches
	The computer-implemented method according to claim 1, wherein the correlation measurement model is at least in part based on a Pearson correlation. (Col. 6 [Lines 33-43] the threshold of intercorrelation of datasets is based on a Pearson correlation of 0.6)
	It would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to combine the intercorrelation of Goodsitt into the correlation calculations of Guan. The motivation for this combination would have been to improve the intercorrelations between synthetic datasets as explained in (Col. 9 [Lines 15-16]). 
Regarding claim 9, while worded slightly differently, is rejected under the same rationale as claim 2.
Regarding claim 10, while worded slightly differently, is rejected under the same rationale as claim 3.
Regarding claim 16, while worded slightly differently, is rejected under the same rationale as claim 2.
Regarding claim 17, while worded slightly differently, is rejected under the same rationale as claim 3.


Claim(s) 4, 6, 11, 13 and 18 is/are rejected under 35 U.S.C. 103 as being unpatentable over Guan et al. (US Pat. 11,811,626) in view of Kozina et al. (US Pub. 2014/0114907) and further in view of Leach et al. (US Pat. 11,526,261).
Regarding claim(s) 4, Guan and Kozina do not explicitly teach 
The computer-implemented method according to claim 1, wherein confidence intervals of a correlation measurement model determines at least in part the correlation measurement model.
	However, from the same field, Leach teaches
	The computer-implemented method according to claim 1, wherein confidence intervals of a correlation measurement model determines at least in part the correlation measurement model. (Col. 34 [Lines 2-26] the statistical processing techniques include confidence intervals)
	It would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to combine the statistical processing techniques of Leach into the correlation calculations of Guan. The motivation for this combination would have been to improve the interpretability or appearance of graphs as explained in Leach (Col. 14 [Lines 63-67]). 
	Regarding claim(s) 6, Guan and Kozina do not explicitly teach 
The computer-implemented method according to claim 1, wherein sorting by the at least one processor of the nodes and edges determined by the correlation model is based on timestamp information of the data elements. 
	However, from the same field, Leach teaches
	The computer-implemented method according to claim 1, wherein sorting by the at least one processor of the nodes and edges determined by the correlation model is based on timestamp information of the data elements. (Col. 4 Lines [25-35] a user defined date range (i.e. nodes and edges determined by timestamp) is used as a data filter on the input data)
	It would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to combine the statistical processing techniques of Leach into the correlation calculations of Guan. The motivation for this combination would have been to improve the interpretability or appearance of graphs as explained in Leach (Col. 14 [Lines 63-67]). 
Regarding claim 11, while worded slightly differently, is rejected under the same rationale as claim 4.
Regarding claim 13, while worded slightly differently, is rejected under the same rationale as claim 6.
Regarding claim 18, while worded slightly differently, is rejected under the same rationale as claim 4.

Conclusion
Applicant's amendment necessitated the new ground(s) of rejection presented in this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP § 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 CFR 1.136(a).
A shortened statutory period for reply to this final action is set to expire THREE MONTHS from the mailing date of this action. In the event a first reply is filed within TWO MONTHS of the mailing date of this final action and the advisory action is not mailed until after the end of the THREE-MONTH shortened statutory period, then the shortened statutory period will expire on the date the advisory action is mailed, and any nonprovisional extension fee (37 CFR 1.17(a)) pursuant to 37 CFR 1.136(a) will be calculated from the mailing date of the advisory action. In no event, however, will the statutory period for reply expire later than SIX MONTHS from the mailing date of this final action.
Any inquiry concerning this communication or earlier communications from the examiner should be directed to J MITCHELL CURRAN whose telephone number is (469)295-9081. The examiner can normally be reached M-F 8:00am - 5:00pm.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Sherief Badawi can be reached on (571) 272-9782. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/J MITCHELL CURRAN/Examiner, Art Unit 2161      


/SHERIEF BADAWI/Supervisory Patent Examiner, Art Unit 2169

Read full office action

Prosecution Timeline

Show 4 earlier events

Sep 02, 2025

Response after Non-Final Action

Sep 11, 2025

Request for Continued Examination

Sep 24, 2025

Response after Non-Final Action

Oct 01, 2025

Non-Final Rejection mailed — §103

Dec 31, 2025

Response Filed

Feb 13, 2026

Final Rejection mailed — §103

May 13, 2026

Request for Continued Examination

May 18, 2026

Response after Non-Final Action

Precedent Cases

Applications granted by this same examiner with similar technology

18/585,465

Patent 12639327

BUILT-IN ANALYTICS FOR DATABASE MANAGEMENT

2y 3m to grant Granted May 26, 2026

18/526,692

Patent 12619643

DATA DISPLAY METHOD, DEVICE, COMPUTER APPARATUS AND SYSTEM

2y 5m to grant Granted May 05, 2026

18/019,736

Patent 12596692

Source Scoring for Entity Representation Systems

3y 2m to grant Granted Apr 07, 2026

17/639,164

Patent 12566770

Centralized Knowledge Repository and Data Mining System

4y 0m to grant Granted Mar 03, 2026

17/742,161

Patent 12554743

SYSTEMS AND METHODS FOR ELECTRONIC DATA RECORD SYNCHRONIZATION

3y 9m to grant Granted Feb 17, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

Strategy Recommendation AI-generated — please review before filing

Get a prosecution strategy drawn from examiner precedents, rejection analysis, and claim mapping.

Typically takes 5-10 seconds — AI-generated, attorney review required before filing

Prosecution Projections

5-6

Expected OA Rounds

62%

Grant Probability

98%

With Interview (+35.5%)

3y 2m (~11m remaining)

Median Time to Grant

High

PTA Risk

Based on 109 resolved cases by this examiner. Grant probability derived from career allowance rate.

COMPUTER-BASED SYSTEMS CONFIGURED TO DETERMINE ELEMENT-LEVEL DATA LINEAGE AND METHODS OF USE THEREOF

This examiner grants 62% of cases after interview

Examiner Intelligence

Statute-Specific Performance

Office Action

Prosecution Timeline

Precedent Cases

Applications granted by this same examiner with similar technology

Strategy Recommendation AI-generated — please review before filing

Prosecution Projections

Ready to respond to this office action?

Sign in with your work email