Last updated: April 18, 2026

Application No. 18/695,718

TEXT RECOGNITION METHOD AND APPARATUS, STORAGE MEDIUM AND ELECTRONIC DEVICE

Non-Final OA §101§103§112

Filed

Mar 26, 2024

Examiner

THIRUGNANAM, GANDHI

Art Unit

2672

Tech Center

2600 — Communications

Assignee

BOE TECHNOLOGY GROUP CO., LTD.

OA Round

1 (Non-Final)

Interview Optional

— +12.3% interview lift. This examiner has a relatively high allow rate; a written response may suffice.

Based on 559 resolved cases, 2023–2026

Examiner Intelligence

THIRUGNANAM, GANDHI View full profile →

Grants 74% — above average

Career Allow Rate

413 granted / 559 resolved

+11.9% vs TC avg

Moderate +12% lift

Without

With

+12.3%

Interview Lift

resolved cases with interview

Typical timeline

3y 7m

Avg Prosecution

42 currently pending

Career history

601

Total Applications

across all art units

Statute-Specific Performance

§101

9.6%

-30.4% vs TC avg

§103

35.8%

-4.2% vs TC avg

§102

21.5%

-18.5% vs TC avg

§112

27.1%

-12.9% vs TC avg

Black line = Tech Center average estimate • Based on career data from 559 resolved cases

Office Action

§101 §103 §112

DETAILED ACTION
Notice of Pre-AIA  or AIA  Status
The present application, filed on or after March 16, 2013, is being examined under the first inventor to file provisions of the AIA .

Claim Rejections - 35 USC § 101
35 U.S.C. 101 reads as follows:
Whoever invents or discovers any new and useful process, machine, manufacture, or composition of matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the conditions and requirements of this title.


Claim 19 is rejected under 35 U.S.C. 101 because the claimed invention(s) is/are directed to non-statutory subject matter. The claim(s) does/do not fall within at least one of the four categories of patent eligible subject matter because the claim recites “non-volatile computer-readable storage medium” and 1. the BRI of “computer-readable storage medium” encompasses transitory signals and waveforms and 2. the specification does not expressly exclude transitory signals and waveforms from the interpretation of “computer-readable storage medium.” Since transitory signals and waveforms do not fall within one of the four statutory categories of invention, the claim is non-statutory. A claim drawn to such a computer readable storage medium that covers both transitory and non-transitory embodiments may be amended to narrow the claim to cover only statutory embodiments to avoid a rejection under 35 U.S.C. § 101 by adding the limitation “non-transitory,” such as “non-transitory computer-readable storage medium,” to the claim.

Claim Rejections - 35 USC § 112
The following is a quotation of 35 U.S.C. 112(b):
(b)  CONCLUSION.—The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the subject matter which the inventor or a joint inventor regards as the invention.


The following is a quotation of 35 U.S.C. 112 (pre-AIA ), second paragraph:
The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the subject matter which the applicant regards as his invention.


Claim 3-8,22-23 and 25 are rejected under 35 U.S.C. 112(b) or 35 U.S.C. 112 (pre-AIA ), second paragraph, as being indefinite for failing to particularly point out and distinctly claim the subject matter which the inventor or a joint inventor (or for applications subject to pre-AIA  35 U.S.C. 112, the applicant), regards as the invention.
Claim 3 and 22 recite “short-circuiting”.  It is not clear what Applicant means.   Short-circuiting does not appear to be a term in the art of neural networks.
Claim 6 and 25 recites “a XXX direction perceptual map” followed by “the XXX directional perceptual map”, where XXX is first and second.  These claim limitations lack antecedent basis.
	Claims 4,7-8 and 23 are rejected as dependent upon a rejected claim. 

Claim Rejections - 35 USC § 103
The following is a quotation of 35 U.S.C. 103 which forms the basis for all obviousness rejections set forth in this Office action:
A patent for a claimed invention may not be obtained, notwithstanding that the claimed invention is not identically disclosed as set forth in section 102, if the differences between the claimed invention and the prior art are such that the claimed invention as a whole would have been obvious before the effective filing date of the claimed invention to a person having ordinary skill in the art to which the claimed invention pertains. Patentability shall not be negated by the manner in which the invention was made.

Claim(s) 1-2,19-21 is/are rejected under 35 U.S.C. 103 as being unpatentable over “Real-time Scene Text Detection with Differentiable Binarization”, hereafter referred to as Liao, in view of “Image compression based on octave convolution and semantic segmentation” hereafter referred to as Liu.
Liao discloses 1.A text recognition method, comprising: 
acquiring 
performing an M-level convolution process on the 
merging the M pairs of target (Liao, Fig. 3 , 
    PNG
    media_image1.png
    410
    640
    media_image1.png
    Greyscale
Concat reads on merging; pairs of  data; Liao takes an image and perform)
determining a probability map and a threshold map of the target image based on the target feature map, and  calculating a binarization map of the target image based on the probability map and the threshold map; and determining a text area in the target image based on the binarization map, and recognizing text information in the text area. (Liao, Fig. 3, 
    PNG
    media_image2.png
    300
    594
    media_image2.png
    Greyscale
; see probablity map, threshold map, binary map and the determined text)

Liao discloses normal convolution, but does not expressly disclose octave convolution, in particular  
“acquiring a first high-frequency feature map and a first low-frequency feature map of a target image; 
performing an M-level convolution process on the first high-frequency feature map and the first low-frequency feature map by M cascaded convolution modules to obtain M pairs of target high-frequency feature map and target low-frequency feature map of the target image, where M is a positive integer; 
merging the M pairs of target high-frequency feature map and target low-frequency feature map to obtain a target feature map of the target image;”
Liu discloses “acquiring a first high-frequency feature map and a first low-frequency feature map of a target image; 
performing an M-level convolution process on the first high-frequency feature map and the first low-frequency feature map by M cascaded convolution modules to obtain M pairs of target high-frequency feature map and target low-frequency feature map of the target image, where M is a positive integer; 
merging the M pairs of target high-frequency feature map and target low-frequency feature map to obtain a target feature map of the target image;”(Liu, pg. 1-2, 
    PNG
    media_image3.png
    232
    342
    media_image3.png
    Greyscale

    PNG
    media_image4.png
    90
    352
    media_image4.png
    Greyscale

    PNG
    media_image5.png
    240
    163
    media_image5.png
    Greyscale
Fig. 1. On the left is octave convolution, where f(·) is vanilla convolution, W is convolution kernel, pool(·) is average pooling, and upsample(·) is an up-sampling using nearest neighbor interpolation.)
It would have been obvious to a person having ordinary skill in the art before the time of the effective filing date of the claimed invention of the instant application to replace the standard convolution of Liao with the octave convolution of Liu.
The suggestion/motivation for doing so would have been to decrease the number of calculations and storage space.
Further, one skilled in the art could have combined the elements as described above by known methods with no change in their respective functions, and the combination would have yielded nothing more than predictable results. 
Therefore, it would have been obvious to combine  Liao with Liu to obtain the invention as specified in claim 1.

Liao in view of Liu discloses 2. The text recognition method according to claim 1, wherein the convolution module performs a convolution process on the first high-frequency feature map and the first low-frequency feature map, and the convolution process comprises: performing a first convolution process on the input first high-frequency feature map to obtain a second high-frequency feature map, and performing an up-sampling convolution process on the input first low-frequency feature map to obtain a second low-frequency feature map; acquiring the target high-frequency feature map based on the second high-frequency feature map and the second low-frequency feature map; performing a second convolution process on the input first low-frequency feature map to obtain a third low-frequency feature map, and performing a down-sampling convolution process on the input first high-frequency feature map to obtain a third high-frequency feature map; and acquiring the target low-frequency feature map based on the third low-frequency feature map and the third high-frequency feature map. (Liu, Fig. 1 
    PNG
    media_image5.png
    240
    163
    media_image5.png
    Greyscale
Fig. 1. On the left is octave convolution, where f(·) is vanilla convolution, W is convolution kernel, pool(·) is average pooling, and upsample(·) is an up-sampling using nearest neighbor interpolation )

Claim 19 is rejected under similar grounds as claim 1.
Claim 20 is rejected under similar grounds as claim 1.
Claim 21 is rejected under similar grounds as claim 2.

Claim(s) 11 is/are rejected under 35 U.S.C. 103 as being unpatentable over Liao in view of Lui in further view of “AMultiplexed Network for End-to-End, Multilingual OCR”, hereafter referred to as Huang.
	Liao in view of Liu discloses 11. The text recognition method according to claim 1,
But doesn’t expressly disclose “wherein the method further comprises: predicting a language in which the target image contains text based on the target feature map; and the recognizing of the text information in the text area comprises: determining a corresponding text recognition model according to the language in which the target image contains the text to recognize the text information in the text area.”
Huang discloses “wherein the method further comprises: predicting a language in which the target image contains text based on the target feature map; and the recognizing of the text information in the text area comprises: determining a corresponding text recognition model according to the language in which the target image contains the text to recognize the text information in the text area.” (Huang, Fig. 1,

    PNG
    media_image6.png
    398
    680
    media_image6.png
    Greyscale
)
It would have been obvious to a person having ordinary skill in the art before the time of the effective filing date of the claimed invention of the instant application to add the language detection and selection of language model of Huang to the system of Liao in view of Liu.
The suggestion/motivation for doing so would have been to provide translations for the users.
Further, one skilled in the art could have combined the elements as described above by known methods with no change in their respective functions, and the combination would have yielded nothing more than predictable results. 
Therefore, it would have been obvious to combine Liao in view of Lui and Huang to obtain the invention as specified in claim 11.

Allowable Subject Matter
Claims 3-10, 12-18, 22-27 would be allowable if rewritten to overcome the rejection(s) under 35 U.S.C. 112(b) or 35 U.S.C. 112 (pre-AIA ), 2nd paragraph, if asserted, as set forth in this Office action and to include all of the limitations of the base claim and any intervening claims.

Conclusion
Any inquiry concerning this communication or earlier communications from the examiner should be directed to GANDHI THIRUGNANAM whose telephone number is (571)270-3261. The examiner can normally be reached M-F 8:30-5PM.
Examiner interviews are available via telephone, in-person, and video conferencing using a USPTO supplied web-based collaboration tool. To schedule an interview, applicant is encouraged to use the USPTO Automated Interview Request (AIR) at http://www.uspto.gov/interviewpractice.
If attempts to reach the examiner by telephone are unsuccessful, the examiner’s supervisor, Sumati Lefkowitz can be reached at 571-272-3638. The fax phone number for the organization where this application or proceeding is assigned is 571-273-8300.
Information regarding the status of published or unpublished applications may be obtained from Patent Center. Unpublished application information in Patent Center is available to registered users. To file and manage patent submissions in Patent Center, visit: https://patentcenter.uspto.gov. Visit https://www.uspto.gov/patents/apply/patent-center for more information about Patent Center and https://www.uspto.gov/patents/docx for information about filing in DOCX format. For additional questions, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a USPTO Customer Service Representative, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000.





/GANDHI THIRUGNANAM/Primary Examiner, Art Unit 2672

Read full office action

Prosecution Timeline

Mar 26, 2024

Application Filed

Mar 25, 2026

Non-Final Rejection — §101, §103, §112 (current)

Precedent Cases

Applications granted by this same examiner with similar technology

18/344,789

Patent 12597135

SYSTEMS AND METHODS FOR UPDATING A GRAPHICAL USER INTERFACE BASED UPON INTRAOPERATIVE IMAGING

2y 5m to grant Granted Apr 07, 2026

17/513,171

Patent 12561963

CROSS-MODALITY NEURAL NETWORK TRANSFORM FOR SEMI-AUTOMATIC MEDICAL IMAGE ANNOTATION

2y 5m to grant Granted Feb 24, 2026

17/655,013

Patent 12555291

METHOD FOR AUTOMATED REGULARIZATION OF HYBRID K-SPACE COMBINATION USING A NOISE ADJUSTMENT SCAN

2y 5m to grant Granted Feb 17, 2026

18/278,568

Patent 12541869

GRAIN FLAKE MEASUREMENT SYSTEM, GRAIN FLAKE MEASUREMENT METHOD, AND GRAIN FLAKE COLLECTION, MOVEMENT, AND MEASUREMENT SYSTEM

2y 5m to grant Granted Feb 03, 2026

18/327,885

Patent 12525007

TRAINING METHOD AND ELECTRONIC DEVICE

2y 5m to grant Granted Jan 13, 2026

Study what changed to get past this examiner. Based on 5 most recent grants.

AI Strategy Recommendation

Get an AI-powered prosecution strategy using examiner precedents, rejection analysis, and claim mapping.

Prosecution Projections

1-2

Expected OA Rounds

74%

Grant Probability

86%

With Interview (+12.3%)

3y 7m

Median Time to Grant

Low

PTA Risk

Based on 559 resolved cases by this examiner. Grant probability derived from career allow rate.