Treasury RFI: EDCMO IRS Digitization Technology Pilot OCR

Notice ID: 5000123185

High-Level Summary/Future State

“This RFI is associated with a Pilot IRS Enterprise Digitalization and Case Management Office (EDCMO) program. The IRS EDCMO office seeks information on technologies capable of extracting machine-readable data from existing low-resolution digital images and include both structured and unstructured data (although our initial focus will be forms/structured data, there is also handwritten and typed information in unstructured formats). We are primarily interested in solutions that:

  1. Extract machine-readable data out of low-resolution (120 DPI and below) digital images, with high levels of accuracy and speed, and low levels of manual correction/activity (i.e., results in the use of this information by government personnel with limited manual input or effort).
  2. Demonstrate flexibility and adaptability to extract machine-readable data from different forms with different structures (or the same form with different structures based on different versions/years), be able to improve accuracy and speed based on previous images, and provide a search capability across different images and data sources.
  3. Interface and are compliant with IRS systems, cybersecurity requirements, hardware and software, etc. Interfaces and schemas included in potential solutions would need to be approved for use by the IRS Chief Information Officer.”

“We are anticipating solutions that leverage Optical Character Recognition (OCR) or Intelligent Character Recognition (ICR), to include aspects of machine learning and neural networks. Because the initial users of this solution will be working to identify trends across multiple sets of information, we also anticipate a solution with a robust and configurable search capability.”

Background/Current State

“The IRS has many existing digital images that are at a level of resolution (120 dots per inch (DPI) and lower) that creates difficulty in extracting machine-readable data. These low-resolution images reside within multiple systems, and are occasionally available, for a shorter period, at higher resolution. Ultimately, some of these images are saved at lower resolution due to the nature of legacy systems within which they are stored. As much of the information stored within these systems is sensitive, the IRS will use less sensitive or publicly available forms/data to confirm the efficacy of a proposed solution before deciding whether to pursue it. Because the images are contained within legacy systems, interoperability with those systems and other IRS requirements (i.e., cybersecurity) will also be a primary determinant of whether a use case will be scaled.”

“The IRS is committed to creating an environment where IRS data is available, accessible, and usable in a format that enables data-driven decision-making at all levels of the IRS organization. These efforts will support improvements to taxpayer service, enhance the fairness of our compliance efforts, address federal guidelines (e.g., Office of Management and Budget (OMB) M-19-21, NARA 2022 mandate), and reduce teleworking challenges that have emerged as a result of the COVID-19 pandemic…”

Read more here.

0
Tags:

This topic contains 0 replies, has 1 voice, and was last updated by  Jackie Gilbert 4 months, 1 week ago.

  • Author
    Posts
  • #127935

    Replies viewable by members only

    0

You must be logged in to reply to this topic.

CONTACT US

Questions?. Send us an email and we'll get back to you, asap.

Sending

©2021 MileMarker10, LLC all rights reserved | Community and Member Guidelines | Privacy Policy | About G2Xchange FedCiv

Opportunities. Starting Points.

About our Data

The Vault is a listing of expiring contracts, task orders, etc. within a certain set of parameters, to include:

  • Have an initial total estimated contract value of $10 million or above
  • Federal Civilian Only – DHS, Transportation, Justice, Labor, Interior, Commerce, Energy, State, and Treasury Actions
  • NAICS codes include: 511210, 518210, 519130, 519190, 541511,
    541512, 
    541513, 541519, 541611, 541618,
    541690, 541720, 541990
  • Were modified within the last 12 calendar months
  • The data represented is based on information provided by the government

Who has access? Please note that ALL G2Xchange FedCiv Members will receive access to all basic and much of the advanced data. G2Xchange FedCiv Corporate Members will receive access to ALL Vault content (basic and advanced).

Feedback/Suggestions? Contact us at Vault@G2Xchange.com and let us know what you think. 

G2Xchange FedCiv

Log in with your credentials for G2Xchange FedCiv

Forgot your details?