Document Image Analysis

The document discusses a project for document image analysis using deep computer vision and optical character recognition. The goal is to process images of documents like certificates and IDs to extract key information in a format like JSON or XML. This will include recognizing text and graphics to extract data in the same way a human would. The project will create a web portal to submit documents and display results, with an edit option to correct errors, and will also create an API. The project will be evaluated based on modular code, use of datasets, cloud deployment, APIs, logging, and optimization.

Uploaded by

Sitansu Sekhar Mohanty

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

63 views

Document Image Analysis

Uploaded by

Sitansu Sekhar Mohanty

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

Project Title Document Image Analysis

Technologies Deep Computer Vision (OCR)

Domain Banking
Project Difficulties level Basic

Problem Statement:
Documents are an important aspect of many enterprises in a variety of sectors,
including law, finance, and technology. Automatic document understanding, such as
invoices, contracts, and resumes, is lucrative, offering up a slew of new business
opportunities.
Over the last four decades, there has been a lot of research into document image
processing and comprehension. Work in the field has been applied in a variety of
domains, including office automation, forensics, and digital libraries, and includes
preprocessing, physical and logical layout analysis, optical and intelligent character
recognition (OCR/ICR), graphics analysis, form processing, signature verification, and
writer identification. There are several decent document processing and analysis
options available;
The goal of document image analysis is to recognize text and graphics components in
images and extract the desired information in the same way as a human would.

The main objective is-

1. Process the images of documents like certificates, ID (Driving license, AADHAR
Card, PAN Card)
2. Extract data at the end and store in a certain format such as JSON, XML, and so on.
3. Following figure shows the basic diagram of how an information extraction system
should work. Our goal is to extract the key information from the documents.

1
Click here to enter text.
4. Create a web portal to submit the document and display the results on the screen.
5. Keep an edit option as well on the screen to avoid any mistake done by system.
6. Create an API as well.

Dataset:
You have to collect your dataset for this project for the Indian continent, and based on
that, you have to design your solution and create a repo for the dataset.
Or use the following dataset to create an initial app - Dataset

Project Evaluation metrics:

Code:
• You are supposed to write a code in a modular fashion

2
Click here to enter text.
• Safe: It can be used without causing harm.
• Testable: It can be tested at the code level.
• Maintainable: It can be maintained, even as your codebase grows.
• Portable: It works the same in every environment (operating system)
• You have to maintain your code on GitHub.
• You have to keep your GitHub repo public so that anyone can check your code.
• Proper readme file you have to maintain for any project development.
• You should include basic workflow and execution of the entire project in the readme
file on GitHub
• Follow the coding standards: https://www.python.org/dev/peps/pep-0008/

Database:
• You are supposed to use a given dataset for this project which is a Cassandra
database.
• https://astra.dev/ineuron

Cloud:
• You can use any cloud platform for this entire solution hosting like AWS, Azure or
GCP
API Details or User Interface:
• You have to expose your complete solution as an API or try to create a user
interface for your model testing. Anything will be fine for us.
Logging:
• Logging is a must for every action performed by your code use the python logging
library for this.
Ops Pipeline:
• If possible, you can try to use AI ops pipeline for project delivery Ex. DVC, MLflow
, Sagemaker , Azure machine learning studio, Jenkins, Circle CI, Azure DevOps ,
TFX, Travis CI

Deployment:
• You can host your model in the cloud platform, edge devices, or maybe local, but
with a proper justification of your system design.
Solutions Design:
• You have to submit complete solution design strategies in HLD and LLD document
3
Click here to enter text.
System Architecture:
• You have to submit a system architecture design in your wireframe document and
architecture document.
Latency for model response:
• You have to measure the response time of your model for a particular input of a
dataset.
Optimization of solutions:
• Try to optimize your solution on code level, architecture level and mention all of
these things in your final submission.
• Mention your test cases for your project.

Submission requirements:

High-level Document:

4
Click here to enter text.
You have to create a high-level document design for your project. You can reference the
HLD form below the link.
Sample link:
HLD Document Link

Low-level document:
You have to create a Low-level document design for your project; you can refer to the LLD
from the below link.
Sample link
LLD Document Link

Architecture: You have to create an Architecture document design for your project;
you can refer to the Architecture from the below link.
Sample link
Architecture sample link

Wireframe: You have to create a Wireframe document design for your project; refer to
the Wireframe from the below link.
Demo link
Wireframe Document Link

Project code:
You have to submit your code GitHub repo in your dashboard when the final submission
of your project.
Demo link
Project code sample link :

Detail project report:

You have to create a detailed project report and submit that document as per the given
sample.

5
Click here to enter text.
Demo link
DPR sample link

Project demo video:

You have to record a project demo video for at least 5 Minutes and submit that link as per
the given demo.
Demo link
Project sample link :

The project LinkedIn a post:

You have to post your project detail on LinkedIn and submit that post link in your
dashboard in your respective field.
Demo link
Linkedin post sample link :

6
Click here to enter text.

S4F72 - EN - Col17 Contracts and Conditions in SAP Contract and Lease Management For SAP S4HANA
No ratings yet
S4F72 - EN - Col17 Contracts and Conditions in SAP Contract and Lease Management For SAP S4HANA
162 pages
Ibt Sa Final
100% (1)
Ibt Sa Final
3 pages
Problem Statement:: Project Title Technologies Domain Project Difficulties Level
No ratings yet
Problem Statement:: Project Title Technologies Domain Project Difficulties Level
4 pages
Catering Reserving and Ordering System-Mern
100% (1)
Catering Reserving and Ordering System-Mern
5 pages
Deloitte Case Study
No ratings yet
Deloitte Case Study
4 pages
Question Answering Systems For Customer Relationship Management
No ratings yet
Question Answering Systems For Customer Relationship Management
6 pages
Investment Predictions
No ratings yet
Investment Predictions
5 pages
Online Assignment Plagiarism Check
No ratings yet
Online Assignment Plagiarism Check
5 pages
Google Analytics Customer Revenue Prediction
No ratings yet
Google Analytics Customer Revenue Prediction
5 pages
Investment Predictions
No ratings yet
Investment Predictions
5 pages
Computer Accessories (Web Application) WEb
No ratings yet
Computer Accessories (Web Application) WEb
5 pages
Market Basket Project On E-Commerce
No ratings yet
Market Basket Project On E-Commerce
5 pages
Freeform Text Generation for Content Creators
No ratings yet
Freeform Text Generation for Content Creators
6 pages
Flight Fare Prediction
No ratings yet
Flight Fare Prediction
5 pages
Phishing Domain Detection - Updated
No ratings yet
Phishing Domain Detection - Updated
5 pages
Bug Resolution Application Management Web Application
No ratings yet
Bug Resolution Application Management Web Application
6 pages
Online Doctor Visit Appointment (Web Application)
No ratings yet
Online Doctor Visit Appointment (Web Application)
5 pages
Project - Restaurant Rating Prediction: Problem Statement
No ratings yet
Project - Restaurant Rating Prediction: Problem Statement
3 pages
Automated ML
No ratings yet
Automated ML
4 pages
Social Media Web Application
No ratings yet
Social Media Web Application
5 pages
Leave Management System Web Application
No ratings yet
Leave Management System Web Application
6 pages
Drug Activity Prediction - Updated
No ratings yet
Drug Activity Prediction - Updated
5 pages
AI Recruit (2)
No ratings yet
AI Recruit (2)
7 pages
Online Vehicle Rental Management System-Mern
No ratings yet
Online Vehicle Rental Management System-Mern
5 pages
Healthcare Analytics
No ratings yet
Healthcare Analytics
4 pages
Healthcare Data Analysis
No ratings yet
Healthcare Data Analysis
4 pages
T DEV 600 Redditech
No ratings yet
T DEV 600 Redditech
9 pages
Consumer Complaint Analysis (AIOPS PROJECT)
No ratings yet
Consumer Complaint Analysis (AIOPS PROJECT)
4 pages
VerveBridge Machine Learning Book Recommendation System Task 1
No ratings yet
VerveBridge Machine Learning Book Recommendation System Task 1
3 pages
Diet and Workout Assistant (DWA) - 2
No ratings yet
Diet and Workout Assistant (DWA) - 2
6 pages
Backend Challenge
No ratings yet
Backend Challenge
7 pages
Backend Challenge
No ratings yet
Backend Challenge
6 pages
Minh Huyen
No ratings yet
Minh Huyen
6 pages
Analyze Debt Statistics
No ratings yet
Analyze Debt Statistics
4 pages
Leadzen React + Js Assignment
No ratings yet
Leadzen React + Js Assignment
3 pages
Sample Resume Format: Overview
No ratings yet
Sample Resume Format: Overview
6 pages
Experiment No 1
No ratings yet
Experiment No 1
6 pages
Full Stack Hiring Task - BTS v2
No ratings yet
Full Stack Hiring Task - BTS v2
3 pages
Veerraju Palacharla (PY Project)
No ratings yet
Veerraju Palacharla (PY Project)
11 pages
Technology Assessment - Mobile - Java
No ratings yet
Technology Assessment - Mobile - Java
2 pages
Carrier Perffer by ChatGPT
No ratings yet
Carrier Perffer by ChatGPT
10 pages
Senior Software Engineer Test
No ratings yet
Senior Software Engineer Test
3 pages
CSC 210 Final Project
No ratings yet
CSC 210 Final Project
6 pages
Datagrokr Internship Technical Assignment - 20201017
No ratings yet
Datagrokr Internship Technical Assignment - 20201017
3 pages
Compatibility Test For Frontend Developers PDF
No ratings yet
Compatibility Test For Frontend Developers PDF
3 pages
rahulpancard
No ratings yet
rahulpancard
4 pages
Backend Developer Roadmap 2025
No ratings yet
Backend Developer Roadmap 2025
8 pages
3rd Year Project Report
No ratings yet
3rd Year Project Report
34 pages
Soil Farming Agent.docx
No ratings yet
Soil Farming Agent.docx
3 pages
DemoUpCliplister Coding Challenge Backend (1)
No ratings yet
DemoUpCliplister Coding Challenge Backend (1)
2 pages
Capstone Project
No ratings yet
Capstone Project
3 pages
Rules and Regulations for Documentation
No ratings yet
Rules and Regulations for Documentation
4 pages
Project Bank: Visit For Complete Career and Job Resources
No ratings yet
Project Bank: Visit For Complete Career and Job Resources
8 pages
3rd Year Project Report
No ratings yet
3rd Year Project Report
36 pages
1Z0-1110-2024 Dumps (Updated Version) (1)
No ratings yet
1Z0-1110-2024 Dumps (Updated Version) (1)
14 pages
System Implementation - Coding
No ratings yet
System Implementation - Coding
55 pages
Job Description - Frontend Web Developer
No ratings yet
Job Description - Frontend Web Developer
3 pages
Abhijeet Mohan Bedagkar
No ratings yet
Abhijeet Mohan Bedagkar
3 pages
AssignmentInformation-ConvolutionEncoder
No ratings yet
AssignmentInformation-ConvolutionEncoder
2 pages
Handbook For Technical Recruitment
No ratings yet
Handbook For Technical Recruitment
45 pages
CodeIgniter 1.7
From Everand
CodeIgniter 1.7
David Upton
No ratings yet
Visual Basic 2010 Coding Briefs Data Access
From Everand
Visual Basic 2010 Coding Briefs Data Access
Kevin Hough
5/5 (1)
2020-21 Series Test 1 QP
No ratings yet
2020-21 Series Test 1 QP
1 page
Unitrust Bank Update Who and When
No ratings yet
Unitrust Bank Update Who and When
1 page
Combination of Subjects at SSC AND HSSC LEVEL
No ratings yet
Combination of Subjects at SSC AND HSSC LEVEL
4 pages
Company Master Data: Charges
No ratings yet
Company Master Data: Charges
1 page
True & False - Scanner
No ratings yet
True & False - Scanner
88 pages
UPO
No ratings yet
UPO
3 pages
CIVIL PROCEDURE 14TH MARCH 2025
No ratings yet
CIVIL PROCEDURE 14TH MARCH 2025
8 pages
Strict or Liberal Construction
100% (1)
Strict or Liberal Construction
8 pages
Asr CSMT Exp Third Ac (3A)
No ratings yet
Asr CSMT Exp Third Ac (3A)
2 pages
Criminal Reports 2015 B PDF
No ratings yet
Criminal Reports 2015 B PDF
13 pages
Mobile Home Park Investing
No ratings yet
Mobile Home Park Investing
56 pages
Revolution and Nationalism
No ratings yet
Revolution and Nationalism
9 pages
CAIE-A2 Level-Law - Law of Contract
No ratings yet
CAIE-A2 Level-Law - Law of Contract
13 pages
Attorney-General V (Danhai) Williams (1997) 51 WIR 264
No ratings yet
Attorney-General V (Danhai) Williams (1997) 51 WIR 264
14 pages
WT7520 PDF, WT7520 Hoja de Datos - Weltrend Semiconductor DatsheetQ ..
No ratings yet
WT7520 PDF, WT7520 Hoja de Datos - Weltrend Semiconductor DatsheetQ ..
2 pages
Megillah 17
No ratings yet
Megillah 17
71 pages
OIG Report On Allegations by Bureau of Alcohol Tobacco and Firearms
No ratings yet
OIG Report On Allegations by Bureau of Alcohol Tobacco and Firearms
21 pages
MEC411 2011 - 09 Test 1
No ratings yet
MEC411 2011 - 09 Test 1
2 pages
Ins 21
No ratings yet
Ins 21
77 pages
WOSM Constitution en
No ratings yet
WOSM Constitution en
19 pages
Nit - Lit Brochure 2024 Narula Institute of Technology
No ratings yet
Nit - Lit Brochure 2024 Narula Institute of Technology
2 pages
Oraset 11g
No ratings yet
Oraset 11g
10 pages
Prepaid Instruments License
No ratings yet
Prepaid Instruments License
6 pages
SOM 2 Marks
No ratings yet
SOM 2 Marks
8 pages
Heinrich10m3 Public Release Memo
No ratings yet
Heinrich10m3 Public Release Memo
2 pages
United States v. Thomas Herbert McIlvain, 967 F.2d 1479, 10th Cir. (1992)
No ratings yet
United States v. Thomas Herbert McIlvain, 967 F.2d 1479, 10th Cir. (1992)
5 pages
Data Protection in The Practical Context Strategies and Techniques 1st Edition Hannah Yeefen Lim Ebook All Chapters PDF
100% (6)
Data Protection in The Practical Context Strategies and Techniques 1st Edition Hannah Yeefen Lim Ebook All Chapters PDF
62 pages
5G Architecture Model & Concepts PDF (2019)
No ratings yet
5G Architecture Model & Concepts PDF (2019)
108 pages