0% found this document useful (0 votes)
556 views4 pages

Sih Idea

The team aims to develop unified software to extract data from various sources like images, PDFs, text, and documents. The software will use tools like import.io, selenium, and Hadoop for big data extraction and provide graphical representations of analyzed data. It will also feature integrated keyword searching of a locally stored database. The team is from Rajalakshmi Institute of Technology and their problem addresses bulk evidence collection, deleted data recovery, large-scale data analysis and comparison for the Madhya Pradesh Police.

Uploaded by

Viswesh S
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
556 views4 pages

Sih Idea

The team aims to develop unified software to extract data from various sources like images, PDFs, text, and documents. The software will use tools like import.io, selenium, and Hadoop for big data extraction and provide graphical representations of analyzed data. It will also feature integrated keyword searching of a locally stored database. The team is from Rajalakshmi Institute of Technology and their problem addresses bulk evidence collection, deleted data recovery, large-scale data analysis and comparison for the Madhya Pradesh Police.

Uploaded by

Viswesh S
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 4

Basic Details of the Team and

Problem Statement
Organization Name: Madhya Pradesh Police.

PS Code: AT980

Problem Statement Title: Big Data Searching.

Team Name: ClassyCoders

Team Leader Name: VISWESH S

Institute Code (AISHE): 2117

Institute Name: RAJALAKSHMI INSTITUTE OF TECHNOLOGY

Theme Name: Blockchain & Cybersecurity


Idea/Approach Details

⮚ Our objective is to develop an unified software to


extract data from various sources such as images,
pdf, text, documents, etc.
⮚ The software uses various big data extraction tools
such as import.io, selenium and Hadoop.
⮚ Our unique feature is that we are analyzing and
providing various graphical representations of the
data by using data analytics and visualzation
techniques.
⮚ The software also will have an integrated feature to
search keywords by accessing the locally stored
database.
2
Idea/Approach Details
Use cases: Dependencies/Tools used:
⮚ To collect bulk evidence from •NoSQL Database: HBase,MongoDB,
smartphones or laptops of accused. ZooKeeper
⮚ For quick recovery of deleted data. •Storage: HDFS and S3
⮚ To analyze large blocks of data and
compare with existing keywords •Processing: Datameer, BigSheets,
provided in database. Mechanical Turk, R

⮚ For graphical representation of •MapReduce: Hive, Hadoop, S4, Flume,


various big Data. Cascading

•Servers: Heroku, Google App Engine


3
Team Member Details
Team Leader Name: VISWESH S
Branch: BE Stream: CSE Year: II
Team Member 1 Name: TARUN KUMAR S
Branch: BE Stream: CSE Year: II
Team Member 2 Name: SURESH KISHNA A
Branch: BE Stream: CSE Year: II
Team Member 3 Name: YATISHWAR GV
Branch: BE Stream: CSE Year: II
Team Member 4 Name: VENKATA SAI DEEPA A
Branch: BE Stream: CSE Year: II
Team Member 5 Name: THARUN M
Branch: BE Stream: CSE Year: II
Team Mentor 1 Name: Mr. R. Arun Kumar
Category: ACADEMIC Expertise: Blockchain Domain Experience (in years): 2

You might also like