Data Processing
Types of data processing systems
Manual collection of data
Presented By:
(63-m-20)
(58-m-20)
(60-m-20)
(59-m-20)
(80-m-20)
(67-m-20)
(70-m-20)
Data Processing
Data processing is the method of collecting raw data and
translating it into usable information.
It is usually performed in a step-by-step process by a team
of data scientists and data engineers in an organization.
Raw data is the input that goes into some sort of processing
to generate meaningful output.
The raw data is collected, filtered, sorted, processed,
analyzed, stored, and then presented in a readable format.
In other words, it means the conversion of raw data into
meaningful and machine-readable content.
By converting the data into readable formats like graphs,
charts, and documents people can understand and use the
data.
Data Processing The data processing cycle
consists of a series of steps
Cycle where raw data (input) is fed into
a system to produce actionable
insights (output).
Each step is taken in a specific
order, but the entire process is
repeated in a cyclic manner.
The first data processing cycle's
output can be stored and fed as
the input for the next cycle.
Steps of data processing cycle
Step 1: Collection
The collection of data is first step in data processing cycle. The
type of raw data collected has a huge impact on the output
produced. Hence, raw data should be gathered from defined and
accurate sources so that the subsequent findings are valid and
usable. Raw data can include monetary figures, website cookies,
profit/loss statements of a company, user behavior, etc.
Step 2: Preparation
Data preparation or data cleaning is the process of sorting
and filtering the raw data to remove unnecessary and
inaccurate data. Raw data is checked for errors, duplication,
miscalculations or missing data, and transformed into a
suitable form for further analysis and processing. This is done
to ensure that only the highest quality data is fed into the
processing unit.
The purpose of this step to remove bad data (redundant, incomplete, or
incorrect data) so as to begin assembling high-quality information so
that it can be used in the best possible way for business intelligence.
Step 3: Input
In this step, the raw data is converted into machine readable form and
fed into the processing unit. This can be in the form of data entry
through a keyboard, scanner or any other input source.
Step 4: Data Processing
In this step, the raw data is subjected to various data processing
methods using machine learning and artificial intelligence algorithms to
generate a desirable output.
This step may vary slightly from process to process
depending on the source of data being processed (data takes,
online databases, connected devices, etc.) and the intended
use of the output.
Step 5: Output
The data is finally transmitted and displayed to the user in a
readable form like graphs, tables, vector files, audio, video,
documents, etc. This output can be stored and further
processed in the next data processing cycle.
Step 6: Storage
The last step of the data processing cycle is storage, where data
and metadata are stored for further use. This allows for quick
access and retrieval of information whenever needed, and also
allows it to be used as input in the next data processing cycle
directly.
Types of Data Processing Systems
Batch Processing
Real-Time Processing
Online Processing
Multiprocessing
Time-sharing Processing
Sr. Name of Processing DEFINITION Example
no. System
1 Batch Process Data is collected and Payroll System
processed in batches. Used
for large amounts of data.
2 Real-Time Processing Data is processed within Withdrawal of
seconds when the input is money from ATM
given. Used for small machine
amounts of data.
3 Online Processing Data is automatically fed Barcode
into the CPU as soon as it Scanning
becomes available. Used
for continuous processing
Sr. Name of Processing DEFINITION Example
no. System
4 Multiprocessing Data is broken down into Weather
frames and processed Forecast
using two or more CPUs
within a single computer
system. Also known as
parallel processing.
5 Time-share Processing Allocates computer Many users
resources and data in time connected to a
slots to several users simple data
simultaneously. source
Data Processing Methods
[Link] Data Processing
This data processing method is handled manually. The entire process of
data collection, filtering, sorting, calculation, and other logical operations
are all done with human intervention and without the use of any other
electronic device or automation software.
It is a low-cost method and requires little to no tools, but produces high
errors, high labor costs, and lots of time.
In this method, calculations are done manually with a pen and paper.
[Link] Data Processing
Data is processed with modern technologies using data
processing software and programs. A set of instructions is given to
the software to process the data and yield output. This method is
the most expensive but provides the fastest processing speeds
with the highest reliability and accuracy of output.
Electronic data processing (EDP) refers to the gathering of data
using electronic devices, such as computers, servers or laptops.
[Link] Data Processing
Data is processed mechanically through the use of devices and
machines. These can include simple devices such as calculators,
typewriters, printing press, etc. Simple data processing operations
can be achieved with this method. It has much lesser errors than
manual data processing, but the increase of data has made this
method more complex and difficult.
In Mechanical Data Processing, data is processed without human
intervention using machines and computers to automate the
process.
Manual Collection of Data
Data collection is done manually with hand.
Seek out databases manually.
Download and upload of data is done manually.
Manual Collection of data is done as;
Data is manually collected and moved from one
place to another.
Without using a machine or any sort of tool,
data processing can be done by hand.
Given the manual work involved and high need
for attention to detail, errors are likely to occur.
Additionally, manual data processing is time
consuming.
Naturally, this is generally the case when budgets are
low and manual data processing is the most affordable
method.
It’s useful to note that as technology progresses, the
economies of scale have made it more attainable and
accessible for all.
Manually collected data is information that is collected
manually typically with pen and paper.
Manual data collection occurs when people, seek out
data sources, download datasets, and manually upload
files to data analysis platform they use.
Merits & Demerits of Manual Data
Collection
MERITS
Inputting human sensible data.
Less expensive to set up.
Low risk of corrupted data.
Less risk of data loss, if data saved in fire proof environment.
May be easier in correcting errors.
Data is always accessible.
DEMERITS
It is a time consuming process.
It is not reliable as errors can occur.
Manual data collection can handicap us when collecting data.