0% found this document useful (0 votes)
112 views6 pages

Fake News Synopsis 1

This project aims to build a website that detects fake news using machine learning. It will use classifiers trained on past news reports to predict the authenticity of new reports. The methodology involves static classification using ML models, dynamic searches of keywords online, and checking URLs against databases of real and fake news sites. Python libraries like scikit-learn will be used to evaluate algorithms and Django for web deployment. The system will have interfaces for static prediction, dynamic keyword searches, and URL verification. It is intended to help users identify false or misleading news stories.

Uploaded by

FuNnY OnE
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
112 views6 pages

Fake News Synopsis 1

This project aims to build a website that detects fake news using machine learning. It will use classifiers trained on past news reports to predict the authenticity of new reports. The methodology involves static classification using ML models, dynamic searches of keywords online, and checking URLs against databases of real and fake news sites. Python libraries like scikit-learn will be used to evaluate algorithms and Django for web deployment. The system will have interfaces for static prediction, dynamic keyword searches, and URL verification. It is intended to help users identify false or misleading news stories.

Uploaded by

FuNnY OnE
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd

KAMLA NEHRU INSTITUTE OF TECHNOLOGY

DEPARTMENT OF COMPUTER SCIENCE AND ENGINEERING

SESSION: 2022-2023

MAJOR PROJECT SYNOPSIS

FAKE NEWS DETECTION USING MACHINE LEARNING

SUBMITTED BY: SUBMITTED TO:

ADITYA KUMAR MAURYA (19605) PROFF. ARVIND TIWARI


ANAND KUMAR YADAV (19612) PROFF. SOHIT SHUKLA
BHUPENDRA BHASHKAR (19621)
Title

We will build a website that will authenticate the news whether it is real news or fake news.

Objective

 The aim of this project is to create a system or model that can use the data of past news reports and predict
the chances of a news report being fake or not.
 The aim is to understand different algorithm and its scope.

Introduction

Fake news detection is a subtask of text classification and is often defined as the task of classifying news as real
or fake. The term ‘fake news’ refers to the false or misleading information that appears as real news. It aims to
deceive or mislead people.
Problem definition

Given a multi-source news dataset and social contexts of news consumers (social media users), the task of fake
news detection is to determine if a news item is fake or real. Formally, we define the problem of fake news
detection as:
 Input: News items, social contexts and associated side information
 Output: One of two labels: ‘fake’ or ‘real’.

Motivation

In the era of news in our lives, it is the people’s responsibility to not to share any misleading information
as there are many sources available now-a-days. The fraud news such as spam messages, funding news or
any false information to be fall out or reach to the people we consider it as a serious issue although it is
extremely complicated to find out which is fraud and which is not a fraud profile or users in social media,
they replicate the information as the original one. As the technology evolved and the machine intelligence
has come into existence everyone tends to use available sources for creating and dissemination of fraud
news. People who are illiterate might be new to digital media as they are inexperienced, so they are the
ones who believe that fraud news easily and makes it practical in their lives. To a minimum, we have
deviled а simple web аррliсаtiоn which statistically detects false information, and also real news.

Related work

 Mykhailo Granik et. al. in their paper shows a simple approach for fake news detection using naive
Bayes classifier. This approach was implemented as a software system and tested against a data set of
Facebook news posts. They were collected from three large Facebook pages each from the right and
from the left, as well as three large mainstream political news pages . They achieved classification
accuracy of approximately 74%. Classification accuracy for fake news is slightly worse. This may be
caused by the skewness of the dataset: only 4.9% of it is fake news
.
 Himank Gupta et. al. gave a framework based on different machine learning approach that deals with
various problems including accuracy shortage, time lag (BotMaker) and high processing time to handle
thousands of tweets in 1 sec. Firstly, they have collected 400,000 tweets from HSpam14 dataset. Then
they further characterize the 150,000 spam tweets and 250,000 non- spam tweets. They also derived
some lightweight features along with the Top-30 words that are providing highest information gain
from Bag-of-Words model. 4. They were able to achieve an accuracy of 91.65% and surpassed the
existing solution by approximately 18%.
 Marco L. Della Vedova et. al. first proposed a novel ML fake news detection method which, by
combining news content and social context features, outperforms existing methods in the literature,
increasing its accuracy up to 78.8%. Second, they implemented their method within a Facebook
Messenger Chabot and validate it with a real-world application, obtaining a fake news detection
accuracy of 81.7%. Their goal was to classify a news item as reliable or fake; they first described the
datasets they used for their test, then presented the content-based approach they implemented and the
method they proposed to combine it with a social-based approach available in the literature. The
resulting dataset is composed of 15,500 posts, coming from 32 pages (14 conspiracy pages, 18 scientific
pages), with more than 2, 300, 00 likes by 900,000+ users. 8,923 (57.6%) posts are hoaxes and 6,577
(42.4%) are non-hoaxes.
Some similar websites are –
 [Link]

Hardware Requirements

Processor – Intel Core i3 or higher version

RAM – minimum 4 GB

Hard disk – 10 GB or higher

System type – 64-bit operating system

Software Requirements

• Operating System – Windows 7 or equivalent, macOS


• Browser – Chrome, Firefox, or any equivalent browser
• Editor – Jupyter notebook
Firebase – Firebase is a set of hosting services for any type of application. It offers NoSQL and real-time
hosting of databases, content, and social authentication. It also provides real-time communication service. We
will deploy our application on Firebase.

Methodology

This paper explains the system which will be developed in three parts. The first part is static which
works on machine learning classifier. We will study and train the model with 4 different classifiers
and will choose the best classifier for final execution. The second part is dynamic which takes the
keyword/text from user and searches online for the truth probability of the news. The third part
provides the authenticity of the URL input by user. In this paper, we will use Python and its Sci-kit
libraries. Python has a huge set of libraries and extensions, which can be easily used in Machine
Learning. Sci-Kit Learn library is the best source for machine learning algorithms where nearly all
types of machine learning algorithms are readily available for Python, thus easy and quick evaluation
of ML algorithms is possible. We will use Django for the web based deployment of the model,
provides client side implementation using HTML, CSS and Javascript. We will also use Beautiful
Soup (bs4), requests for online scrapping.
A. System Design

B. System Architecture
i) Static Search-The architecture of Static part of fake news detection system is quite simple and is
done keeping in mind the basic machine learning process flow. The system design is shown below and
self- explanatory. The main processes in the design are

ii) Dynamic Search- +The second search field of the site asks for specific keywords to be searched on
the net upon which it provides a suitable output
for the percentage probability of that term actually being present in an article or a similar article
with those keyword references in it. iii) URL Search The third search field of the site accepts a
specific website domain name upon which the implementation looks for the site in our true sites
database or the blacklisted sites database. The true sites database holds the domain names
which regularly provide proper and authentic news and vice versa. If the site isn’t found in
either of the databases then the implementation doesn’t classify the domain it simply states that
the news aggregator does not exist.
Timeline

References .

[1] Kai Shu, Amy Sliva, Suhang Wang, Jiliang Tang, and Huan Liu, “Fake News Detection on Social
Media: A Data Mining Perspective” arXiv:1708.01967v3 [[Link]], 3 Sep 2017

[2] M. Granik and V. Mesyura, "Fake news detection using naive Bayes classifier," 2017 IEEE First
Ukraine Conference on Electrical and Computer Engineering (UKRCON), Kiev, 2017, pp. 900-903.

[3] Fake news websites. (n.d.) Wikipedia. [Online]. Available:


[Link] Accessed Feb. 6, 2017

__________________ ___________________

Signature of Supervisor Signature of Supervisor

(Dr. Arvind Kumar Tiwari) (Prof. Sohit Shukla)

You might also like