Web Scrapers and Your
Property Portal:
High Risk Lessons
Speaker
Rami Essaid
CEO
Distil Networks
Awards and Analyst Recognition
“Distil’s ability to analyze behavior provides
the best chance of detecting and blocking
bot-driven attacks.”
“5 Stars across the board.
Verdict: For monitoring the impact of bots on
a network this is the tool one needs.”
The only anti-bot solution to be included
in Gartner’s Online Fraud Detection
Market Guide
Ovum puts Distil Networks On The Radar.
“Clear innovation compared to similar
services.”
Fortune 500 & Alexa Global 10,000 Customers
Ecommerce
Travel
Publishers
Directories
Traditional Media
Marketplace
Services
Distil Protects Over 50 RE Portals Globally!
Protecting Your Data
Enhancing Your Data
Cleaning-Up Your Data
Protecting Your Data
A Brief Intro to Bots and Web Scraping
What Is Web Scraping?
Web Scraping
Also known as screen scraping, web scraping is the act of
copying large amounts of data from a website – either
manually or with an automated program.
Legitimate Scraping
Scraping can sometimes be benevolent and totally
acceptable. For example, the search engine bots that index
your website
Malicious Scraping
A systematic theft of intellectual property accessible on a
website, including pricing, content, images, and proprietary
data
Who is behind Web Scraping?
Competitors
Content Theft
Competitive Intel
Price Scraping
Aggregators
Start-ups
Unauthorized Middlemen
Hackers
Content for Fake Pages
Search Engines
Google
Bing
Yahoo
Baidu
Bad Bots Cause the Majority of Website Problems
In 2015 the most targeted verticals were digital publishing and real
estate. Real Estate sites saw a 300% increase in bad bot traffic!
Traffic by Type of Site, 2014 vs 2015
Bad Web Scraping
Web scraping is the act of taking content from a website with the intent of using
it for purposes outside the direct control of the site owner.
It can be used to
○ Steal intellectual property
○ Gain competitive advantage
○ Create aggregation or meta-sites
○ Perform market research
○ Damage SEO rankings
Alexa – monitor traffic levels
SE Ranking – track search rankings
InfiniGraph – watch social media trends
Open Site Explorer – monitor backlinks
SpyFu – view advertising keywords
Moat – find where ads are running
iSpionage – organic search keywords
Compete PRO – get demographic info
Quantcast – view audience insights
SpyOnWeb – see behind the curtain
What is Contributing to the Growth in Web Scraping?
Cheap scraping software
Inexpensive cloud computing resources
Botnet-as-a-Service
The Going Rate for Scraping: Less than $130/day
Freelancer.com Rates
Scraping three real estate sites
Data Manipulation (de-duping, etc.)
Importing into new software
Average Cost - $130 USD
Posting Stolen Data is Quick and Easy due to Turnkey Platforms
Real Estate Portal Platforms start at $299
The Cost of Replicating your Website
Scraped Data: $130
Classified Ad Website: $299
Total: $429
Why Bots / Scraping is a Problem in Real Estate
Bottom Line
Scrapers scrape because they are
making money with your listings!
And the Real Estate industry is left
with...
Higher Costs
Lost Revenues
Case Study
Enhancing Your Data
Delivering a Clear Picture of Your Web Traffic
Low Resolution Fingerprint
“Unactionable”
Hi-Def Fingerprint
“Actionable”
Hi-Def Fingerprinting Eliminates Blind Defense
IP Address
Header & User Agent Information
Cookie Browser
200+ Attributes of data
Navigator, WebGL, Plugins, Audio, Video, etc.
Tamper proofing layer
Hi-Def Fingerprint
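As a rough illustration of the idea on this slide, a fingerprint can be pictured as a stable hash over many client attributes instead of the IP address alone. The sketch below is not Distil's implementation: the `fingerprint` function and every attribute name and value in it are hypothetical, chosen only to show why a richer attribute set gives a more specific identifier.

```python
import hashlib
import json


def fingerprint(attributes: dict) -> str:
    """Return a stable SHA-256 hex digest for a set of client attributes."""
    # Serialize with sorted keys so the same attributes always produce
    # the same digest, regardless of the order they were collected in.
    canonical = json.dumps(attributes, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


# Hypothetical "low resolution" view: only IP address and user agent.
low_res = {"ip": "203.0.113.7", "user_agent": "Mozilla/5.0"}

# Hypothetical "hi-def" view: browser and device attributes, no IP address,
# so the identifier does not change when the client switches networks.
hi_def = {
    "user_agent": "Mozilla/5.0",
    "webgl_renderer": "ExampleGPU",
    "plugins": ["pdf-viewer"],
    "audio_sample_rate": 44100,
}

print(fingerprint(low_res))
print(fingerprint(hi_def))
```

Leaving the IP address out of the hashed attributes is a design choice of this sketch; it is what lets the same identifier follow a client across networks.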
The Majority of Bad Bots Now Use Multiple IP Addresses
Bots that dynamically rotate IP addresses or distribute attacks are
significantly harder to detect and mitigate
Sticky Bot Tracking With No Impact On Real Users
Device Fingerprinting
Fingerprints stick to the bot even if it
attempts to reconnect from random IP
addresses or hide behind an anonymous
proxy or peer-to-peer network
Tracks distributed attacks that would
normally fly under the radar
Without Distil With Distil
Without Impacting Users Sharing the Same IP
Avoids blocking residential users or organizations
that might share the same NAT as the bot or botnet
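The difference between blocking by IP address and blocking by fingerprint can be sketched in a few lines. This is a simplified illustration under stated assumptions, not Distil's detection logic: the request tuples, the fingerprint labels, the threshold, and the functions `flag_fingerprints` and `is_blocked` are all hypothetical.

```python
from collections import Counter


def flag_fingerprints(requests, threshold):
    """Return the fingerprints whose request count exceeds the threshold."""
    # Count per fingerprint, ignoring which IP address each request used.
    counts = Counter(fp for _ip, fp in requests)
    return {fp for fp, n in counts.items() if n > threshold}


def is_blocked(request, flagged):
    """Block on the fingerprint only, never on the IP address."""
    _ip, fp = request
    return fp in flagged


# Hypothetical traffic: one bot rotating across three IP addresses, plus a
# human who happens to share an IP address (same NAT) with the bot.
requests = [
    ("198.51.100.1", "bot-fp"),
    ("198.51.100.2", "bot-fp"),
    ("198.51.100.3", "bot-fp"),
    ("198.51.100.1", "human-fp"),
]

flagged = flag_fingerprints(requests, threshold=2)
for request in requests:
    print(request, "blocked" if is_blocked(request, flagged) else "allowed")
```

Counting per IP address would see at most two requests from any single address here, so the rotating bot would stay under the threshold. Counting per fingerprint catches it while leaving the user who shares its IP address untouched.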
Case Study
Cleaning-Up Your Data
In 2015 the most targeted verticals were digital publishing and real
estate. Real Estate sites saw a 300% increase in bad bot traffic!
Traffic by Type of Site, 2014 vs 2015
Web scraping hurts your KPIs...
Slowdowns, downtime, and poor user experiences
Increase in costs (infrastructure and people)
Distortion of web analytics
Digital ad fraud, reputation and trust (bad leads)
How Web Scrapers Impact KPIs
Majority of Bots are Advanced Persistent Bots (APBs)
APBs have one or more of the following abilities:
Advanced
Mimic human behavior
Load JavaScript
Load external resources
Support cookies
Browser automation (Selenium, PhantomJS)
Persistent
Dynamic IP rotation
Distribute attacks across IP addresses
Hide behind anonymous and peer-to-peer proxies
2016 Distil Bad Bot Report
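The "one or more of the following abilities" definition above amounts to a set-membership test. The sketch below encodes it directly; the capability labels and the `is_apb` function are hypothetical names introduced for illustration, and how each capability would actually be observed is outside its scope.

```python
# Capability labels taken from the slide; the identifiers are hypothetical.
ADVANCED = {
    "mimic_human_behavior",
    "load_javascript",
    "load_external_resources",
    "support_cookies",
    "browser_automation",
}
PERSISTENT = {
    "dynamic_ip_rotation",
    "distributed_ip_addresses",
    "anonymous_or_p2p_proxy",
}


def is_apb(observed: set) -> bool:
    """True if a bot shows at least one advanced or persistent ability."""
    return bool(observed & (ADVANCED | PERSISTENT))


print(is_apb({"load_javascript", "support_cookies"}))  # has advanced abilities
print(is_apb({"fixed_user_agent"}))                    # none of the listed abilities
```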
Loading Assets & Bots Mimicking Humans
% of bots able to load external
assets (e.g. JavaScript)
% of bots able to mimic human
behavior
These bots will skew marketing tools such as
Google Analytics, A/B testing, and conversion
tracking.
These bots will fly under the radar of most
security tools
Bots Throw Off Analytics
Impressions and Clicks Remain the Biggest Targets
Impressions
(CPM/CPV)
Clicks
(CPC)
Search
$18.8B
86% digital spend
Display
$7.9B
Video
$3.5B
Mobile
$6.2B
Leads
(CPL)
Sales
(CPA)
Lead Gen
$2.0B
Other
$5.0B
• classifieds
• sponsorship
• rich media
estimated
fraud
not at risk
$42.5B $7B
Bots Don't Buy Houses
Case Study
The Only Easy and Accurate Way to
Protect Web Applications from
Bad Bots, API Abuse, and Fraud.
Detect and Distil Traffic
No Longer Blind Defense
Complete Visibility into False Positives
17 million
CAPTCHAs served
78 solved
False Positive Rate = 0.00000458
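The rate on this slide can be reproduced from the two counts shown, assuming that "17 million" means exactly 17,000,000 and that every solved CAPTCHA is counted as a false positive (a human who was wrongly challenged). Both are assumptions of this sketch and not statements from the deck.

```python
captchas_served = 17_000_000  # "17 million", assumed to be an exact count
captchas_solved = 78          # each solve treated as one false positive

false_positive_rate = captchas_solved / captchas_served

print(f"{false_positive_rate:.8f}")  # as a fraction
print(f"{false_positive_rate:.6%}")  # as a percentage
```

With these inputs the ratio is about 4.59 in a million. The slide's 0.00000458 is consistent with that value cut off at eight decimal places, or with a served count slightly above 17,000,000.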
www.distilnetworks.com/trial/
Offer Ends: October 30, 2016
Two Months of Free Service + Traffic Analysis
www.distilnetworks.com
QUESTIONS… COMMENTS?
info@distilnetworks.com
OR CALL US ON
1.866.423.0606
