Fast, Powerful and
Scalable Analytics
Why Analytics ?
• Get the most value of your data asset
• Faster Better decision making process
• Cost reduction
• New products and services
Type of Analytics
• Descriptive: What happened ?
• Diagnostics: Why did it happen?
• Predictive: What is likely to happen?
• Prescriptive: What should I do about it ?
Descriptive: What happened ?
● Reports
○ Sales Report
○ Expense summary
● Ad-hoc requests to analyst
Diagnostics: Why did it happen
● Aggregates: aggregate measure over one or
more dimension
○ Find total sales
○ Top five product ranked by sales
● Roll-ups: Aggregate at different levels of
dimension hierarchy
○ given total sales by city, roll-up to get sales by
state
● Drill-down: Inverse of roll-ups
○ given total sales by state, drill-down to get
total by city
● Slicing and Dicing:
○ Equality and range selections on one or more
dimensions
Predictive: What is likely to happen
● Sales Prediction
○ Analyze data to identify trends, spot
weakness or determine conditions
among broader data sets for making
decisions about the future
● Targeted marketing
○ what is likelihood of a customer buying
a particular product based on past
buying behavior
Prescriptive: What is the best course of action?
Paradox of choices
With too many choices, which one is the best?
Data Analytics Use Cases
By industry
Finance
Identify trade patterns
Detect fraud and anomalies
Predict trading outcomes
Manufacturing
Simulations to improve design/yield
Detect production anomalies
Predict machine failures (sensor data)
Telecom
Behavioral analysis of customer calls
Network analysis (perf and reliability)
Healthcare
Find genetic profiles/matches
Analyze health vs spending
Predict viral outbreaks
Data Analytics Solution Consideration
• Technical Considerations
• Real-time analytics
– High speed data ingestion
– High speed read queries
• Analytics
– Built in analytics
– Choice of BI tools
• Business Considerations
• Cost of deployment and use
– Hardware and
Price/Performance ratio
– Large talent pool
Existing Approaches
Limited real time analytics
Slow releases of product innovation
Expensive hardware and software
Data Warehouses
Hadoop / NoSQL
LIMITED SQL SUPPORT
DIFFICULT TO
INSTALL/MANAGE
LIMITED TALENT POOL
DATA LAKE W/ NO DATA
MANAGEMENT
Hard to use
Purpose Built rather than predictive
analytics
MariaDB Big Data Solution
MariaDB AX
and
MariaDB ColumnStore
MariaDB AX
Analytics -
simple, fast, scalable…
and open source
MariaDB AX
MariaDB Server
MariaDB MaxScale
MariaDB ColumnStore
Parallel queries
Distributed storage
No indexes
Automatic partitioning
Read optimized
High compression
Low disk IO ColumnStore
Storage
ColumnStore
Storage
ColumnStore
Storage
MariaDB Server
ColumnStore
MariaDB Server
ColumnStore
MariaDB MaxScale
MariaDB Server
ColumnStore
ColumnStore
Storage
MariaDB MaxScale
UM
User
Module
PM
Performance Module
MariaDB ColumnStore
High performance columnar storage engine that supports a wide variety
of analytical use cases in highly scalable distributed environments
Parallel query
processing for distributed
environments
Faster, More
Efficient Queries
Single Interface for
OLTP and analytics
Easy to Manage and
Scale
Easier Enterprise
Analytics
Power of SQL and
Freedom of Open
Source to Big Data
Analytics
Better Price
Performance
Better Price
Performance
Flexible deployment option
• Cloud and On-premise
• Run on commodity hardware
• Open Source, Subscription based pricing
90.3%
less per TB
per year
Commercial Data
Warehouse
MariaDB
ColumnStore
No need to maintain a third platform
• Run analytics from the same SQL front end
• No need to update application code
• Leverage MariaDB Extensible architecture
High data compression
• More efficient at storing big data
• Less hardware
Customers have saved by going to MariaDB AX against
Oracle(HealthCare), MemSQL(Auto-parts), Vertica(Finance, SEO
Marketing): Come see them at M18!
Easier Enterprise
Analytics
ANSI SQL
Single SQL Front-end
• Use a single SQL interface for analytics and OLTP
• Leverage MariaDB Security features - Encryption for
data in motion, role based access and auditing
Full ANSI SQL
• No more SQL “like” query
• Support complex join, aggregation and window
function
Easy to manage and scale
• Eliminate needs for indexes and views
• Automated horizontal/vertical partitioning
• Linear scalable by adding new nodes as data grows
• Out of box connection with BI tools
MariaDB AX customers across industries: Auto Parts, Finance, Ad
analytics, Asset management, Telecommunication, Healthcare,
Digital Media, Carpooling App
Faster, More
Efficient Queries
Optimized for Columnar storage
• Columnar storage reduces disk I/O
• Blazing fast read-intensive workload
• Ultra fast data import
Parallel
Query Processing
Parallel distributed query execution
• Distributed queries into series of parallel operations
• Fully parallel high speed data ingestion
– TPCH lineitem table - 750K to 1 million rows per min
Highly available analytic environment
• Built-in Redundancy
• Automatic fail-over
MariaDB AX customers across industries: Auto Parts, Finance, Ad
analytics, Asset management, Telecommunication, Healthcare,
Digital Media, Carpooling App
Ingestion Analytics
Data Services
Bulk Data Adapters
Apache Kafka
Streaming Data Adapters
Spark / Python / ML
Bulk Data Adapters
Operations
Transaction (OLTP)
MariaDB Server
InnoDB
MariaDB MaxScale
Web/Mobile Services
MariaDB MaxScale
Analytics (OLAP)
MariaDB Server
ColumnStore
Simple & Streamlined data ingestion
Streaming data
adapters – Apache
Kafka
Stream all messages published
to Apache Kafka topics to
MariaDB AX automatically and
continuously - enable data
from many sources to be
streamed and collected for
analysis without complex
code
MariaDB Server
ColumnStore
Apache Kafka
ColumnStore Storage ColumnStore StorageColumnStore Storage
Write API Write API Write API
MariaDB Server
ColumnStore
Streaming Data
Adapter
(Kafka Client)
Topic Topic Topic
OLTP to OLAP:
Streaming data
adapters – MaxScale
CDC
Stream all writes from
MariaDB TX to MariaDB AX
automatically and continously -
ensure analytical data is
up to date and not stale, no
need for batch jobs,
manual processes or
human intervention
MariaDB Server
InnoDB
MariaDB Server
ColumnStore
MariaDB MaxScale
ColumnStore Storage ColumnStore StorageColumnStore Storage
Write API Write API Write API
MariaDB Server
ColumnStore
Streaming Data
Adapter
(CDC Client)
CDC Server
MariaDB AX Use Cases
IHME - Institute of Health Metrics and Evaluation
IHME Visualizations library: http://www.healthdata.org/results/data-visualizations
Started with 4.2 TB, with goal to go to 30TB of data
Customer Use Case -1
Industry: healthcare (Medicaid)
Data: surveys
Use case: decision support system
Details:
• Identify trends and patterns
• Determine population cohorts
• Predict health outcomes
• Anticipate funding / capacity
• Recommend intervention
Can’t do complex queries on current
hardware with Oracle and snowflake
schemas
Limited to optimizing for simple, known
queries (2-3 columns)
Replaced with ColumnStore
> a single table
> 2.5 million rows, 248 columns >
complex, ad-hoc queries
> query 20+ columns in seconds
Customer Use Case - 2
Industry: biotechnology (genetics)
Data: genotypes
Use case: genetic profiling
Details:
• Find genetic mates (beef and dairy)
• Predict meat production (pork)
• Gene/DNA analysis
Had to convert to CSV files and schedule
import jobs (cron)
Always receiving new genetic data
Migrated to data adapter (Python)
> streamline import process
> remove steps / possible error
> remove delays
> import data on demand
> immediate customer access
Customer Use Case - 3
Industry:Mobile text/call app
Data: call and text logs
Use case: Mobile app use analytics
Details:
• 30 million text and 3 million phone call
per day
• 1.5 billion rows of logs per day
• The text and call volume rate will
continue to grow
InnoDB backend hit the scale limit of
6TB and it requires lot of performance
tuning and index management
Migrated to MariaDB AX
> Able to process 24 month - 24TB vs
6 months limitation of InnoDB
> Same BI tools and client applications
worked with MariaDB AX seamlessly
MariaDB AX
Analytics made easy –
simple, fast, scalable…
Thank you

More Related Content

PDF
Securing data and preventing data breaches
PDF
Semi Structured Data
PDF
Maximizing performance via tuning and optimization
PPTX
Delivering fast, powerful and scalable analytics
PDF
Дмитрий Лавриненко "Blockchain for Identity Management, based on Fast Big Data"
PPTX
MongoDB Operations for Developers
PDF
Webinar: 10-Step Guide to Creating a Single View of your Business
PPTX
MongoDB in a Mainframe World
Securing data and preventing data breaches
Semi Structured Data
Maximizing performance via tuning and optimization
Delivering fast, powerful and scalable analytics
Дмитрий Лавриненко "Blockchain for Identity Management, based on Fast Big Data"
MongoDB Operations for Developers
Webinar: 10-Step Guide to Creating a Single View of your Business
MongoDB in a Mainframe World

What's hot (20)

PPTX
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
PPTX
MongoDB and RDBMS: Using Polyglot Persistence at Equifax
PPTX
An Enterprise Architect's View of MongoDB
PPTX
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
PDF
SQL Server 2017 Enhancements You Need To Know
PPTX
MongoDB Evenings Minneapolis: Medtronic's MongoDB Journey
PPTX
Jumpstart: MongoDB BI Connector & Tableau
PPTX
Microsoft azure documentDB
PPTX
Big Data Business Transformation - Big Picture and Blueprints
PPTX
Introduction to Azure DocumentDB
PPTX
Introduction to Couchbase: Onomi
PPTX
Tableau & MongoDB: Visual Analytics at the Speed of Thought
PPTX
Key-Value NoSQL Database
PPT
MongoDB in the Healthcare Enterprise
PPT
MongoATL: How Sourceforge is Using MongoDB
PDF
SQL vs. NoSQL Databases
PPTX
Coming to cassandra from relational world (New)
PPTX
MongoDB Atlas
PPTX
Benefits of Using MongoDB Over RDBMSs
PDF
Business Track: How MongoDB Helps Telefonia Digital Accelerate Time to Market
MongoDB as a Data Warehouse: Time Series and Device History Data (Medtronic)
MongoDB and RDBMS: Using Polyglot Persistence at Equifax
An Enterprise Architect's View of MongoDB
MongoDB San Francisco 2013: Storing eBay's Media Metadata on MongoDB present...
SQL Server 2017 Enhancements You Need To Know
MongoDB Evenings Minneapolis: Medtronic's MongoDB Journey
Jumpstart: MongoDB BI Connector & Tableau
Microsoft azure documentDB
Big Data Business Transformation - Big Picture and Blueprints
Introduction to Azure DocumentDB
Introduction to Couchbase: Onomi
Tableau & MongoDB: Visual Analytics at the Speed of Thought
Key-Value NoSQL Database
MongoDB in the Healthcare Enterprise
MongoATL: How Sourceforge is Using MongoDB
SQL vs. NoSQL Databases
Coming to cassandra from relational world (New)
MongoDB Atlas
Benefits of Using MongoDB Over RDBMSs
Business Track: How MongoDB Helps Telefonia Digital Accelerate Time to Market

Similar to Fast, Powerful and Scalable Analytics (20)

PDF
Delivering fast, powerful and scalable analytics
PDF
MariaDB AX: Analytics with MariaDB ColumnStore
PDF
MariaDB AX: Solución analítica con ColumnStore
PDF
[db tech showcase Tokyo 2017] C37: MariaDB ColumnStore analytics engine : use...
PPTX
Hadoop in the Cloud: Common Architectural Patterns
PDF
Business Intelligence Architecture
PDF
Customer value analysis of big data products
PPTX
Business analytics and data visualisation
PPTX
Assessing New Databases– Translytical Use Cases
PPTX
Skilwise Big data
PDF
The New Model
PPTX
Skillwise Big Data part 2
PDF
Girish Juneja - Intel Big Data & Cloud Summit 2013
PPTX
Data lake-itweekend-sharif university-vahid amiry
PDF
Machine Data Analytics
PPTX
Big data unit 2
PPT
Informix & IWA : Operational analytics performance
PPTX
IPC Data Analysis and Extraction
PPT
Data ware housing- Introduction to data ware housing
PDF
Fast Cycle, Multi-Terabyte Data Analysis with Amazon Redshift and ClearStory ...
Delivering fast, powerful and scalable analytics
MariaDB AX: Analytics with MariaDB ColumnStore
MariaDB AX: Solución analítica con ColumnStore
[db tech showcase Tokyo 2017] C37: MariaDB ColumnStore analytics engine : use...
Hadoop in the Cloud: Common Architectural Patterns
Business Intelligence Architecture
Customer value analysis of big data products
Business analytics and data visualisation
Assessing New Databases– Translytical Use Cases
Skilwise Big data
The New Model
Skillwise Big Data part 2
Girish Juneja - Intel Big Data & Cloud Summit 2013
Data lake-itweekend-sharif university-vahid amiry
Machine Data Analytics
Big data unit 2
Informix & IWA : Operational analytics performance
IPC Data Analysis and Extraction
Data ware housing- Introduction to data ware housing
Fast Cycle, Multi-Terabyte Data Analysis with Amazon Redshift and ClearStory ...

More from MariaDB plc (20)

PDF
MariaDB Berlin Roadshow Slides - 8 April 2025
PDF
MariaDB München Roadshow - 24 September, 2024
PDF
MariaDB Paris Roadshow - 19 September 2024
PDF
MariaDB Amsterdam Roadshow: 19 September, 2024
PDF
MariaDB Paris Workshop 2023 - MaxScale 23.02.x
PDF
MariaDB Paris Workshop 2023 - Newpharma
PDF
MariaDB Paris Workshop 2023 - Cloud
PDF
MariaDB Paris Workshop 2023 - MariaDB Enterprise
PDF
MariaDB Paris Workshop 2023 - Performance Optimization
PDF
MariaDB Paris Workshop 2023 - MaxScale
PDF
MariaDB Paris Workshop 2023 - novadys presentation
PDF
MariaDB Paris Workshop 2023 - DARVA presentation
PDF
MariaDB Tech und Business Update Hamburg 2023 - MariaDB Enterprise Server
PDF
MariaDB SkySQL Autonome Skalierung, Observability, Cloud-Backup
PDF
Einführung : MariaDB Tech und Business Update Hamburg 2023
PDF
Hochverfügbarkeitslösungen mit MariaDB
PDF
Die Neuheiten in MariaDB Enterprise Server
PDF
Global Data Replication with Galera for Ansell Guardian®
PDF
Introducing workload analysis
PDF
Under the hood: SkySQL monitoring
MariaDB Berlin Roadshow Slides - 8 April 2025
MariaDB München Roadshow - 24 September, 2024
MariaDB Paris Roadshow - 19 September 2024
MariaDB Amsterdam Roadshow: 19 September, 2024
MariaDB Paris Workshop 2023 - MaxScale 23.02.x
MariaDB Paris Workshop 2023 - Newpharma
MariaDB Paris Workshop 2023 - Cloud
MariaDB Paris Workshop 2023 - MariaDB Enterprise
MariaDB Paris Workshop 2023 - Performance Optimization
MariaDB Paris Workshop 2023 - MaxScale
MariaDB Paris Workshop 2023 - novadys presentation
MariaDB Paris Workshop 2023 - DARVA presentation
MariaDB Tech und Business Update Hamburg 2023 - MariaDB Enterprise Server
MariaDB SkySQL Autonome Skalierung, Observability, Cloud-Backup
Einführung : MariaDB Tech und Business Update Hamburg 2023
Hochverfügbarkeitslösungen mit MariaDB
Die Neuheiten in MariaDB Enterprise Server
Global Data Replication with Galera for Ansell Guardian®
Introducing workload analysis
Under the hood: SkySQL monitoring

Recently uploaded (20)

PPTX
UNIT-1 NOTES Data warehousing and data mining.pptx
PPTX
DataGovernancePrimer_Hosch_2018_11_04.pptx
PDF
GPL License Terms of document persentaion
PPTX
reflex-210317162019.pptxjy5i767i6i67i67i67i76
PDF
TenneT-Integrated-Annual-Report-2018.pdf
PDF
Lesson 1 - intro Cybersecurity and Cybercrime.pptx.pdf
PDF
Machine Learning Final Summary Cheat Sheet
PDF
Introduction to Database Systems Lec # 1
PPTX
1.Introduction to orthodonti hhhgghhcs.pptx
PDF
MULTI-ACCESS EDGE COMPUTING ARCHITECTURE AND SMART AGRICULTURE APPLICATION IN...
PPTX
Transport System for Biology students in the 11th grade
PDF
n8n Masterclass.pdfn8n Mastercn8n Masterclass.pdflass.pdf
PPTX
Evaluasi program Bhs Inggris th 2023-2024 dan prog th 2024-2025-1.pptx
PPTX
ISO 9001-2015 quality management system presentation
PPT
genetics-16bbbbbbhhbbbjjjjjjjjffggg11-.ppt
PDF
PPT nikita containers of the company use
PPTX
text mining_Natural Language Processing.pptx
PPT
DWDM unit 1 for btech 3rd year students.ppt
PDF
NU-MEP-Standards معايير تصميم جامعية .pdf
PPTX
Dkdkskakkakakakskskdjddidiiffiiddakaka.pptx
UNIT-1 NOTES Data warehousing and data mining.pptx
DataGovernancePrimer_Hosch_2018_11_04.pptx
GPL License Terms of document persentaion
reflex-210317162019.pptxjy5i767i6i67i67i67i76
TenneT-Integrated-Annual-Report-2018.pdf
Lesson 1 - intro Cybersecurity and Cybercrime.pptx.pdf
Machine Learning Final Summary Cheat Sheet
Introduction to Database Systems Lec # 1
1.Introduction to orthodonti hhhgghhcs.pptx
MULTI-ACCESS EDGE COMPUTING ARCHITECTURE AND SMART AGRICULTURE APPLICATION IN...
Transport System for Biology students in the 11th grade
n8n Masterclass.pdfn8n Mastercn8n Masterclass.pdflass.pdf
Evaluasi program Bhs Inggris th 2023-2024 dan prog th 2024-2025-1.pptx
ISO 9001-2015 quality management system presentation
genetics-16bbbbbbhhbbbjjjjjjjjffggg11-.ppt
PPT nikita containers of the company use
text mining_Natural Language Processing.pptx
DWDM unit 1 for btech 3rd year students.ppt
NU-MEP-Standards معايير تصميم جامعية .pdf
Dkdkskakkakakakskskdjddidiiffiiddakaka.pptx

Fast, Powerful and Scalable Analytics

  • 2. Why Analytics ? • Get the most value of your data asset • Faster Better decision making process • Cost reduction • New products and services
  • 3. Type of Analytics • Descriptive: What happened ? • Diagnostics: Why did it happen? • Predictive: What is likely to happen? • Prescriptive: What should I do about it ?
  • 4. Descriptive: What happened ? ● Reports ○ Sales Report ○ Expense summary ● Ad-hoc requests to analyst
  • 5. Diagnostics: Why did it happen ● Aggregates: aggregate measure over one or more dimension ○ Find total sales ○ Top five product ranked by sales ● Roll-ups: Aggregate at different levels of dimension hierarchy ○ given total sales by city, roll-up to get sales by state ● Drill-down: Inverse of roll-ups ○ given total sales by state, drill-down to get total by city ● Slicing and Dicing: ○ Equality and range selections on one or more dimensions
  • 6. Predictive: What is likely to happen ● Sales Prediction ○ Analyze data to identify trends, spot weakness or determine conditions among broader data sets for making decisions about the future ● Targeted marketing ○ what is likelihood of a customer buying a particular product based on past buying behavior
  • 7. Prescriptive: What is the best course of action? Paradox of choices With too many choices, which one is the best?
  • 8. Data Analytics Use Cases By industry Finance Identify trade patterns Detect fraud and anomalies Predict trading outcomes Manufacturing Simulations to improve design/yield Detect production anomalies Predict machine failures (sensor data) Telecom Behavioral analysis of customer calls Network analysis (perf and reliability) Healthcare Find genetic profiles/matches Analyze health vs spending Predict viral outbreaks
  • 9. Data Analytics Solution Consideration • Technical Considerations • Real-time analytics – High speed data ingestion – High speed read queries • Analytics – Built in analytics – Choice of BI tools • Business Considerations • Cost of deployment and use – Hardware and Price/Performance ratio – Large talent pool
  • 10. Existing Approaches Limited real time analytics Slow releases of product innovation Expensive hardware and software Data Warehouses Hadoop / NoSQL LIMITED SQL SUPPORT DIFFICULT TO INSTALL/MANAGE LIMITED TALENT POOL DATA LAKE W/ NO DATA MANAGEMENT Hard to use Purpose Built rather than predictive analytics
  • 11. MariaDB Big Data Solution MariaDB AX and MariaDB ColumnStore
  • 12. MariaDB AX Analytics - simple, fast, scalable… and open source
  • 13. MariaDB AX MariaDB Server MariaDB MaxScale MariaDB ColumnStore Parallel queries Distributed storage No indexes Automatic partitioning Read optimized High compression Low disk IO ColumnStore Storage ColumnStore Storage ColumnStore Storage MariaDB Server ColumnStore MariaDB Server ColumnStore MariaDB MaxScale MariaDB Server ColumnStore ColumnStore Storage MariaDB MaxScale UM User Module PM Performance Module
  • 14. MariaDB ColumnStore High performance columnar storage engine that supports a wide variety of analytical use cases in highly scalable distributed environments Parallel query processing for distributed environments Faster, More Efficient Queries Single Interface for OLTP and analytics Easy to Manage and Scale Easier Enterprise Analytics Power of SQL and Freedom of Open Source to Big Data Analytics Better Price Performance
  • 15. Better Price Performance Flexible deployment option • Cloud and On-premise • Run on commodity hardware • Open Source, Subscription based pricing 90.3% less per TB per year Commercial Data Warehouse MariaDB ColumnStore No need to maintain a third platform • Run analytics from the same SQL front end • No need to update application code • Leverage MariaDB Extensible architecture High data compression • More efficient at storing big data • Less hardware Customers have saved by going to MariaDB AX against Oracle(HealthCare), MemSQL(Auto-parts), Vertica(Finance, SEO Marketing): Come see them at M18!
  • 16. Easier Enterprise Analytics ANSI SQL Single SQL Front-end • Use a single SQL interface for analytics and OLTP • Leverage MariaDB Security features - Encryption for data in motion, role based access and auditing Full ANSI SQL • No more SQL “like” query • Support complex join, aggregation and window function Easy to manage and scale • Eliminate needs for indexes and views • Automated horizontal/vertical partitioning • Linear scalable by adding new nodes as data grows • Out of box connection with BI tools MariaDB AX customers across industries: Auto Parts, Finance, Ad analytics, Asset management, Telecommunication, Healthcare, Digital Media, Carpooling App
  • 17. Faster, More Efficient Queries Optimized for Columnar storage • Columnar storage reduces disk I/O • Blazing fast read-intensive workload • Ultra fast data import Parallel Query Processing Parallel distributed query execution • Distributed queries into series of parallel operations • Fully parallel high speed data ingestion – TPCH lineitem table - 750K to 1 million rows per min Highly available analytic environment • Built-in Redundancy • Automatic fail-over MariaDB AX customers across industries: Auto Parts, Finance, Ad analytics, Asset management, Telecommunication, Healthcare, Digital Media, Carpooling App
  • 18. Ingestion Analytics Data Services Bulk Data Adapters Apache Kafka Streaming Data Adapters Spark / Python / ML Bulk Data Adapters Operations Transaction (OLTP) MariaDB Server InnoDB MariaDB MaxScale Web/Mobile Services MariaDB MaxScale Analytics (OLAP) MariaDB Server ColumnStore Simple & Streamlined data ingestion
  • 19. Streaming data adapters – Apache Kafka Stream all messages published to Apache Kafka topics to MariaDB AX automatically and continuously - enable data from many sources to be streamed and collected for analysis without complex code MariaDB Server ColumnStore Apache Kafka ColumnStore Storage ColumnStore StorageColumnStore Storage Write API Write API Write API MariaDB Server ColumnStore Streaming Data Adapter (Kafka Client) Topic Topic Topic
  • 20. OLTP to OLAP: Streaming data adapters – MaxScale CDC Stream all writes from MariaDB TX to MariaDB AX automatically and continously - ensure analytical data is up to date and not stale, no need for batch jobs, manual processes or human intervention MariaDB Server InnoDB MariaDB Server ColumnStore MariaDB MaxScale ColumnStore Storage ColumnStore StorageColumnStore Storage Write API Write API Write API MariaDB Server ColumnStore Streaming Data Adapter (CDC Client) CDC Server
  • 21. MariaDB AX Use Cases
  • 22. IHME - Institute of Health Metrics and Evaluation IHME Visualizations library: http://www.healthdata.org/results/data-visualizations Started with 4.2 TB, with goal to go to 30TB of data
  • 23. Customer Use Case -1 Industry: healthcare (Medicaid) Data: surveys Use case: decision support system Details: • Identify trends and patterns • Determine population cohorts • Predict health outcomes • Anticipate funding / capacity • Recommend intervention Can’t do complex queries on current hardware with Oracle and snowflake schemas Limited to optimizing for simple, known queries (2-3 columns) Replaced with ColumnStore > a single table > 2.5 million rows, 248 columns > complex, ad-hoc queries > query 20+ columns in seconds
  • 24. Customer Use Case - 2 Industry: biotechnology (genetics) Data: genotypes Use case: genetic profiling Details: • Find genetic mates (beef and dairy) • Predict meat production (pork) • Gene/DNA analysis Had to convert to CSV files and schedule import jobs (cron) Always receiving new genetic data Migrated to data adapter (Python) > streamline import process > remove steps / possible error > remove delays > import data on demand > immediate customer access
  • 25. Customer Use Case - 3 Industry:Mobile text/call app Data: call and text logs Use case: Mobile app use analytics Details: • 30 million text and 3 million phone call per day • 1.5 billion rows of logs per day • The text and call volume rate will continue to grow InnoDB backend hit the scale limit of 6TB and it requires lot of performance tuning and index management Migrated to MariaDB AX > Able to process 24 month - 24TB vs 6 months limitation of InnoDB > Same BI tools and client applications worked with MariaDB AX seamlessly
  • 26. MariaDB AX Analytics made easy – simple, fast, scalable…