0% found this document useful (0 votes)
18 views3 pages

22021134 - Đặng Thanh Quang - Chủ đề 1

Uploaded by

Đặng Quang
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
18 views3 pages

22021134 - Đặng Thanh Quang - Chủ đề 1

Uploaded by

Đặng Quang
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Họ và tên: Đặng Thanh Quang

Mã sinh viên 22021134


The report on topic 1
Google Cloud infrastructure: 3 layers
+ Base layer: Networking and security → lays the foundation to support all of
Google’s infrastructure and applications
+ Next layer: Compute and storage → scale independently based on need
+ Top layer: Big data and machine learning products → enable you to perform tasks
to ingest, store, process, and deliver business insights, data pipelines, and ML
models
1. Google cloud's infrastructure
- Google cloud's infrastructure is based in five major geographic locations, North
America, South America, Europe, Asia and Australia. Having multiple service
locations is important because choosing where to locate applications affects qualities
like availability, durability. And latency, which measures the time a packet of
information takes to travel from its source to its destination.
- Regions represent independent geographic areas and are composed of zones. For
example, London or Europe west 2 is a region that currently contains three different
zones. A zone is an area where Google Cloud resources are deployed. For example,
let's say you launch a virtual machine using compute engine. More about compute
engine in a bit, it will run in the zone that you specify to ensure resource redundancy.
Zonal resources operate within a single zone, which means that if a zone becomes
unavailable, the resources won't be available either. Google Cloud lets users specify
the geographic locations to run services and resources.
2. Compute and storage
a. Compute
Google offers a range of computing services.
- The first is Compute Engine. Compute Engine is an IaaS offering or infrastructure
as a service which provides compute, storage, and network virtually that are similar
to physical datacenters. You use the virtual compute and storage resources the same
as you manage them locally. Compute Engine provides maximum flexibility for
those who prefer to manage server instances themselves.
The second is Google Kubernetes Engine or GKE. GKE runs containerized
applications in a Cloud environment as opposed to on an individual virtual machine
like Compute Engine. A container represents code packaged up with all its
dependencies.
- The third computing service offered by Google is App Engine, a fully managed
PaaS offering or platform as a service. PaaS offerings bind code to libraries that
provide access to the infrastructure application needs. This allows more resources to
be focused on application logic. Then there is Cloud Functions which executes code
in response to events like when a new file is uploaded to Cloud Storage.
- Finally, Cloud Run, a fully managed compute platform that enables you to run
requests or event-driven stateless workloads without having to worry about servers.
b. Storage
- For proper scaling capabilities, compute and storage are decoupled
- Data type that needs to be stored and the business need:
+ Unstructured data
• Unstructured data is information stored in a non-tabular form such as
documents, images, and audio files. Unstructured data is usually suited
to Cloud Storage, but BigQuery now offers the capability to store
unstructured data as well.
• Cloud Storage is a managed service for storing unstructured data. Cloud
Storage is a service for storing your objects in Google Cloud.
• Cloud Storage has four primary storage classes: Standard Storage,
Nearline Storage, Coldline Storage and Archive Storage.
+ Structured data → represents information stored in tables, rows, and
columns
Structured data comes in two types:
• Transactional workloads and analytical workloads. Transactional
workloads stem from Online Transaction Processing systems, which
are used when fast data inserts and updates are required to build row-
based records.
• Analytical workloads, which stem from Online Analytical Processing
systems, which are used when entire datasets need to be read. They
often require complex queries, for example, aggregations.
3. Big data and machine learning products
- Four general categories along the data-to-AI workflow: ingestion and process,
storage, analytics, and machine learning.
• Ingestion and process: include products that are used to digest both real-time
and batch data. The list includes Pub/Sub Dataflow Dataproc Cloud Data
Fusion
• Data storage: there are five storage products: Cloud Storage, Cloud SQL,
Cloud Spanner, Cloud Bigtable, and Firestore Cloud.
• Analytics: the major analytics tool is BigQuery. BigQuery is a fully managed
data warehouse that can be used to analyze data through SQL commands. In
addition to BigQuery, you can analyze data and visualize results using:
Looker, and Looker Studio.
• Machine learning, or ML. ML products include both the ML development
platform and the AI solutions: The primary product of the ML development
platform is Vertex AI, which includes the products and technologies: AutoML
Vertex AI Workbench, and TensorFlow AI solutions are built on the ML
development platform and include state-of-the-art products to meet both
horizontal and vertical market needs.

You might also like