Gen-AI
Gen-AI
Generative AI is an application of ML where the focus is on creating new data rather than
just analyzing or predicting it. This involves leveraging various ML techniques to build
models capable of generating high-quality, novel content.
Generative AI (Gen AI) and Machine Learning (ML) are closely related concepts, with Gen
AI being a specific application within the broader field of ML. Here’s a detailed explanation
of their relationship:
ML is a subset of artificial intelligence (AI) that involves training algorithms to learn patterns
from data and make decisions or predictions based on that data. There are various types of
ML, including:
Generative AI refers to algorithms that create new content. This could be text, images, music,
or any other type of data. Generative AI models learn from existing data and generate new,
similar data. The two main types of generative models in ML are:
Generative AI is a branch of ML focused on generating new data. Here's how they are
connected:
These other types of AI may still generate content, but they do it as a side
effect of their primary function. Generative AI is specifically designed to
generate new content as its primary output. Whether this is text, images,
product suggestions, whatever, that's what generative AI is designed to
do. So, now that we know where generative AI fits in the broader
landscape, together, let's explore how it works.
You feed it with thousands, millions, trillions of content, and then you
teach a certain algorithm to generate outputs and solutions as a
result. Okay, now that we got AI 101 out of the way, let's get into
generative AI 101. Let's use cars as an example. Just like a Porsche has a
different engine than a Mazda, under the umbrella term of generative
AI, there are a variety of different generative AI models. These AI models,
or car engines, are written and manufactured by groups of highly
advanced computer vision specialists, machine learning experts, and
mathematicians. They're built on years of open source machine learning
research and generally funded by companies and universities. Some of
the big players in writing these generative AI models, engines, are Open
AI, NVIDIA, Google, Meta, and universities like UC Berkeley and LMU
Munich. They can either keep these models private, or they can make
them public, what we call this, open source, for those to benefit from their
research.
All right, now that these complex generative models are written, meaning
the engines are made, what are we going to do with them? Depending on
your level of technical expertise, this can look a bit different. I'm going to
paint a picture for you with three different end users of these models. The
first person is a business leader who comes up with a product idea that
involves a generative AI model, or several. For the development of their
tool, this business leader either uses free open source generative AI
models or enters into a partnership with a corporation to get rights to
their generative AI model, then their team creates their vision. To
continue the chronology, let's say this person owns the car factory. They
direct where the engine and chassis go, but don't actually work on the
floor. The second person is a creative person with an appetite for
adventure. They might have some technical knowledge, but they aren't an
AI engineer. I mean, they can be if they want. This person goes to a car
engine showroom, where they pick a pre-made car engine or a generative
AI model from a repository like GitHub and Hugging Face. After that, they
go to a chassis manufacturer to pick their empty shell for their new
engine, their precious new engine. These chassis are called AI
notebooks. Their purpose is to hold and run the generative AI model
code. The most widely used one is Google Colab, but there are others like
Jupiter Notebooks. And the third person would be my mother, bless her
heart. She has absolutely no technical pedigree, nor she's interested in
acquiring one. But this doesn't mean she cannot benefit from generative
AI. My mother would be buying her already made car. She will have way
less control over the outcome of her car, but she will be able to drive, just
like the business leader and the creative technologist. People with no
technical knowledge can simply subscribe to an online service like
OpenAI's ChatGPT or DALL-E, or download Discord and play with
Midjourney, or download Lensa AI and Avatar Maker in their
smartphone to play with the magic of generative AI. Well, this all depends
what you want to do and what you want to build, and how much technical
expertise you already have. Now that we have our car, our generative AI
model, we can now start creating our own content and go for a drive.
Now, let's compare that to other companies that hit 1 million users. It took
Netflix 49 months to reach 1 million users. It took Twitter 24 months, it
took Airbnb 30 months, Facebook, 10 months, and it took Instagram two-
and-a half-months to reach 1 million users. Let's remember, it took
ChatGPT only one week. These figures demonstrate how easily
humans adopted their workflow for co-creating with generative AI-based
tools and services. This is amazing. However, GPT has several
limitations, such as the lack of common sense, creativity and
understanding the text it generates. Also, bias data sets and the danger of
normalization of mediocrity when we come up with creative
writing. Natural language models synthetically mimic human
capabilities, but, clearly, conscious contemplations are required before
developing generative AI tools. ChatGPT is a wonderful tool for factual and
computable information. However, I would advise us to approach it with
caution when inquiring about creative and opinion-based writing.
AI was simply a tool that the wheel designers used to inspire themselves
for the final designs that they would make. Next, Beko, that is a European-
based appliance brand, they use custom trained GANs in their
sustainability stand film, which also happens to be the world's first brand-
funded AI film created and produced by Seyhan Lee. We use GANs to
generate lightning, leaves, roots, eyes, flowers, and created seamless
transitions to flow between humans and nature. GANs have this beautiful
transitional quality. And finally, in the context of financial fraud
detection, GAN models can be used to generate synthetic versions of
fraudulent transactions, which can then be used to train a fraud detection
model. You know what's really surprising with GANs is that the same
generative AI model can be used for two very distinct professions. Here
we are seeing some financial fraud detection solving and create a new tire
styles for Audi. And then later on, the same AI model makes
impossibly beautiful visual effects for film, and that versatility is the
greatest power of GAN models.
VAE and Anomaly Detection
These models can be used for anomaly detection by training the model on
a dataset of normal data, and then using the trained model to identify
instances that deviate from the normal data. This can be used to detect
anomalies in a wide range of situations, like finding fraud in financial
transactions, spotting flaws in manufacturing or finding security breaches
in a network.
For example, Uber has used VAE for anomaly detection in their financial
transactions to detect fraud. Another example would be Google has also
used VAE to detect network intrusions using anomaly detection and
another one of a real world application of VAE would be anomaly detection
in industrial quality control. In this scenario, a VAE can be trained on a
dataset of images of normal products and then used to identify images of
products that deviate from the normal data. In this way, it can be used to
detect defects in products such as scratches, dents, or misalignments.
- The best way to predict the future, as they say, is to invent it so let's talk
about the future. In two to three years in the gaming, film and marketing
sectors generative AI will continue to be used in computer graphics, and
animation to create more realistic, and believable characters, and
environments.
This is going to be particularly important in 3D modeling. Generative AI
will be used to improve natural language understanding in virtual
assistants and chatbots making them more and more capable of handling
complex and nuanced conversations. In the energy sector, generative AI
models will be used to optimize energy consumption and production, such
as predicting demand, and managing renewable energy sources, as well
as improving the efficiency of energy distribution networks. As for the
transportation sector, generative AI models will be used to optimize traffic
flow and to predict maintenance needs for vehicles. In short, generative AI
will be used to automate repetitive tasks and improve efficiency in a wide
variety of industries. My predictions for the next 10 to 15 years would be
generative AI will be used to create more and realistic, and accurate
simulations in fields such as architecture, urban planning, and
engineering. The second would be to be used to create new materials and
products in fields, such as manufacturing and textile design. The third will
be natural language generation will be improved in the fields of content
creation such as news articles, books, and even movie scripts. It will also
improve self-driving cars by generating realistic virtual scenarios for
testing and training, and also it will excel in audio to asset generation
where you can speak, and then have the AI generate an asset. In short,
my prediction for the upcoming 10 to 15 years would be generative AI will
be used in the creation and production of mass media quality books, films,
and games. Meanwhile, it will also be the technology behind paradigm
shifting implications in the job market, such as self-driving cars, advanced
robotics for manufacturing, and for warehousing, and improved crop yield
and precision agriculture.