0% found this document useful (0 votes)

54 views20 pages

How To Use Whisper AI - The Only Guide You Need

Uploaded by

creandoparadigmas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

54 views20 pages

How To Use Whisper AI - The Only Guide You Need

Uploaded by

creandoparadigmas

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 20

🎉 Introducing Notta Showcase: instantly translate your video into multiple languages for a global reach!

Learn More

How to Use Whisper AI: The Only Guide You Need

Rivi Richards Updated: 2023-09-08 5mins

From ChatGPT to Dalle, and now, Whisper, Open AI has set the stage for the revolution of AI with some of
the most valuable and mindblowing tools you will ever find. Their youngest child, Whisper, is on a
transcription tool that beats all the rest in time, cost, and accuracy.

While it has gained much praise for being the best, one concern remains: few people know how to use it. The
fact that you can’t download it like any other software is a big letdown to likely users.

After conducting my research, some of the concerns I noted respondents raising were, “It's too technical to
use!” and ‘You have to go through numerous developer notes that are tiresome to read!”

If this is a problem you have encountered, here is an easy step-by-step solution on how to use Whisper
OpenAI.

What is OpenAI's Whisper?

Whisper is an automatic speech recognition system by Open AI, the makers of ChatGPT and Dalle. The
project is open source, meaning it is free to use, distribute, and change.
Unlike other speech-to-text systems, Whisper does not have a download site. All its files are in a GitHub
repository. You must download some developer tools and run some code to install it in your system.

Who Can Use Open AI Whisper?

Anyone who needs to convert their speech to text can use Whisper AI. For example:

A student who wants to transcribe their class notes

A meeting head who wants to derive the context of a previously recorded Zoom meeting

A podcaster looking to repurpose their audio content into various formats

A video editor looking to add subtitles to a video and more.

Take Your Productivity to the Next Level

Looking for a better transcription? Notta AI offers
accuracy, efficiency, and advanced features which
can help you transcribe speech into searchable text.
Experience seamless transcription today!

Start Now

How to Download and Install Whisper

First, it's essential to understand that Whisper is unlike other transcription and translation tools in how it runs
and operates. There is no download site with a ready file to download and install in your system. To install
and use it, you need a basic understanding of the Windows, Linux, or Mac command line, depending on your
device.

Our guide is a step-by-step process for installing Whisper in Windows for offline use. To get started, you
need several prerequisites on your computer to ensure a smooth download and install.

1. Python

2. Git

3. Rust

4. NVIDIA CUDA (optional)

5. Pip (only for older versions of Python)

6. Pytorch

7. FFmpeg

Python
For this installation, we will use Python version 3.9.9, but its dependencies allow it to work with versions
between 3.7 and 3.11.

Head to the Python website and click on the preferred Python, depending on the release date to download.
For this guide, I chose to use the Python 3.9.9. Click on it and scroll to the section with the installation files.

Click on the file best suited to your systems. The download will start immediately.

Once done, install the software into your system. When installing Pythion for the first time, remember to click
"Add to path" at the bottom of the first page of the installer. This allows you to run Python from a terminal.
Failure to check this box can cause the entire Whisper installation to fail.

Git
Since the Open AI Whisper files are on a GitHub repository, you need to download, configure, and install Git
to your system to access these files.
Visit Git for Windows and choose an installer that suits your device.

Rust
Installing Rust in your system will help you avoid errors when building the wheels for tokenizers, a unique
requirement when installing (Python) py-based projects.

There are two ways to install Rust into your system.

1. Head to Rust’s official site and choose an installer that best fits your computer system.

2.Open your command interface and run the following command line: pip install setuptools-rust

N.B: To open a CMD interface, Click ‘Windows+R’ to quick launch an app; type ‘cmd’ then click ‘run.’

NVIDIA CUDA
If you have used any AI tool before, you already know that a lot of computation power is needed to run these
tools. Therefore, running the AI tools on devices that run using NVIDIA GPUs and have NVIDIA CUDA
installed is highly favorable. CUDa improves the GPUs' processing power, allowing them to be more efficient
in processing data than traditional GPUs.

Unfortunately, you can only install CUDA on devices that run on NVIDIA GPUs. However, this does not mean
you cannot use Whisper on CPU devices. As you will see later, Whisper can run on various models from tiny,
base, small, medium, and large. The higher the model, the more the computation power and vice versa.
Therefore, all models, CPU or GPU users, can benefit.

If your device can support an NVIDIA CUDA, visit the NVIDIA website and download the latest CUDA
compatible with PyTorch.

As of this post, PyTorch supports CUDA 11.7 and 11.8.

PIP
PIP is a package installer and management tool for Python applications and packages. It’s a necessity if you
want to manage all your PyPL installations using the command line.

Newer versions of Python come with an already installed PIP. However, if you are running an older version,
you must download it to your computer.

To check if there is an installed PIP on your device, access your cmd and run the command prompt:

Pip help

If there is a response, PIP is present in your Python.

However, if you find an error response, you must install it on your device.

Visit https://pip.pypa.io/en/stable/installation/ for a step-by-step guide on downloading PIP into your system.

PyTorch
Pytorch is a deep-learning library mostly used when running applications that rely on GPUs and CPUs.
Developers prefer it due to its speed and flexibility of implementation.

To install it, go to the PyTorch Website and choose your installation preferences based on what you will be
using.
Start for Free

Once done, you will get a Command line.

Content

Copy and run the command in your cmd interface to download PyTorch.

N.B: If you use a GPU, select CUDA 11.7 or 11.8. Select the CPU if your device does not have an NVIDIA
graphics card.

FFmpeg
FFmpeg is one of the most critical tools in this list since it will help convert audio to the format Whisper can
process. To download it:

Visit the FFmpeg website to download the authentic file.

Scroll down to where the Windows Icon is and click on it. Click on one of the two files that appear below it. I
have chosen the ‘Windows builds by BtbN.’This will open a new page where you will find various ffmpeg
assets.
Scroll down and select the one that matches your system. For me, I'll choose the bigger ‘Win64’gpl. Click on
it to download the zip folder containing the files.

Extract the files to a folder and open them. In the bin file, you will find three applications you must install on
your system.

To do so, head to the local disk C and create a folder. Name this as ‘Path.’ Then, copy your three applications
and paste them into the ‘Path file’ on the local disk.
Click at the top of the drive to copy the file path ‘C:\Path’

Next, Click on the Start button and search for “Edit environment variables.” Open it.

Select ‘Path’ and click on the edit button

Click ‘New’ to add a path and paste the file path, C:\Path, at the end of the list. Then click ‘Okay’ to close the
box.

To confirm the installation is successful, open a new cmd prompt window and run ‘ffmpeg.” The installation
succeeded if the code appears like that in the image below.

Install Whisper
Since everything is ready, you can now install Whisper. To do so:
1. Open your command console and run the command lines below:
pip install git+https://github.com/openai/whisper.git

Two possible scenarios may occur:

The installation will be successful, as in the image above.

You may encounter an error like “cannot find command git.”

This error means the pip command cannot locate git in your device. As a result, it cannot connect to the
Whisper repository. To correct this problem, click here to download git for Windows, then run the pip install
command again. During the git installation, click on the check box that auto-updates the path automatically.
This will allow Pip to locate the git on your device.

2. Once the installation is complete, you only need to run Whisper in a command interface:

Here, you will see all the languages the tool can work with alongside other options that can help you run the
tool, such as the Whisper model and output format. To get more information on the various commands you
can run whisper on, use the command:

Whisper -h
N.B: If you encounter an error that says “it’s not a recognized internal or external command,” add the
Python script directory to the Path with your Python installation.

How to Record Your Voice on Mac and Windows

We are done with the hard part: the installation. Everything else that follows from now will be a breeze. To
record your voice on Mac or Windows, you need the help of a free tool such as Audacity. If you are not
interested in downloading software, you can use a web-based platform like Notta.

More Than Just Transcription

Notta not only transcribes but also translates, annotates and collaborates and seamlessly
integrates with your favorite tools like Notion and Salesforce. Let Notta improve your
productivity today!

Start Now

For the best results while recording, ensure that you:

1. Have a good microphone.

2. Record in a silent room without background noise.

When using Audacity:

1. Download the software from their main site.

2.Open the software and connect your microphone.

3.Click on Audio Setup and set your microphone as the recording device for a crispier take.

4.Click on the Record icon to start recording. Once done, Click the Stop Button to end the recording.

5.Head to ‘File’ and select ‘Export’ to save your recording as MP3, WAV, or OGG.
When using Notta:

1. Create a free account with Notta.

2. Click here to download the Chrome Extension

3.Login to your Notta account.

4.Connect your microphone and permit Notta to record.

5.Click on ‘Record an Audio’ in the top right corner of your screen to record straight from your dashboard. To
end the recording, click on the ‘Stop’ button.

The Chrome Extension can allow you to capture audio from a source.

To use it:

Identify the Audio or video you want to record.

Click on the Notta extension icon on your browser toolbar.

Hit ‘Start Recording’ and Play the audio source. Click ‘Stop’ to complete the recording.
N.B: Notta automatically saves all the recordings in the dashboard. To access and export them, navigate to
your account dashboard and find the recording you want to export. Notta allows you to export the audio as
an MP3.

How to Transcribe Voice to Text with Whisper Open AI

Now that we have the Audio, we can transcribe it using Whisper.

Save the audio file you want to transcribe in a new folder. I will call my folder ‘Transcribe.’

Open a new command prompt from the new folder. To do this, click on the file directory and type ‘cmd.’

In the command prompt window, Type ‘Whisper followed by the file name you want to transcribe. If there are
spaces in between the name of the file, remember to add apostrophe marks.
The transcription process will begin, and the time it takes to complete will depend on

The size of your file.

The speed of your GPU or CPU.

OpenAI Whisper Accuracy

Open AI’s Whisper is among the most accurate language models.

There are two ways to deduce the accuracy levels:

1. Analyzing the transcription quality.

Whisper claims that the language model has gone through 680,000 hours of multilingual data training. As a
result, it shows high levels of accuracy in transcription and translation. This intensive training has improved
Whisper AI’s robustness and ability to detect accents and eliminate background and technical noise.

2.A look at the difference in WER

A research paper comparing the Word-Error-Rate (WER) between Whisper and six other current speech
recognition models reveals that Whisper outperforms the best open-source model (NVIDIA STT) in every
data set.

As you can tell from the table above, Whisper AI takes the crown of being the most accurate tool among all
the other language models.

Still, it's essential to acknowledge that less than five languages have a word error rate lower than 5%, and
more than 25 languages have a 50% and above word error rate. Still, it manages to make 50% fewer errors
than language models.

N.B: AI speech technology is constantly improving, and Whisper AI is far from perfect. Some areas it may be
lacking include:
It can occasionally leave out some punctuations

It can transcribe some words incorrectly or fail to transcribe some at all

It does not provide a distinction between the different speakers

Whisper cannot provide real-time transcription. Currently, it only focuses on zero-shot

asynchronous transcription. To run Open AI Whisper online, you must use the Whisper API.

While it shines in performance, we still acknowledge that accuracy is still a concern to all language models,
Whisper included, especially when dealing with non-English languages.

Whisper Speech Recognition Languages

Whisper can transcribe a total of 99 languages and translate them all into English. According to the AI, the
most straightforward language to transcribe is Spanish, Italian, English, and Portuguese. All these have a
word error rate of less than 5%.

Here is a distribution of how the languages compare in their word error rates:

Number of Languages Word Error Rate

4 <5%

9 5 - 10 %

19 10 - 20 %

11 20 - 30 %

4 30 - 40 %

6 40 - 50 %

11 50 - 90 %

18 90 - 200%

Cost to Run Whisper

The most significant benefit that comes with using Whisper is that it is free to use! You can run Whisper
locally without registering and paying any subscription fees.

But there is a catch. It will cost you time and resources to install and use the software. Considering Open AI
does not provide ongoing support and integration assistance, encountering errors will create operational
setbacks.

At the same time, to get the best out of the tool, you need to use a device with a good GPU. How so?

Whisper provides five language models that you can use for transcription. These include

Tiny

Base

Small

Medium

Large.

Each model requires a certain amount of processing power to operate. For example, tiny and base needs a
VRAM of about 1 GB each, small 2GB, medium 5GB, and large 10 GB. The higher the processing power, the
faster the result.

Ideally, an Nvidia GPU (GTX970 or any newer version) can serve you well.
Do not confuse speed with accuracy. While the larger models use less time and more GPU resources, they
are not necessarily the most accurate.

Whisper Free Alternative- Notta AI Speech Recognition Software

As seen above, Whisper AI is a winner in transcription accuracy. Unfortunately, it lags behind due to its
limited features, numerous failure modes, and a lack of assistance. Also, it eliminates users with CPU devices
as they cannot maximize the use of the tool.

As such, one tool that may interest the average user that boasts high accuracy and everything else Whisper
lacks is Notta.

Notta is a transcription and translation software that can record, transcribe, and translate both audio and
video. It is among the best tools for podcasters, students, and marketing teams. Notta is a web app, Chrome
extension, and mobile app that allows seamless access across devices. Some of its most notable features
include:

1. Highly accurate - Notta delivers an accuracy of 99.98%, making it better than most tools in the
market.

2. AI summary - Notta leverages GPT-4 to derive a highly accurate and concise summary from the
generated transcription to give you an overview of the whole conversation.

3. Extensive language support - It can transcribe 58 languages and translate 42 more than any other
AI tool.

4. Fast turnaround time - The transcription process is very fast. You can get a 2-hour audio in just 5
minutes. Moreover, you don't need an expensive GPU to improve the speed!

5. Real-time meeting transcriptions and note-taking - Notta supports real-time transcription of

ongoing meetings. You only need to connect the app to your online meeting, and the AI assistant will
take care of everything.

To transcribe an audio file with Notta:

1. Sign up or log in to your Notta account.

2.At the top right corner, set the transcription language.

Rivi Richards is a trusted Freelance Contributor at Notta. With his BA in Journalism and multiple years of experience
in the tech industry, Rivi combines storytelling with technological insight. His work primarily focuses on helping users
navigate voice-to-text transcription services like Notta, simplifying complex tech concepts. Outside of work, Rivi
Rivi Richards enjoys photography, drone racing, and reading books on emerging technologies.
3.Click on ‘Import Audio’ to upload your audio file. You can drag and drop the file from your local files or
share a public URL from YouTube, Dropbox, or Google Drive. The transcription will happen immediately after
upload.

Ready for More Good Reads?

Email Subscribe

A world of handpicked news, insights, and trending topics are just one subscription away.

Related Blogs

4.Navigate to the dashboard, click on the transcribed file, and make any necessary edits using the built-in
editor.

Product Product

Notta Bot: A Smarter Way to Join and Record Meetings Notta Translation: An Efficient Translator Powered by AI

Viraj Mahajan 12mins Viraj Mahajan 12mins

5.When ready to export, click the ‘Download’ icon at the top right corner.
Get Notta Learn More Compare
6.Choose the format you want to export and save your transcript.
Web App About Us vs Otter.ai

Mobile App Blog vs Fireflies.ai

Chrome Extension Help Center vs Happy Scribe

Pricing vs Rev

Changelog vs Sonix.ai

Integrations Tools

Zoom Audio to Text Converter

Microsoft Teams Online Video Converter

Google Meet Online Audio Converter

Webex Online Vocal Remover

Conclusion
Dropbox YouTube Video Summarizer

From afar,
Google Whisper AI may seem like a tool only for tech-literate individuals, but it is, in fact, easy to use. The
Drive
only challenge you may encounter is during the set-up. While the steps may seem technical, follow this
guide to the letter, and nothing will stand in your way.

Please note that you can only access Whisper AI on the device that you install it. If you want a tool
compatible with various devices but still delivers the same level of accuracy as OpenAI’s Whisper model, give
English
Notta a try today.

Installing Whisper WebUI On Windows 10 - Web
No ratings yet
Installing Whisper WebUI On Windows 10 - Web
24 pages
Make games with Python: Create your own entertainment with Raspberry Pi
From Everand
Make games with Python: Create your own entertainment with Raspberry Pi
Sean M. Tracey
No ratings yet
Linux Clues
From Everand
Linux Clues
Lawson Hanson
No ratings yet
Setting Up Packages For Speech Recognition
No ratings yet
Setting Up Packages For Speech Recognition
3 pages
Speech Recognition MY Final Year Project
75% (85)
Speech Recognition MY Final Year Project
82 pages
Programming Abc With Python Or Programming The Easy Way: From Basics to Practical Projects in Python: From Basics to Practical Projects in Python
From Everand
Programming Abc With Python Or Programming The Easy Way: From Basics to Practical Projects in Python: From Basics to Practical Projects in Python
Mindaugas Vilčinskas
No ratings yet
NLP Based Model To Convert English Speech To Gujarati Text For Deaf Amp Dumb People
No ratings yet
NLP Based Model To Convert English Speech To Gujarati Text For Deaf Amp Dumb People
7 pages
2nddec Microsoft AI Engineer
No ratings yet
2nddec Microsoft AI Engineer
26 pages
Python Programming for Beginners
From Everand
Python Programming for Beginners
Adam Stewart
2.5/5 (2)
Using Ubuntu MATE and Its Applications
From Everand
Using Ubuntu MATE and Its Applications
Larry Bushey
No ratings yet
Chemical Engineering Ai Doa
No ratings yet
Chemical Engineering Ai Doa
34 pages
PYTHON FOR BEGINNERS: Master the Basics of Python Programming and Start Writing Your Own Code in No Time (2023 Guide for Beginners)
From Everand
PYTHON FOR BEGINNERS: Master the Basics of Python Programming and Start Writing Your Own Code in No Time (2023 Guide for Beginners)
Glen Jennings
No ratings yet
Mastering Play Framework for Scala
From Everand
Mastering Play Framework for Scala
Shiti Saxena
No ratings yet
Python for Beginners: Learn the Fundamentals of Computer Programming
From Everand
Python for Beginners: Learn the Fundamentals of Computer Programming
J Foster
No ratings yet
Programming Abc With Python Or Programming The Easy Way
From Everand
Programming Abc With Python Or Programming The Easy Way
Mindaugas Vilčinskas
No ratings yet
Python Programming: A Hands-On Guide: Hello World E-books STEM, #1
From Everand
Python Programming: A Hands-On Guide: Hello World E-books STEM, #1
Hello World E-books
No ratings yet
Python Programming Techniques: The Art of Coding and Programming Explained
From Everand
Python Programming Techniques: The Art of Coding and Programming Explained
Lance Gifford
No ratings yet
Python Programming
From Everand
Python Programming
Arthur Keane
No ratings yet
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
No ratings yet
SONAR: Sentence-Level Multimodal and Language-Agnostic Representations
14 pages
LEARN PYTHON PROGRAMMING: A Comprehensive Guide for Beginners to Master Python Programming (2024)
From Everand
LEARN PYTHON PROGRAMMING: A Comprehensive Guide for Beginners to Master Python Programming (2024)
ELISE HARRISON
No ratings yet
Your First Python Program
From Everand
Your First Python Program
Alexander Paz
No ratings yet
Python for Everyone: A Complete Guide to Coding, Data, and Web Development: Your Guide to the Digital World, #3
From Everand
Python for Everyone: A Complete Guide to Coding, Data, and Web Development: Your Guide to the Digital World, #3
Atokhon Ghaniev
No ratings yet
Ubuntu MATE: Upgrading from Windows or OSX
From Everand
Ubuntu MATE: Upgrading from Windows or OSX
Larry Bushey
No ratings yet
Python Programming: 8 Simple Steps to Learn Python Programming Language in 24 hours! Practical Python Programming for Beginners, Python Commands and Python Language
From Everand
Python Programming: 8 Simple Steps to Learn Python Programming Language in 24 hours! Practical Python Programming for Beginners, Python Commands and Python Language
Norman James
2/5 (1)
Mastering Python: Learn Python Step-by-Step with Practical Projects
From Everand
Mastering Python: Learn Python Step-by-Step with Practical Projects
Amelia Hartman
No ratings yet
Mastering Python Networking - Third Edition: Your one-stop solution to using Python for network automation, programmability, and DevOps, 3rd Edition
From Everand
Mastering Python Networking - Third Edition: Your one-stop solution to using Python for network automation, programmability, and DevOps, 3rd Edition
Eric Chou
3/5 (2)
Artificial Intelligence (CSC 477) - 1
No ratings yet
Artificial Intelligence (CSC 477) - 1
119 pages
Introduction To Computational Linguistics and Natural Language Processing
100% (1)
Introduction To Computational Linguistics and Natural Language Processing
182 pages
20 Effective ChatGPT Prompts
No ratings yet
20 Effective ChatGPT Prompts
6 pages
Strategic Computing (1983)
100% (3)
Strategic Computing (1983)
111 pages
.PRA 019-106 85pgs
No ratings yet
.PRA 019-106 85pgs
85 pages
Python for Beginners: Learn It as Easy as Pie
From Everand
Python for Beginners: Learn It as Easy as Pie
Yatin Bayya
No ratings yet
Li - 2022 - Recent Advances in End-to-End Automatic Speech Recognition（语音识别综述）
No ratings yet
Li - 2022 - Recent Advances in End-to-End Automatic Speech Recognition（语音识别综述）
26 pages
Sandvik Asri 2.0: Installation Manual
100% (2)
Sandvik Asri 2.0: Installation Manual
78 pages
Profound Python
From Everand
Profound Python
Onder Teker
5/5 (1)
Python for Beginners: A Crash Course to Learn Python Programming in 1 Week
From Everand
Python for Beginners: A Crash Course to Learn Python Programming in 1 Week
Brady Ellison
No ratings yet
NLP Unleashed Transforming Communication & Insights
No ratings yet
NLP Unleashed Transforming Communication & Insights
12 pages
Instant MinGW Starter
From Everand
Instant MinGW Starter
Ilya Shpigor
No ratings yet
PYTHON FOR BEGINNERS: A Comprehensive Guide to Learning Python Programming from Scratch (2023)
From Everand
PYTHON FOR BEGINNERS: A Comprehensive Guide to Learning Python Programming from Scratch (2023)
Denton Freeman
No ratings yet
Convai Technical Overview Speech Ai Part 2 2301964
No ratings yet
Convai Technical Overview Speech Ai Part 2 2301964
11 pages
Speech Recognition With Llms Adapted To Disordered Speech Using Reinforcement Learning
No ratings yet
Speech Recognition With Llms Adapted To Disordered Speech Using Reinforcement Learning
5 pages
SALM Speech-Augmented Language Model With In-Context Learning For Speech Recognition and Translation
No ratings yet
SALM Speech-Augmented Language Model With In-Context Learning For Speech Recognition and Translation
5 pages
Ansible for IT Experts
From Everand
Ansible for IT Experts
Denis Zuev
No ratings yet
Instant PhoneGap Social App Development
From Everand
Instant PhoneGap Social App Development
Kerri Shotts
No ratings yet
Main (pt2)
No ratings yet
Main (pt2)
13 pages
The 1 Page Python Book
From Everand
The 1 Page Python Book
Barani Kumar
2/5 (1)
Unit - 1
No ratings yet
Unit - 1
9 pages
LINUX SECURITY AND ADMINISTRATION: Safeguarding Your Linux System with Proactive Administration Practices (2024 Guide for Beginners)
From Everand
LINUX SECURITY AND ADMINISTRATION: Safeguarding Your Linux System with Proactive Administration Practices (2024 Guide for Beginners)
RANDALL BLAIR
No ratings yet
Report On Smart Bot Using Python
No ratings yet
Report On Smart Bot Using Python
19 pages
Psycho Acoustic Hiding, Revised
No ratings yet
Psycho Acoustic Hiding, Revised
5 pages
Easy-To-Follow Tutorial To Learn Python Programming In Less Than One Week
From Everand
Easy-To-Follow Tutorial To Learn Python Programming In Less Than One Week
R.M. Lewis
3/5 (6)
Holberton School Syllabus 12132018174723
No ratings yet
Holberton School Syllabus 12132018174723
45 pages
Final Completed Virtual Assistant Alexa An Artificial Intelligence
No ratings yet
Final Completed Virtual Assistant Alexa An Artificial Intelligence
45 pages
Essential Python 3
From Everand
Essential Python 3
Kevin Vans-Colina
No ratings yet
English Voice To Translated Bangla Text For Android Phone: Department of Computer Science & Engineering
No ratings yet
English Voice To Translated Bangla Text For Android Phone: Department of Computer Science & Engineering
21 pages
Applying Wav2vec2 For Speech Recognition On Bengali Common Voices Dataset
No ratings yet
Applying Wav2vec2 For Speech Recognition On Bengali Common Voices Dataset
5 pages
Oracle Tutor
No ratings yet
Oracle Tutor
2 pages
Step-by-Step Guide to Bodhi Linux 4
From Everand
Step-by-Step Guide to Bodhi Linux 4
Roger Carter
No ratings yet
Comparative Analysis of State-of-the-Art Speech Recognition Models For Low-Resource Marathi Language
No ratings yet
Comparative Analysis of State-of-the-Art Speech Recognition Models For Low-Resource Marathi Language
2 pages
Bodhi Linux 3 for Beginners
From Everand
Bodhi Linux 3 for Beginners
Roger Carter
No ratings yet
Configure The Windows 7 To Work Options
From Everand
Configure The Windows 7 To Work Options
Noel Carboni
No ratings yet
Python Programming
From Everand
Python Programming
Brian Evenson
No ratings yet
OpenAI's GPT-4o: A Quantum Leap in Multimodal Understanding
100% (1)
OpenAI's GPT-4o: A Quantum Leap in Multimodal Understanding
8 pages
AI Scribes: Boosting Physician Efficiency in Clinical Documentation
No ratings yet
AI Scribes: Boosting Physician Efficiency in Clinical Documentation
10 pages
Learning To Execute: WOJ Zaremba Gmail COM
No ratings yet
Learning To Execute: WOJ Zaremba Gmail COM
8 pages
Understanding Python: Beginner's Guide to Programming
From Everand
Understanding Python: Beginner's Guide to Programming
Sabry Fattah
No ratings yet
Python Programming Reference Guide: A Comprehensive Guide for Beginners to Master the Basics of Python Programming Language with Practical Coding & Learning Tips
From Everand
Python Programming Reference Guide: A Comprehensive Guide for Beginners to Master the Basics of Python Programming Language with Practical Coding & Learning Tips
Coleman Newton
No ratings yet
Building a Plex Server with Raspberry Pi
From Everand
Building a Plex Server with Raspberry Pi
Brian Schell
No ratings yet
Python Programming: Learn, Code, Create
From Everand
Python Programming: Learn, Code, Create
Sachin Naha
No ratings yet
C Programming For Beginners: The Simple Guide to Learning C Programming Language Fast!
From Everand
C Programming For Beginners: The Simple Guide to Learning C Programming Language Fast!
Tim Warren
5/5 (1)
Introduction to Python Programming: Learn Coding with Hands-On Projects for Beginners
From Everand
Introduction to Python Programming: Learn Coding with Hands-On Projects for Beginners
Kiet Huynh
No ratings yet
Developing Software Installers
From Everand
Developing Software Installers
Mohit Singh
No ratings yet
Python Programming: Your Beginner’s Guide To Easily Learn Python in 7 Days
From Everand
Python Programming: Your Beginner’s Guide To Easily Learn Python in 7 Days
i Code Academy
2.5/5 (3)
Computer Productivity Book 3. Use AutoHotKey to License & Deploy Your Scripts to Sell: AutoHotKey productivity, #3
From Everand
Computer Productivity Book 3. Use AutoHotKey to License & Deploy Your Scripts to Sell: AutoHotKey productivity, #3
Max Drake
No ratings yet
Python for Beginners: An Introduction to Learn Python Programming with Tutorials and Hands-On Examples
From Everand
Python for Beginners: An Introduction to Learn Python Programming with Tutorials and Hands-On Examples
Nathan Metzler
4/5 (2)
Raspberry Pi For Beginners: How to get the most out of your raspberry pi, including raspberry pi basics, tips and tricks, raspberry pi projects, and more!
From Everand
Raspberry Pi For Beginners: How to get the most out of your raspberry pi, including raspberry pi basics, tips and tricks, raspberry pi projects, and more!
Matthew Oates
No ratings yet
An Ultimate Guide to Kali Linux for Beginners
From Everand
An Ultimate Guide to Kali Linux for Beginners
Ansh Goyal
3.5/5 (4)
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
From Everand
PYTHON: Practical Python Programming For Beginners & Experts With Hands-on Project
Mark Chan
5/5 (4)
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
From Everand
Python: For Beginners A Crash Course Guide To Learn Python in 1 Week
Timothy C. Needham
4/5 (25)
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
From Everand
Python Programming For Beginners: Learn The Basics Of Python Programming (Python Crash Course, Programming for Dummies)
James Tudor
5/5 (1)
Verizon Communication
No ratings yet
Verizon Communication
1 page
The Beginner’s Guide to AI - Aider
From Everand
The Beginner’s Guide to AI - Aider
Steven Mcananey
No ratings yet
Python Programming Illustrated For Beginners & Intermediates: “Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!: The Future Is Here!
From Everand
Python Programming Illustrated For Beginners & Intermediates: “Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!: The Future Is Here!
William Sullivan
4/5 (2)
PYTHON PROGRAMMING
From Everand
PYTHON PROGRAMMING
Ramsey Hamilton
4/5 (12)
Python Made Easy: A First Course in Computer Programming using Python
From Everand
Python Made Easy: A First Course in Computer Programming using Python
Kevin Wilson
No ratings yet
Learn Python in 10 Minutes
From Everand
Learn Python in 10 Minutes
Victor Ebai
4/5 (30)
Python Programming Illustrated For Beginners & Intermediates“Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!
From Everand
Python Programming Illustrated For Beginners & Intermediates“Learn By Doing” Approach-Step By Step Ultimate Guide To Mastering Python: The Future Is Here!
William Sullivan
3/5 (1)
Security and Privacy in the Internet of Things: & Dark-web Investigation
From Everand
Security and Privacy in the Internet of Things: & Dark-web Investigation
David Tripp
No ratings yet
A concise guide to PHP MySQL and Apache
From Everand
A concise guide to PHP MySQL and Apache
alasdair gilchrist
4/5 (2)
Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps
From Everand
Python Programming : How to Code Python Fast In Just 24 Hours With 7 Simple Steps
Jason Scotts
4/5 (55)

How To Use Whisper AI - The Only Guide You Need

Uploaded by

How To Use Whisper AI - The Only Guide You Need

Uploaded by

🎉 Introducing Notta Showcase: instantly translate your video into multiple languages for a global reach!

How to Use Whisper AI: The Only Guide You Need

What is OpenAI's Whisper?

Who Can Use Open AI Whisper?

A student who wants to transcribe their class notes

A podcaster looking to repurpose their audio content into various formats

A video editor looking to add subtitles to a video and more.

Take Your Productivity to the Next Level

How to Download and Install Whisper

4. NVIDIA CUDA (optional)

5. Pip (only for older versions of Python)

There are two ways to install Rust into your system.

As of this post, PyTorch supports CUDA 11.7 and 11.8.

If there is a response, PIP is present in your Python.

Once done, you will get a Command line.

Visit the FFmpeg website to download the authentic file.

Select ‘Path’ and click on the edit button

Two possible scenarios may occur:

The installation will be successful, as in the image above.

You may encounter an error like “cannot find command git.”

How to Record Your Voice on Mac and Windows

More Than Just Transcription

For the best results while recording, ensure that you:

1. Have a good microphone.

2. Record in a silent room without background noise.

1. Download the software from their main site.

1. Create a free account with Notta.

2. Click here to download the Chrome Extension

3.Login to your Notta account.

4.Connect your microphone and permit Notta to record.

Identify the Audio or video you want to record.

Click on the Notta extension icon on your browser toolbar.

How to Transcribe Voice to Text with Whisper Open AI

The size of your file.

The speed of your GPU or CPU.

OpenAI Whisper Accuracy

There are two ways to deduce the accuracy levels:

1. Analyzing the transcription quality.

2.A look at the difference in WER

It can transcribe some words incorrectly or fail to transcribe some at all

It does not provide a distinction between the different speakers

Whisper cannot provide real-time transcription. Currently, it only focuses on zero-shot

Whisper Speech Recognition Languages

Number of Languages Word Error Rate

Cost to Run Whisper

Whisper Free Alternative- Notta AI Speech Recognition Software

5. Real-time meeting transcriptions and note-taking - Notta supports real-time transcription of

To transcribe an audio file with Notta:

1. Sign up or log in to your Notta account.

2.At the top right corner, set the transcription language.

Ready for More Good Reads?

Viraj Mahajan 12mins Viraj Mahajan 12mins

Mobile App Blog vs Fireflies.ai

Chrome Extension Help Center vs Happy Scribe

Zoom Audio to Text Converter

Microsoft Teams Online Video Converter

Google Meet Online Audio Converter

Webex Online Vocal Remover

Copyright © 2024 Notta. All Rights Reserved.

You might also like