0% found this document useful (0 votes)

175 views

Iloc, Loc, and Ix For Data Selection in Python Pandas - Shane Lynn

Uploaded by

vaskore

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

175 views

Iloc, Loc, and Ix For Data Selection in Python Pandas - Shane Lynn

Uploaded by

vaskore

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 2

🙂

Shane Lynn
Data science, Startups, Analytics, and Data visualisation.

Blog Pandas Tutorials !

! About !
! Contact

Using iloc, loc, & ix to select rows and

columns in Pandas DataFrames Get some data updates!
96 Comments / blog, data science, Pandas, python, Tutorials / By Shane
Enter your email address to subscribe to

this blog and receive notifications of new

posts by email.

Email Address

Pandas Data Selection

There are multiple ways to select and index rows and columns from Pandas DataFrames. I find
Categories
tutorials online focusing on advanced selections of row and column choices a little complex for my
requirements. Select Category

Selection Options
There’s three main options to achieve the selection and indexing activities in Pandas, which can be
confusing. The three selection cases and methods covered in this post are: Pandas Tutorials

1. Selecting data by row numbers (.iloc) Pandas Groupby: Summarising,

2. Selecting data by label or by a conditional statement (.loc) Aggregating, and Grouping data in
3. Selecting in a hybrid approach (.ix) (now Deprecated in Pandas 0.20.1) Python

The Pandas DataFrame – loading,

Data Setup editing, and viewing data in Python

This blog post, inspired by other tutorials, describes selection activities with these operations. The Merge and Join DataFrames with Pandas
tutorial is suited for the general data science situation where, typically I find myself: in Python

Bar Plots in Python using Pandas

1. Each row in your data frame represents a data sample.
DataFrames
2. Each column is a variable, and is usually named. I rarely select columns without their
names. Plotting with Python and Pandas –

3. I need to quickly and often select relevant rows from the data frame for modelling and Libraries for Data Visualisation

visualisation activities. Python Pandas read_csv – Load Data

from CSV Files

For the uninitiated, the Pandas library for Python provides high-performance, easy-to-use data
Using iloc, loc, & ix to select rows and
structures and data analysis tools for handling tabular data in “series” and in “data frames”. It’s
brilliant at making your data processing easier and I’ve written before about grouping and columns in Pandas DataFrames

summarising data with Pandas. Pandas Drop: Delete DataFrame Rows &

Columns

Natural Language Processing

Pandas
Summary of iloc and loc methods discussed in this blog post. iloc and loc are operations for python
retrieving data from Pandas dataframes.
R

Selection and Indexing Methods for Pandas ROS

DataFrames Software

Talks
For these explorations we’ll need some sample data – I downloaded the uk-500 sample data set
Tutorials
from www.briandunning.com. This data contains artificial names, addresses, companies and
Uncategorized
phone numbers for fictitious UK characters. To follow along, you can download the .csv
file here. Load the data as follows (the diagrams here come from a Jupyter notebook in the web
Anaconda Python install):

1
2 import pandas as pd

3 import random

4
5 # read the data from the downloaded CSV file.

6 data = pd.read_csv('https://s3-eu-west-1.amazonaws.com/shanebucket/downloads/uk-500.csv')

7 # set a numeric id for use as an index for examples.

8 data['id'] = [random.randint(0,1000) for x in range(data.shape[0])]

9
10 data.head(5)

Pandas Index - Loading Data.py hosted with ❤ by GitHub view raw

Example data loaded from CSV file.

1. Selecting pandas data using “iloc”

The iloc indexer for Pandas Dataframe is used for integer-location based indexing / selection by
position.

The iloc indexer syntax is data.iloc[<row selection>, <column selection>], which is sure to be a
source of confusion for R users. “iloc” in pandas is used to select rows and columns by
number, in the order that they appear in the data frame. You can imagine that each row has a row
number from 0 to the total rows (data.shape[0]) and iloc[] allows selections based on these
numbers. The same applies for columns (ranging from 0 to data.shape[1] )

There are two “arguments” to iloc – a row selector, and a column selector. For example:

1 # Single selections using iloc and DataFrame

2 # Rows:

3 data.iloc[0] # first row of data frame (Aleshia Tomkiewicz) - Note a Series data type output.

4 data.iloc[1] # second row of data frame (Evan Zigomalas)

5 data.iloc[-1] # last row of data frame (Mi Richan)

6 # Columns:

7 data.iloc[:,0] # first column of data frame (first_name)

8 data.iloc[:,1] # second column of data frame (last_name)

9 data.iloc[:,-1] # last column of data frame (id)

Pandas Index - Single iloc selections.py hosted with ❤ by GitHub view raw

Multiple columns and rows can be selected together using the .iloc indexer.

1 # Multiple row and column selections using iloc and DataFrame

2 data.iloc[0:5] # first five rows of dataframe

3 data.iloc[:, 0:2] # first two columns of data frame with all rows

4 data.iloc[[0,3,6,24], [0,5,6]] # 1st, 4th, 7th, 25th row + 1st 6th 7th columns.

5 data.iloc[0:5, 5:8] # first 5 rows and 5th, 6th, 7th columns of data frame (county -> phone1).

Pandas Index - Multi iloc selections.py hosted with ❤ by GitHub view raw

There’s two gotchas to remember when using iloc in this manner:

1. Note that .iloc returns a Pandas Series when one row is selected, and a Pandas DataFrame
when multiple rows are selected, or if any column in full is selected. To counter this, pass a
single-valued list if you require DataFrame output.

When using .loc, or .iloc, you can control the output format by passing lists or single values
to the selectors.

2. When selecting multiple columns or multiple rows in this manner, remember that in your
selection e.g.[1:5], the rows/columns selected will run from the first number to one minus
the second number. e.g. [1:5] will go 1,2,3,4., [x,y] goes from x to y-1.

In practice, I rarely use the iloc indexer, unless I want the first ( .iloc[0] ) or the last ( .iloc[-1] )
row of the data frame.

2. Selecting pandas data using “loc”

The Pandas loc indexer can be used with DataFrames for two different use cases:

a.) Selecting rows by label/index

b.) Selecting rows with a boolean / conditional lookup

The loc indexer is used with the same syntax as iloc: data.loc[<row selection>, <column
selection>] .

2a. Label-based / Index-based indexing using .loc

Selections using the loc method are based on the index of the data frame (if any). Where the index
is set on a DataFrame, using <code>df.set_index()</code>, the .loc method directly selects based
on index values of any rows. For example, setting the index of our test data frame to the persons
“last_name”:

1 data.set_index("last_name", inplace=True)
2 data.head()

Pandas Index - Setting index for iloc.py hosted with ❤ by GitHub view raw

Last Name set as Index set on sample data frame

Now with the index set, we can directly select rows for different “last_name” values using
.loc[<label>] – either singly, or in multiples. For example:

Selecting single or multiple rows using .loc index selections with pandas. Note that the first
example returns a series, and the second returns a DataFrame. You can achieve a single-column
DataFrame by passing a single-element list to the .loc operation.

Select columns with .loc using the names of the columns. In most of my data work, typically I have
named columns, and use these named selections.

When using the .loc indexer, columns are referred to by names using lists of strings, or “:” slices.

You can select ranges of index labels – the selection </code>data.loc[‘Bruch’:’Julio’]</code> will
return all rows in the data frame between the index entries for “Bruch” and “Julio”. The following
examples should now make sense:

1
2 # Select rows with index values 'Andrade' and 'Veness', with all columns between 'city' and 'email'

3 data.loc[['Andrade', 'Veness'], 'city':'email']

4 # Select same rows, with just 'first_name', 'address' and 'city' columns

5 data.loc['Andrade':'Veness', ['first_name', 'address', 'city']]

6
7 # Change the index to be based on the 'id' column

8 data.set_index('id', inplace=True)

9 # select the row with 'id' = 487

10 data.loc[487]

Pandas Index - Select rows with loc.py hosted with ❤ by GitHub view raw

Note that in the last example, data.loc[487] (the row with index value 487) is not equal to
data.iloc[487] (the 487th row in the data). The index of the DataFrame can be out of numeric
order, and/or a string or multi-value.

2b. Boolean / Logical indexing using .loc

Conditional selections with boolean arrays using data.loc[<selection>] is the most common
method that I use with Pandas DataFrames. With boolean indexing or logical selection, you pass
an array or Series of True/False values to the .loc indexer to select the rows where your Series has
True values.

In most use cases, you will make selections based on the values of different columns in your data
set.

For example, the statement data[‘first_name’] == ‘Antonio’] produces a Pandas Series with a
True/False value for every row in the ‘data’ DataFrame, where there are “True” values for the rows
where the first_name is “Antonio”. These type of boolean arrays can be passed directly to the .loc
indexer as so:

Using a boolean True/False series to select rows in a pandas data frame – all rows with first name
of “Antonio” are selected.

As before, a second argument can be passed to .loc to select particular columns out of the data
frame. Again, columns are referred to by name for the loc indexer and can be a single string, a list
of columns, or a slice “:” operation.

Selecting multiple columns with loc can be achieved by passing column names to the second
argument of .loc[]

Note that when selecting columns, if one column only is selected, the .loc operator returns a Series.
For a single column DataFrame, use a one-element list to keep the DataFrame format, for
example:

If selections of a single column are made as a string, a series is returned from .loc. Pass a list to get
a DataFrame back.

Make sure you understand the following additional examples of .loc selections for clarity:

1
2 # Select rows with first name Antonio, # and all columns between 'city' and 'email'

3 data.loc[data['first_name'] == 'Antonio', 'city':'email']

4
5 # Select rows where the email column ends with 'hotmail.com', include all columns

6 data.loc[data['email'].str.endswith("hotmail.com")]

7
8 # Select rows with last_name equal to some values, all columns

9 data.loc[data['first_name'].isin(['France', 'Tyisha', 'Eric'])]

10
11 # Select rows with first name Antonio AND hotmail email addresses

12 data.loc[data['email'].str.endswith("gmail.com") & (data['first_name'] == 'Antonio')]

13
14 # select rows with id column between 100 and 200, and just return 'postal' and 'web' columns

15 data.loc[(data['id'] > 100) & (data['id'] <= 200), ['postal', 'web']]

16
17 # A lambda function that yields True/False values can also be used.

18 # Select rows where the company name has 4 words in it.

19 data.loc[data['company_name'].apply(lambda x: len(x.split(' ')) == 4)]

20
21 # Selections can be achieved outside of the main .loc for clarity:

22 # Form a separate variable with your selections:

23 idx = data['company_name'].apply(lambda x: len(x.split(' ')) == 4)

24 # Select only the True values in 'idx' and only the 3 columns specified:

25 data.loc[idx, ['email', 'first_name', 'company']]

Pandas index - loc selection examples.py hosted with ❤ by GitHub view raw

Logical selections and boolean Series can also be passed to the generic [] indexer of a pandas
DataFrame and will give the same results: data.loc[data[‘id’] == 9] == data[data[‘id’] == 9] .

3. Selecting pandas data using ix

Note: The ix indexer has been deprecated in recent versions of Pandas,

starting with version 0.20.1.

The ix[] indexer is a hybrid of .loc and .iloc. Generally, ix is label based and acts just as the .loc
indexer. However, .ix also supports integer type selections (as in .iloc) where passed an integer.
This only works where the index of the DataFrame is not integer based. ix will accept any of the
inputs of .loc and .iloc.

Slightly more complex, I prefer to explicitly use .iloc and .loc to avoid unexpected results.

As an example:

1
2 # ix indexing works just the same as .loc when passed strings

3 data.ix[['Andrade']] == data.loc[['Andrade']]

4 # ix indexing works the same as .iloc when passed integers.

5 data.ix[[33]] == data.iloc[[33]]

6
7 # ix only works in both modes when the index of the DataFrame is NOT an integer itself.

Pandas index - ix selections.py hosted with ❤ by GitHub view raw

Setting values in DataFrames using .loc

With a slight change of syntax, you can actually update your DataFrame in the same statement as
you select and filter using .loc indexer. This particular pattern allows you to update values in
columns depending on different conditions. The setting operation does not make a copy of the
data frame, but edits the original data.

As an example:

1 # Change the first name of all rows with an ID greater than 2000 to "John"
2 data.loc[data['id'] > 2000, "first_name"] = "John"

3
4 # Change the first name of all rows with an ID greater than 2000 to "John"

5 data.loc[data['id'] > 2000, "first_name"] = "John"

Pandas index - changing data with loc.py hosted with ❤ by GitHub view raw

That’s the basics of indexing and selecting with Pandas. If you’re looking for more, take a look at
the .iat, and .at operations for some more performance-enhanced value accessors in the Pandas
Documentation and take a look at selecting by callable functions for more iloc and loc fun.

← Previous Post Next Post →

! Subscribe !

Join the discussion

{} [+] #

96 COMMENTS " #

Implementare l’algoritmo KNN in Python e Scikit-learn | Lorenzo Govoni

" 1 year ago

[…] maggiori informazioni, si veda il seguente articolo (solo in […]

0 Reply

mariana
" 1 year ago

Really helpful Shane for beginners. Very through and detailed. Looking for more of your blogs on pandas and
python.

8 Reply

Yahor
" 1 year ago

Very helpful content, Shane. Helped me clear my understanding of working with row selections.

2 Reply

Bowen
" 1 year ago

Thank you so much! This is very helpful and illustrative

1 Reply

Maria
" 1 year ago

Very precise and clear. Easy to understand. Thanks for the content

1 Reply

Amol Wadpalle
" 1 year ago

Very detailed explanation! thanks!

when following your examples, i was expecting to get a type = dataframe for the below query: however its
throwing an error
print(df.iloc[[1:4, 2:4]])

3 Reply

Dihao Qi
" 1 year ago

excellent explanations. really helpful

2 Reply

Dung
" 1 year ago

Thank you so much!. Very detailed and helpful

2 Reply

Hari Natarajan
" 1 year ago

Finally, I have a clear picture. Your instructions are precise and self-explanatory. I wish you publish a detailed
book on Python Programming so that it will be of immense help for learners and programmers.

2 Reply

Data Preprocessing with Python | BeingDatum

" 1 year ago

[…] You can read more about the usage of iloc here. […]

1 Reply

srinivas reddy pachika

" 1 year ago

Excellent post. Thank you so much for coming with such awesome content

1 Reply

Marilu
" 11 months ago

Thank you so much, it helped me a lot to understand pandas selection, great article for beginners like me

1 Reply

Elaheh Arjomand
" 7 months ago

Thanks for the content/

0 Reply

Chuck
" 6 months ago

Great job – even greater examples

0 Reply

khoa
" 6 months ago

this is so concise and fully side of selecting element in pandas. Thank you, writer!

0 Reply

Sujay Bhujbal
" 4 months ago

Thank you for the explanation

0 Reply

nick
" 4 months ago

Fantastic explanation. Thanks, Shane!

0 Reply

MARCELLO DISTASIO
" 1 month ago

Exactly what I needed,n this is extremelyhelpful -thank you.

0 Reply

Aurelien
" 26 days ago

Hello!
Thank you very much for this nice article.
I try to use a dataset with scikit-learn M/L algorithm. I have approximatly 4000 samples (Sn), but my dataset is
in this format : (first image, multiple lines for one output); I would like to move it in this format (second image),
to have each sample on 1 raw.

loc and iloc can helps me in moving every 5 raw for column 1 in a single raw please?

Thank you for your help and advises.

0 Reply

Aurelien
" 26 days ago

Hello!
Thank you very much for this nice article.
I try to use a dataset with scikit-learn M/L algorithm. I have approximatly 4000 samples (Sn), but my dataset is
in this format : (multiple lines of input for one output); I would like to move it in this format (second image), to
have each sample on 1 raw.

loc and iloc can helps me in moving every 5 raw for column 1 in a single raw please?

Thank you for your help and advises.

0 Reply

« Previous 1 2 3

Barracuda Web Application Firewall Best Practices Guide PDF
No ratings yet
Barracuda Web Application Firewall Best Practices Guide PDF
14 pages
Learning Pandas PDF
No ratings yet
Learning Pandas PDF
171 pages
Pandas Basics
No ratings yet
Pandas Basics
84 pages
ITSMS Service Catalog Template
100% (2)
ITSMS Service Catalog Template
13 pages
Pandas Tutorial
No ratings yet
Pandas Tutorial
1 page
Pandas Cheatsheet DF
No ratings yet
Pandas Cheatsheet DF
1 page
DevOps Session 3 Pandas.pptx
No ratings yet
DevOps Session 3 Pandas.pptx
33 pages
PPT for Assignment-3 (Final_Pandas_Lab)
No ratings yet
PPT for Assignment-3 (Final_Pandas_Lab)
40 pages
PANDAS Python
No ratings yet
PANDAS Python
2 pages
IP Imp Notes
No ratings yet
IP Imp Notes
5 pages
Pandas PDF
No ratings yet
Pandas PDF
171 pages
Pandas Notes
No ratings yet
Pandas Notes
4 pages
RA Continuing Education (Data Processing With Pandas)
No ratings yet
RA Continuing Education (Data Processing With Pandas)
77 pages
Python Pandas Tutorial For Beginners
No ratings yet
Python Pandas Tutorial For Beginners
203 pages
Pandas
No ratings yet
Pandas
4 pages
Pandas
No ratings yet
Pandas
40 pages
Python Pandas Tutorial
No ratings yet
Python Pandas Tutorial
2 pages
Lab3 - Python - Pandas DataFrame - GeeksforGeeks
No ratings yet
Lab3 - Python - Pandas DataFrame - GeeksforGeeks
20 pages
01-Numpy & Pandas
No ratings yet
01-Numpy & Pandas
69 pages
rajni_ip_file_final
No ratings yet
rajni_ip_file_final
42 pages
FDS Module 2 Notes
No ratings yet
FDS Module 2 Notes
24 pages
Pandas Questions
No ratings yet
Pandas Questions
11 pages
Unit-4Introduction To Pandas
No ratings yet
Unit-4Introduction To Pandas
44 pages
Eda Unit 2
No ratings yet
Eda Unit 2
65 pages
Mdad - Numpy ML
No ratings yet
Mdad - Numpy ML
85 pages
Pandas Notes(1)
No ratings yet
Pandas Notes(1)
44 pages
Python Pandas Tutorial
No ratings yet
Python Pandas Tutorial
45 pages
Christian Mayer, Lukas Rieger, Kyrylo Kravets - Coffee Break Pandas - 74 Pandas Puzzles To Build Your Pandas Data Science Superpower-Finxter - Com (2020)
No ratings yet
Christian Mayer, Lukas Rieger, Kyrylo Kravets - Coffee Break Pandas - 74 Pandas Puzzles To Build Your Pandas Data Science Superpower-Finxter - Com (2020)
156 pages
FDS Notes Unit-4
No ratings yet
FDS Notes Unit-4
30 pages
Day64 - Pandas Interview Questions
No ratings yet
Day64 - Pandas Interview Questions
5 pages
Practical Guide To Pandas For Data Science
No ratings yet
Practical Guide To Pandas For Data Science
26 pages
Phan1_Pandas_Numpy_Matplotlib
No ratings yet
Phan1_Pandas_Numpy_Matplotlib
158 pages
Data Wrangling With Python and Pandas
No ratings yet
Data Wrangling With Python and Pandas
7 pages
Pandas
No ratings yet
Pandas
13 pages
Loki Temp PPT Pandas 2
No ratings yet
Loki Temp PPT Pandas 2
31 pages
Pandas PDF(2)
No ratings yet
Pandas PDF(2)
25 pages
pandas_Trick_ques
No ratings yet
pandas_Trick_ques
2 pages
3Y3Z2Xzqn7 U Y%K : 2. How To Create A Data Frame Using A Dictionary of Pre-Existing Columns or Numpy 2D Arrays?
No ratings yet
3Y3Z2Xzqn7 U Y%K : 2. How To Create A Data Frame Using A Dictionary of Pre-Existing Columns or Numpy 2D Arrays?
8 pages
Combining Data in Pandas With Merge, .Join, and Concat - Real Python
No ratings yet
Combining Data in Pandas With Merge, .Join, and Concat - Real Python
2 pages
Rest of the Ip Project
No ratings yet
Rest of the Ip Project
26 pages
lab 1 ML lab
No ratings yet
lab 1 ML lab
15 pages
Pandas
No ratings yet
Pandas
42 pages
Pandas (Ziad)
No ratings yet
Pandas (Ziad)
38 pages
unit-3(FODS)
No ratings yet
unit-3(FODS)
34 pages
Pandas DataFrames
No ratings yet
Pandas DataFrames
1 page
Python Data Frame New
No ratings yet
Python Data Frame New
32 pages
PYTHON Pandas and Manipulation Data
No ratings yet
PYTHON Pandas and Manipulation Data
36 pages
lecture-week2
No ratings yet
lecture-week2
72 pages
Pandas Learndatasci
No ratings yet
Pandas Learndatasci
86 pages
Jacky Bai - Pandas Hands-On - Data Analysis Crash Course (2020)
No ratings yet
Jacky Bai - Pandas Hands-On - Data Analysis Crash Course (2020)
139 pages
IP 12th Chapter 3
No ratings yet
IP 12th Chapter 3
9 pages
Python & MySQL for Data Analysis
No ratings yet
Python & MySQL for Data Analysis
45 pages
Python CSBS Bhavya Lab Manual
No ratings yet
Python CSBS Bhavya Lab Manual
14 pages
Chapter-2 Python Pandas
100% (2)
Chapter-2 Python Pandas
33 pages
Pandas Notes
No ratings yet
Pandas Notes
3 pages
Pandas
No ratings yet
Pandas
94 pages
Lecture 7 Working With Pandas (1)
No ratings yet
Lecture 7 Working With Pandas (1)
15 pages
ainotes
No ratings yet
ainotes
5 pages
a5
No ratings yet
a5
28 pages
Learning Pandas 2.0: A Comprehensive Guide to Data Manipulation and Analysis for Data Scientists and Machine Learning Professionals
From Everand
Learning Pandas 2.0: A Comprehensive Guide to Data Manipulation and Analysis for Data Scientists and Machine Learning Professionals
Matthew Rosch
No ratings yet
Mastering Pandas in Python: Course Book
From Everand
Mastering Pandas in Python: Course Book
Pedro Martins
No ratings yet
Python Data Analysis
From Everand
Python Data Analysis
Ivan Idris
4/5 (2)
Python - How Do I Find Numeric Columns in Pandas - Stack Overflow
No ratings yet
Python - How Do I Find Numeric Columns in Pandas - Stack Overflow
6 pages
Narrowing The Search: Which Hyperparameters Really Matter?
No ratings yet
Narrowing The Search: Which Hyperparameters Really Matter?
9 pages
Python - Display Number With Leading Zeros - Stack Overflow
No ratings yet
Python - Display Number With Leading Zeros - Stack Overflow
8 pages
Organisational Restructure Excel Dashboard - Excel Dashboards VBA
No ratings yet
Organisational Restructure Excel Dashboard - Excel Dashboards VBA
1 page
R - How Dnorm Works? - Stack Overflow
No ratings yet
R - How Dnorm Works? - Stack Overflow
1 page
Mboxcox, Interpreting Difficult Regressions: 2 Answers
No ratings yet
Mboxcox, Interpreting Difficult Regressions: 2 Answers
1 page
Problems With Stepwise Regression
No ratings yet
Problems With Stepwise Regression
1 page
For-Loops in R (Optional Lab) : This Is A Bonus Lab. You Are Not Required To Know This Information For The Final Exam
No ratings yet
For-Loops in R (Optional Lab) : This Is A Bonus Lab. You Are Not Required To Know This Information For The Final Exam
2 pages
Three Reasons That You Should NOT Use Deep Learning - by George Seif - Towards Data Science
No ratings yet
Three Reasons That You Should NOT Use Deep Learning - by George Seif - Towards Data Science
1 page
3 Must-Have Projects For Your Data Science Portfolio - by Aakash N S - Jovian - Jan, 2021 - Medium
No ratings yet
3 Must-Have Projects For Your Data Science Portfolio - by Aakash N S - Jovian - Jan, 2021 - Medium
1 page
VBA - String Parsing. String Parsing Involves Looking Through - by Breakcorporate - Medium
No ratings yet
VBA - String Parsing. String Parsing Involves Looking Through - by Breakcorporate - Medium
1 page
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
No ratings yet
Semi-Automated Exploratory Data Analysis (EDA) in Python - by Destin Gong - Mar, 2021 - Towards Data
3 pages
Autofilter With Column Formatted As Date: 10 Answers
No ratings yet
Autofilter With Column Formatted As Date: 10 Answers
1 page
Excel - Selecting A Specific Column of A Named Range For The SUMIF Function - Stack Overflow
No ratings yet
Excel - Selecting A Specific Column of A Named Range For The SUMIF Function - Stack Overflow
1 page
Refer To Excel Cell in Table by Header Name and Row Number: 7 Answers
No ratings yet
Refer To Excel Cell in Table by Header Name and Row Number: 7 Answers
1 page
Excel - Can Advanced Filter Criteria Be in The VBA Rather Than A Range? - Stack Overflow
No ratings yet
Excel - Can Advanced Filter Criteria Be in The VBA Rather Than A Range? - Stack Overflow
1 page
You're Not Good at Excel and You Don't Even Know It - by Breakcorporate - Medium
No ratings yet
You're Not Good at Excel and You Don't Even Know It - by Breakcorporate - Medium
1 page
MS Excel PivotTable Deleted Items Remain - Excel and Access
No ratings yet
MS Excel PivotTable Deleted Items Remain - Excel and Access
1 page
VBA - Bubble Sort. A Bubble Sort Is A Technique To Order - by Breakcorporate - Medium
No ratings yet
VBA - Bubble Sort. A Bubble Sort Is A Technique To Order - by Breakcorporate - Medium
1 page
TreeSheets: App Reviews, Features, Pricing & Download - AlternativeTo
No ratings yet
TreeSheets: App Reviews, Features, Pricing & Download - AlternativeTo
1 page
Excel VBA Type Mismatch Error Passing Range To Array - Stack Overflow
No ratings yet
Excel VBA Type Mismatch Error Passing Range To Array - Stack Overflow
1 page
Excel VBA - Message and Input Boxes in Excel, MsgBox Function, InputBox Function, InputBox Method
No ratings yet
Excel VBA - Message and Input Boxes in Excel, MsgBox Function, InputBox Function, InputBox Method
2 pages
Sorting Arrays in VBA
No ratings yet
Sorting Arrays in VBA
2 pages
As32 TTL 1W
No ratings yet
As32 TTL 1W
22 pages
Advantages and Extra Functions of Distributed
No ratings yet
Advantages and Extra Functions of Distributed
3 pages
Introduction To Visual FoxPro 5
No ratings yet
Introduction To Visual FoxPro 5
26 pages
The "Ultra-Secure" Network Architecture
No ratings yet
The "Ultra-Secure" Network Architecture
9 pages
Python
No ratings yet
Python
68 pages
Coa Unit-3
No ratings yet
Coa Unit-3
35 pages
COMPUTER APPLICATION MCQ
No ratings yet
COMPUTER APPLICATION MCQ
3 pages
Flue Tah
No ratings yet
Flue Tah
47 pages
NGS Solutions and Cases NW V1.2
100% (2)
NGS Solutions and Cases NW V1.2
28 pages
009 11895 R14.0 - SMDR MCI PMS Interface Specifications
No ratings yet
009 11895 R14.0 - SMDR MCI PMS Interface Specifications
234 pages
Geoland: Installation and Administration Guide
No ratings yet
Geoland: Installation and Administration Guide
54 pages
LAB211 Assignment: Title Background Program Specifications
No ratings yet
LAB211 Assignment: Title Background Program Specifications
3 pages
Sap Fico
No ratings yet
Sap Fico
10 pages
Am29LV008B: 8 Megabit (1 M X 8-Bit) CMOS 3.0 Volt-Only Boot Sector Flash Memory
No ratings yet
Am29LV008B: 8 Megabit (1 M X 8-Bit) CMOS 3.0 Volt-Only Boot Sector Flash Memory
38 pages
Cse 308
No ratings yet
Cse 308
2 pages
Managing Customer Database
No ratings yet
Managing Customer Database
13 pages
4th Year Dw& Dm Kai075 Unit 1
No ratings yet
4th Year Dw& Dm Kai075 Unit 1
25 pages
IT3010 Network Design & Management: 3 Year, 1 Semester
No ratings yet
IT3010 Network Design & Management: 3 Year, 1 Semester
17 pages
Vsolution Pon App Example v1.0.1
No ratings yet
Vsolution Pon App Example v1.0.1
60 pages
Security Associations (Sas) : VPN Overview
No ratings yet
Security Associations (Sas) : VPN Overview
7 pages
16Gb DDR4 SDRAM
No ratings yet
16Gb DDR4 SDRAM
379 pages
OLAP Cube: 2 Hierarchy
No ratings yet
OLAP Cube: 2 Hierarchy
4 pages
Memory-Performance-Optimization-ppt
No ratings yet
Memory-Performance-Optimization-ppt
10 pages
CCNA 1 (v5.1 + v6.0) Chapter 4 Exam Answers Quiz #4
No ratings yet
CCNA 1 (v5.1 + v6.0) Chapter 4 Exam Answers Quiz #4
2 pages
All About TransactionScope - CodeProject
No ratings yet
All About TransactionScope - CodeProject
16 pages
TS-x59Pro TS-x39ProII - Datasheet
No ratings yet
TS-x59Pro TS-x39ProII - Datasheet
5 pages
JVM (Java Virtual Machine)
No ratings yet
JVM (Java Virtual Machine)
34 pages
How To Configure RMAN To Work With NETbackup
No ratings yet
How To Configure RMAN To Work With NETbackup
7 pages

Iloc, Loc, and Ix For Data Selection in Python Pandas - Shane Lynn

Uploaded by

Iloc, Loc, and Ix For Data Selection in Python Pandas - Shane Lynn

Uploaded by

🙂

Blog Pandas Tutorials !

Using iloc, loc, & ix to select rows and

this blog and receive notifications of new

Pandas Data Selection

1. Selecting data by row numbers (.iloc) Pandas Groupby: Summarising,

The Pandas DataFrame – loading,

Bar Plots in Python using Pandas

visualisation activities. Python Pandas read_csv – Load Data

from CSV Files

Natural Language Processing

Selection and Indexing Methods for Pandas ROS

7 # set a numeric id for use as an index for examples.

8 data['id'] = [random.randint(0,1000) for x in range(data.shape[0])]

Pandas Index - Loading Data.py hosted with ❤ by GitHub view raw

Example data loaded from CSV file.

1. Selecting pandas data using “iloc”

1 # Single selections using iloc and DataFrame

4 data.iloc[1] # second row of data frame (Evan Zigomalas)

5 data.iloc[-1] # last row of data frame (Mi Richan)

7 data.iloc[:,0] # first column of data frame (first_name)

8 data.iloc[:,1] # second column of data frame (last_name)

9 data.iloc[:,-1] # last column of data frame (id)

1 # Multiple row and column selections using iloc and DataFrame

There’s two gotchas to remember when using iloc in this manner:

2. Selecting pandas data using “loc”

a.) Selecting rows by label/index

2a. Label-based / Index-based indexing using .loc

Last Name set as Index set on sample data frame

3 data.loc[['Andrade', 'Veness'], 'city':'email']

5 data.loc['Andrade':'Veness', ['first_name', 'address', 'city']]

9 # select the row with 'id' = 487

2b. Boolean / Logical indexing using .loc

3 data.loc[data['first_name'] == 'Antonio', 'city':'email']

9 data.loc[data['first_name'].isin(['France', 'Tyisha', 'Eric'])]

12 data.loc[data['email'].str.endswith("gmail.com") & (data['first_name'] == 'Antonio')]

15 data.loc[(data['id'] > 100) & (data['id'] <= 200), ['postal', 'web']]

18 # Select rows where the company name has 4 words in it.

19 data.loc[data['company_name'].apply(lambda x: len(x.split(' ')) == 4)]

22 # Form a separate variable with your selections:

23 idx = data['company_name'].apply(lambda x: len(x.split(' ')) == 4)

25 data.loc[idx, ['email', 'first_name', 'company']]

3. Selecting pandas data using ix

Note: The ix indexer has been deprecated in recent versions of Pandas,

4 # ix indexing works the same as .iloc when passed integers.

Pandas index - ix selections.py hosted with ❤ by GitHub view raw

Setting values in DataFrames using .loc

5 data.loc[data['id'] > 2000, "first_name"] = "John"

← Previous Post Next Post →

Join the discussion

Implementare l’algoritmo KNN in Python e Scikit-learn | Lorenzo Govoni

[…] maggiori informazioni, si veda il seguente articolo (solo in […]

Thank you so much! This is very helpful and illustrative

Very detailed explanation! thanks!

excellent explanations. really helpful

Thank you so much!. Very detailed and helpful

Data Preprocessing with Python | BeingDatum

srinivas reddy pachika

Thanks for the content/

Great job – even greater examples

Thank you for the explanation

Fantastic explanation. Thanks, Shane!

Exactly what I needed,n this is extremelyhelpful -thank you.

Thank you for your help and advises.

Thank you for your help and advises.

Copyright © 2021 Shane Lynn | Powered by Astra WordPress Theme

You might also like