Create a list from rows in Pandas dataframe

Last Updated : 11 Jul, 2024

Python lists are easy to work with and come with many built-in methods for common operations. Each column of a Pandas DataFrame is a Series, but a row has no comparable ready-made container. In this post, we discuss several ways to extract whole rows of a DataFrame as Python lists.

Solution #1:

To iterate over the rows of a Pandas DataFrame we can use the DataFrame.iterrows() function and then append the data of each row to the end of a list.

Python
# importing pandas as pd
import pandas as pd

# Create the dataframe
df = pd.DataFrame({'Date':['10/2/2011', '11/2/2011', '12/2/2011', '13/2/11'],
                   'Event':['Music', 'Poetry', 'Theatre', 'Comedy'],
                   'Cost':[10000, 5000, 15000, 2000]})

# Print the dataframe
print(df)


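Output :

        Date    Event   Cost
0  10/2/2011    Music  10000
1  11/2/2011   Poetry   5000
2  12/2/2011  Theatre  15000
3    13/2/11   Comedy   2000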

Now we will use the DataFrame.iterrows() function to iterate over each row of the DataFrame and build a list from the data in each row.

Python
# Create an empty list
Row_list = []

# Iterate over each row; iterrows() yields (index, Series) pairs
for index, row in df.iterrows():
    # Create a list for the current row
    my_list = [row.Date, row.Event, row.Cost]

    # Append the row's list to the final list
    Row_list.append(my_list)

# Print the list
print(Row_list)

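Output :

[['10/2/2011', 'Music', 10000], ['11/2/2011', 'Poetry', 5000], ['12/2/2011', 'Theatre', 15000], ['13/2/11', 'Comedy', 2000]]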

As the output shows, each row of the DataFrame has been extracted into a list. Like any other Python list, the result supports all the usual list operations.

Python
# Find the length of the newly 
# created list
print(len(Row_list))

# Print the first 3 elements
print(Row_list[:3])

Output :

4
[['10/2/2011', 'Music', 10000], ['11/2/2011', 'Poetry', 5000], ['12/2/2011', 'Theatre', 15000]]
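One thing to keep in mind with iterrows() is that each row is handed over as a Series, and a Series has a single dtype. The sketch below uses a small hypothetical frame (the column names qty and price are not from the article) to show integers coming back as floats once a row also contains a float. If the original types matter, itertuples() from Solution #2 is likely the safer choice.

```python
import pandas as pd

# Hypothetical frame (not from the article): one integer and one float column
df_num = pd.DataFrame({'qty': [1, 2], 'price': [1.5, 2.5]})

rows_as_lists = []
for index, row in df_num.iterrows():
    # Each row arrives as a Series with a single dtype (float64 here)
    rows_as_lists.append(row.tolist())

print(df_num['qty'].tolist())  # the column itself holds integers
print(rows_as_lists)           # the same values appear as floats in each row
```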

Solution #2:

To iterate over the rows of a Pandas DataFrame we can also use the DataFrame.itertuples() function and then append the data of each row to the end of a list.

Python
# importing pandas as pd
import pandas as pd

# Create the dataframe
df = pd.DataFrame({'Date':['10/2/2011', '11/2/2011', '12/2/2011', '13/2/11'],
                   'Event':['Music', 'Poetry', 'Theatre', 'Comedy'],
                   'Cost':[10000, 5000, 15000, 2000]})

# Print the dataframe
print(df)


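Output :

        Date    Event   Cost
0  10/2/2011    Music  10000
1  11/2/2011   Poetry   5000
2  12/2/2011  Theatre  15000
3    13/2/11   Comedy   2000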

Now we will use the DataFrame.itertuples() function to iterate over each row of the DataFrame and build a list from the data in each row.

Python
# Create an empty list
Row_list = []

# Iterate over each row; itertuples() yields one namedtuple per row
for row in df.itertuples():
    # Create a list for the current row
    my_list = [row.Date, row.Event, row.Cost]

    # Append the row's list to the final list
    Row_list.append(my_list)

# Print the list
print(Row_list)

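Output :

[['10/2/2011', 'Music', 10000], ['11/2/2011', 'Poetry', 5000], ['12/2/2011', 'Theatre', 15000], ['13/2/11', 'Comedy', 2000]]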

As the output shows, each row of the DataFrame has again been extracted into a list, and the usual list operations apply to the result.

Python
# Find the length of the newly 
# created list
print(len(Row_list))

# Print the first 3 elements
print(Row_list[:3])

Output :

4
[['10/2/2011', 'Music', 10000], ['11/2/2011', 'Poetry', 5000], ['12/2/2011', 'Theatre', 15000]]
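The loop in Solution #2 names every column by hand. As a sketch of a column-agnostic variant, itertuples() accepts index=False to leave out the index and name=None to return plain tuples, so each row can be converted with list(). The variable name rows_as_lists is only illustrative.

```python
import pandas as pd

# Same data as the article's example
df = pd.DataFrame({'Date': ['10/2/2011', '11/2/2011', '12/2/2011', '13/2/11'],
                   'Event': ['Music', 'Poetry', 'Theatre', 'Comedy'],
                   'Cost': [10000, 5000, 15000, 2000]})

# index=False drops the index value; name=None yields plain tuples
rows_as_lists = [list(row) for row in df.itertuples(index=False, name=None)]

print(rows_as_lists)
```

This version keeps working if columns are added or renamed, since no column name appears in the loop.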

Create a list from rows in Pandas dataframe – FAQs

How to Create a List from a DataFrame Row

To create a list from a single row of a DataFrame, select the row by position with the .iloc indexer and convert the resulting Series to a list with .tolist():

import pandas as pd

# Example DataFrame
df = pd.DataFrame({
    'A': [1, 2, 3],
    'B': [4, 5, 6],
    'C': [7, 8, 9]
})

# Select the first row and convert to a list
row_list = df.iloc[0].tolist()
print(row_list)  # Output: [1, 4, 7]
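.iloc selects by position. When the DataFrame has meaningful index labels, the .loc indexer does the same job by label. The frame below, with its labels 'x', 'y' and 'z', is a hypothetical example and not part of the article's data.

```python
import pandas as pd

# Hypothetical frame with string labels as its index
df_labeled = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6]},
                          index=['x', 'y', 'z'])

# Select a row by label, then convert it to a list
row_y = df_labeled.loc['y'].tolist()

# Select the last row by position
last_row = df_labeled.iloc[-1].tolist()

print(row_y)     # [2, 5]
print(last_row)  # [3, 6]
```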

How to Create a DataFrame from a List of Rows in Pandas

If you have a list where each element (which is itself a list) represents a row, you can easily convert this into a DataFrame by passing the list to the DataFrame constructor and specifying column names:

# List of lists where each inner list is a row
data = [[1, 4, 7], [2, 5, 8], [3, 6, 9]]

# Create DataFrame
df_new = pd.DataFrame(data, columns=['A', 'B', 'C'])
print(df_new)
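As a small check of the conversion in both directions, the sketch below builds the frame with and without column names and then converts it back to a list of rows. The names df_default, df_named and round_trip are illustrative only.

```python
import pandas as pd

data = [[1, 4, 7], [2, 5, 8], [3, 6, 9]]

# Without column names, pandas labels the columns 0, 1, 2
df_default = pd.DataFrame(data)
print(df_default.columns.tolist())

# With names supplied, converting back gives the original list of rows
df_named = pd.DataFrame(data, columns=['A', 'B', 'C'])
round_trip = df_named.values.tolist()
print(round_trip == data)
```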

How to Make a List in a Pandas DataFrame

To store lists inside a DataFrame, assign a list of lists to a new or existing column. The outer list needs one entry per row:

# Adding a new column where each entry is a list
# (df is the 3-row example DataFrame created above)
df['D'] = [[10, 11], [12, 13], [14, 15]]
print(df)
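Once a column holds lists, each cell can be read back as an ordinary Python list, and DataFrame.explode() can spread the list items over separate rows. The following is a minimal sketch on a hypothetical frame named df_lists.

```python
import pandas as pd

# Hypothetical frame with one ordinary column and one list-valued column
df_lists = pd.DataFrame({'A': [1, 2, 3]})
df_lists['D'] = [[10, 11], [12, 13], [14, 15]]

# Read a single cell back as a Python list
first_cell = df_lists.loc[0, 'D']

# Length of the list stored in each row
lengths = df_lists['D'].apply(len).tolist()

# One row per list item; the original index values are repeated
exploded = df_lists.explode('D')

print(first_cell)
print(lengths)
print(exploded)
```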

How to Append Pandas Rows to List

To append every row of a DataFrame to a list, with each row converted into a list of its own, iterate over the rows with iterrows():

# Create an empty list
all_rows = []

# Append each row as a list to 'all_rows'
for index, row in df.iterrows():
    all_rows.append(row.tolist())

print(all_rows)
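For the common case where every row is wanted, the explicit loop can be skipped. A minimal sketch, using the article's example data: convert the frame to a NumPy array and call tolist() on it. Both df.values and df.to_numpy() return that array.

```python
import pandas as pd

# Same data as the article's example
df = pd.DataFrame({'Date': ['10/2/2011', '11/2/2011', '12/2/2011', '13/2/11'],
                   'Event': ['Music', 'Poetry', 'Theatre', 'Comedy'],
                   'Cost': [10000, 5000, 15000, 2000]})

# One list per row, without writing a loop
all_rows = df.to_numpy().tolist()

print(all_rows)
```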

How to Combine Multiple Rows into One List in Pandas

To combine multiple rows into a single flat list that gathers every cell value row by row, you can use a nested list comprehension over the list of rows:

# Combine all DataFrame rows into a single list
combined_list = [item for sublist in df.values.tolist() for item in sublist]
print(combined_list)
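Two other ways to flatten the rows are sketched below on a small hypothetical frame, df_small: itertools.chain.from_iterable from the standard library, and NumPy's ravel() on the underlying array. Both walk the data row by row, so the results should match the list comprehension.

```python
from itertools import chain

import pandas as pd

# Hypothetical 3x3 frame
df_small = pd.DataFrame({'A': [1, 2, 3], 'B': [4, 5, 6], 'C': [7, 8, 9]})

# Chain the per-row lists into one flat list
flat_chain = list(chain.from_iterable(df_small.values.tolist()))

# Flatten the underlying array in row-major order
flat_ravel = df_small.to_numpy().ravel().tolist()

print(flat_chain)  # [1, 4, 7, 2, 5, 8, 3, 6, 9]
print(flat_ravel)  # [1, 4, 7, 2, 5, 8, 3, 6, 9]
```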

