Home How can I select one of each category with a Pandas DataFrame in Python?

Questions

How can I select one of each category with a Pandas DataFrame in Python?

November 25, 2021

I have a dataframe that looks like this:

author|string
abc|hi
abc|yo
def|whats
ghi|up
ghi|dog

how can I select only one row per author? I’m at a loss.
I want to do something like this:

df.loc[unique authors].sample(n=1000)

and get something like this:

author|string
abc|hi
def|whats
ghi|up

I was thinking of converting the author column to categories, but I don’t know where to go from there.

I could just do something like this but it seems stupid.

author_list = df['author'].unique().tolist()
indexes = []
for author in author_list:
  indexes.append(df.loc[df['author'] == author].iloc[0].index)
df.iloc[indexes].sample(n=1000)

>Solution :

You can do

out = df.drop_duplicates('author')

dataframe

byMR

Published November 25, 2021

Add a comment

How can I verify if a file only contains numbers in C?

byMR

November 25, 2021

Questions

Creating button in javascript, but onclick function automatic works

byMR

November 25, 2021

Questions

Get entire contents of cell – not just values

byMR

November 25, 2021

Questions

How do I read JSON using pandas?

byMR

November 25, 2021

Questions

recreate array with all items but the last

byMR

November 25, 2021

Questions

Merge Pandas Dataframes based on substring or partial match in another Dataframe

byMR

November 25, 2021

How can I select one of each category with a Pandas DataFrame in Python?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

How can I verify if a file only contains numbers in C?

Creating button in javascript, but onclick function automatic works

Get entire contents of cell – not just values

How do I read JSON using pandas?

recreate array with all items but the last

Merge Pandas Dataframes based on substring or partial match in another Dataframe

Keep Up to Date with the Most Important News

How can I select one of each category with a Pandas DataFrame in Python?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

How can I verify if a file only contains numbers in C?

Creating button in javascript, but onclick function automatic works

Get entire contents of cell – not just values

How do I read JSON using pandas?

recreate array with all items but the last

Merge Pandas Dataframes based on substring or partial match in another Dataframe

Discover more from Dev solutions