I have a dataframe that looks like so:
Item Quantity Price Totals
1 24" Box Desert Museum (14" Caliper) 10 92 920
2 24" Box Mastic tree (1" Caliper) 28 135 3780
3 36" Box Thornless Mesquite (2" Caliper) 9 335 3015
I am trying to remove the the parentheses, along with what is inside of it. In the end, I want it to look like so:
Item Quantity Price Totals
1 24" Box Desert Museum 10 92 920
2 24" Box Mastic tree 28 135 3780
3 36" Box Thornless Mesquite 9 335 3015
The issue I am having is not so much about removing the parentheses and its contents, its the fact that the number varies in digits, which makes every case slightly different form each other.
>Solution :
Assuming the term in parentheses always occur at the end of the item description, we can use sub() as follows:
df$item <- sub("\\s*\\(.*?\\)$", "", df$item)
We can also use str_replace to do this the stringr way:
df$item <- str_replace(df$item, "\\s*\\(.*?\\)$", "")