Home Doing simple operations with itertools combinatorics?

Questions

Doing simple operations with itertools combinatorics?

November 13, 2022

I have a python dataset that has the following structure:

cluster  pts   lon   lat

0        5      45    24
1        6      47    23
2        10     45    20

As you can see, I have a column that refers to a cluster, the number of points within a cluster, the representative latitude of the cluster and the representative longitude of the cluster. In the whole dataframe I have 140 clusters.

Now I would like to calculate for each cluster the following operation by means of a combinatorial:

𝑤𝑒𝑖𝑔ℎ𝑡(𝑖,𝑗)=−𝑛𝑖+𝑛𝑗/𝑑𝑖𝑠𝑡(𝑖,𝑗)

where i refers to a cluster and j to another.
where n refers to the number of pts

On the one hand it does the sum of the points between cluster i and cluster j, and in the denominator it calculates by means of haversine the distance between the two clusters taking into account their representative coordinates.

I’ve started by coming up with a code that uses itertools, but I have problems to continue. Any idea?

from itertools import combinations

for c in combinations(df['cluster'],2):
    sum_pts=
    distance=
    weight=-(sum_pts/distance)
    print(c,weight)

>Solution :

As you mentioned, to do the combinations, you can use itertools.
To calculate the distance you can use geopy.distance.distance. Refer to the documentation for details: https://geopy.readthedocs.io/en/stable/#module-geopy.distance

This should work:

from itertools import combinations
from geopy.distance import distance

for p1, p2 in combinations(df['cluster'], 2):
    sum_pts = df['pts'][p1] + df['pts'][p2]
# distance in km
    dist = distance(df.loc[p1, ['lat', 'lon']], df.loc[p2, ['lat', 'lon']]).km
    weight = -sum_pts/dist
    print ((p1, p2), weight)

Edit: for a case when clusters don’t necessarily correspond to index

for c1, c2 in combinations(df['cluster'], 2):
    p1, p2 = df[df['cluster'] == c1].iloc[0], df[df['cluster'] == c2].iloc[0] 
    sum_pts = p1['pts'] + p2['pts']
    dist = distance((p1['lat'], p1['lon']), (p2['lat'], p2['lon'])).km
    weight = -sum_pts/dist
    print ((c1, c2), weight)

Output:

(0, 1) -0.04733881547464973
(0, 2) -0.033865977446857085
(1, 2) -0.04086856230889897

haversine

byMR

Published November 13, 2022

Add a comment

Pygame. Sprite is still drawing after killing itself

byMR

November 13, 2022

Questions

Display only unique values in dropdown

byMR

November 13, 2022

Questions

Vector of an array of structs resets strings to be blank. C++

byMR

November 13, 2022

Questions

Is there a regular expression that can recognize positive even numbers (including numbers with leading zeros, but excluding "0000" numbers)?

byMR

November 13, 2022

Questions

Operating a calculation on four basic operation using java script functions

byMR

November 13, 2022

Questions

Why Type 'never[]' is not assignable to type 'number'

byMR

November 13, 2022

Doing simple operations with itertools combinatorics?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Like this:

Leave a ReplyCancel reply

Read more

Pygame. Sprite is still drawing after killing itself

Display only unique values in dropdown

Vector of an array of structs resets strings to be blank. C++

Is there a regular expression that can recognize positive even numbers (including numbers with leading zeros, but excluding "0000" numbers)?

Operating a calculation on four basic operation using java script functions

Why Type 'never[]' is not assignable to type 'number'

Keep Up to Date with the Most Important News

Doing simple operations with itertools combinatorics?

MEDevel.com: Open-source for Healthcare and Education

>Solution :

Share this:

Like this:

Leave a ReplyCancel reply

Keep Up to Date with the Most Important News

Read more

Pygame. Sprite is still drawing after killing itself

Display only unique values in dropdown

Vector of an array of structs resets strings to be blank. C++

Is there a regular expression that can recognize positive even numbers (including numbers with leading zeros, but excluding "0000" numbers)?

Operating a calculation on four basic operation using java script functions

Why Type 'never[]' is not assignable to type 'number'

Discover more from Dev solutions