Converting .data file into numpy arrays

My file.data looks like this:

   "3.0,1.5,0\n
     4.6,0.7,1\n
     5.8,2.7,2"

And I want to load this data into two numpy arrays so that it looks like this in the end:

X = [ [3.0, 1.5],
      [4.6, 0.7],
      [5.8, 2.7] ]

y = [0, 1, 2]

If I do the following…


fname = open("file.data", "r")
for line in fname.readlines():
    print(line)

…I can read line by line as strings, but what would be the best way to separate these values and put them into the two numpy arrays as shown above?

Is there a nice module or function in numpy that does this really efficiently?

> Solution:

  1. If your data file is a simple text file with a delimiter, as you've shown, you can use numpy.loadtxt to load the entire file at once:

import numpy as np

data = np.loadtxt("file.data", delimiter=',')
X = data[:, 0:2]
Y = data[:, 2]
  2. In case you want to read line by line, you can use numpy.fromstring, which parses each string into an array (note the np. prefix, missing in the original, and the with block that closes the file):

import numpy as np

data = []
with open("file.data", "r") as fname:
    for line in fname:
        data.append(np.fromstring(line, sep=','))
data_array = np.array(data)
X = data_array[:, 0:2]
Y = data_array[:, 2]
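As a quick sanity check (a self-contained sketch, not part of the original answer), the snippet below writes the sample data from the question to a temporary file standing in for "file.data", loads it with numpy.loadtxt, and splits it into X and y. One detail worth knowing: loadtxt returns float64 by default, so the label column needs an explicit cast if integer labels are wanted.

```python
import os
import tempfile

import numpy as np

# Sample contents from the question, written to a temporary file
# that stands in for "file.data".
sample = "3.0,1.5,0\n4.6,0.7,1\n5.8,2.7,2\n"
with tempfile.NamedTemporaryFile("w", suffix=".data", delete=False) as f:
    f.write(sample)
    path = f.name

data = np.loadtxt(path, delimiter=',')
X = data[:, 0:2]            # first two columns -> features
y = data[:, 2].astype(int)  # last column -> integer labels
os.remove(path)

print(X.tolist())  # [[3.0, 1.5], [4.6, 0.7], [5.8, 2.7]]
print(y.tolist())  # [0, 1, 2]
```

The .astype(int) cast is optional; drop it if float labels such as [0.0, 1.0, 2.0] are acceptable.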