Parsing long raw data in Python

I have a raw data file containing the texts of many books. The data is formatted as follows (sequential number, book name, text): 0,book_a, " long content" 1,book_b," long content" 2,book_c," long content" How can I parse the file and load the books into a Python dictionary such as {'id': 0, 'book_name': 'book_a', 'text': "full text"}? Solution: I assumed your raw data file…
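A minimal sketch of one way to load such a file, assuming each record sits on its own line and the text field is wrapped in double quotes as in the sample. The function name `parse_books` and the sample string are hypothetical; the splitting is done by the standard-library `csv` module, which is not necessarily what the truncated solution uses.

```python
import csv
import io


def parse_books(raw_text):
    """Return one dict per record: {'id': ..., 'book_name': ..., 'text': ...}."""
    # skipinitialspace=True lets the reader accept `book_a, "text"` (space
    # before the opening quote) as well as `book_b,"text"`.
    reader = csv.reader(io.StringIO(raw_text), skipinitialspace=True)
    books = []
    for row in reader:
        if not row:
            continue  # skip blank lines
        # Assumes exactly three fields per record; anything else raises ValueError.
        book_id, name, text = row
        books.append({"id": int(book_id), "book_name": name, "text": text})
    return books


# Hypothetical sample in the format described in the question.
sample = '0,book_a, " long content"\n1,book_b," long, content"\n'
books = parse_books(sample)
```

Because the text is quoted, commas inside a book's content stay part of the `text` field. For a real file, the same reader can be fed `open(path, newline="")` instead of the `StringIO` wrapper.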

December 15, 2023 MR

Iterating over a dictionary of pdf files and their name and create a dictionary and put the name and corresponding text into it

I wrote the following code to extract the text of a single PDF file and put it into a list. How can I modify it so that it iterates over a dictionary of PDF files and their names, and builds a dictionary that maps each name to the corresponding text? dic = {'0R.pdf': 'm1', '2R.pdf': 'm2',…
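A rough sketch of the iteration being asked for, under stated assumptions: the question's own extraction code is not shown, so `extract_pdf_text` below is a hypothetical helper that uses the `pypdf` package (assumed to be installed), and `build_text_dict` is a hypothetical name. The extractor is passed in as a parameter so it can be swapped for whatever code already works for a single file.

```python
def extract_pdf_text(path):
    """Return the text of all pages of one PDF (assumes pypdf is installed)."""
    from pypdf import PdfReader  # imported lazily; only needed for real PDFs

    reader = PdfReader(path)
    # extract_text() may return an empty result for image-only pages.
    return "\n".join((page.extract_text() or "") for page in reader.pages)


def build_text_dict(pdf_names, extract=extract_pdf_text):
    """Map each name (e.g. 'm1') to the text extracted from its file."""
    # pdf_names has the shape shown in the question: {file path: name}.
    return {name: extract(path) for path, name in pdf_names.items()}


# Usage with a stand-in extractor, so the sketch runs without any PDF files.
dic = {"0R.pdf": "m1", "2R.pdf": "m2"}
texts = build_text_dict(dic, extract=lambda path: "text of " + path)
```

With real files, `build_text_dict(dic)` would call the pypdf-based helper for each path.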

April 14, 2023 MR

Count words in a sentence controlling for negations

I am trying to count the number of times some words occur in a sentence while controlling for negations. In the example below, I wrote some very basic code that counts the number of times the words in "w" appear in "txt". Yet I fail to control for negations like "don’t" and/or "not". w = ["hello", "apple"]…
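One simple way to approach this, sketched under assumptions: a word is treated as negated when a negation token appears within a small window of tokens before it. The function name `count_non_negated`, the `NEGATIONS` set, and the window size are all illustrative choices, not part of the original question, and a fixed window is only a crude stand-in for real negation scope.

```python
import re

# Illustrative list of negation tokens; extend as needed.
NEGATIONS = {"not", "no", "never", "don't", "doesn't", "didn't", "isn't", "won't"}


def count_non_negated(txt, words, window=2):
    """Count occurrences of each word that are not preceded by a negation."""
    # Normalise typographic apostrophes so "don’t" matches "don't".
    tokens = re.findall(r"[a-z']+", txt.lower().replace("\u2019", "'"))
    counts = {w: 0 for w in words}
    for i, tok in enumerate(tokens):
        if tok in counts:
            preceding = tokens[max(0, i - window):i]
            if not any(p in NEGATIONS for p in preceding):
                counts[tok] += 1
    return counts


w = ["hello", "apple"]
txt = "Hello! I don't like apple, but hello again."
result = count_non_negated(txt, w)
```

Here "apple" falls within two tokens of "don't", so only the two occurrences of "hello" are counted.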

March 20, 2023 MR

Delete rows with a certain condition in pandas

I have a data frame and I want to delete the rows where the "PHRASE" column contains the pattern "___".

Index  PHRASE                             Label
0      proposed by the president of the   1
1      Living ___                         1
2      "Murder, ___ Wrote"                0

But imagine that the data frame has 2,000,000 entries. import re df_clean = pd.DataFrame() z =…
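A minimal sketch of a vectorised filter, assuming pandas is installed and the column is named "PHRASE" as in the sample. The helper name `drop_rows_with_pattern` is hypothetical. A boolean mask avoids a Python-level loop over rows, which matters at around 2,000,000 entries.

```python
import pandas as pd


def drop_rows_with_pattern(df, column, pattern):
    """Return a copy of df without the rows whose column contains pattern."""
    # regex=False does a plain substring match; na=False treats missing
    # values as "does not contain the pattern".
    mask = df[column].str.contains(pattern, regex=False, na=False)
    return df[~mask].reset_index(drop=True)


# Sample rows taken from the question.
df = pd.DataFrame(
    {
        "PHRASE": [
            "proposed by the president of the",
            "Living ___",
            '"Murder, ___ Wrote"',
        ],
        "Label": [1, 1, 0],
    }
)
df_clean = drop_rows_with_pattern(df, "PHRASE", "___")
```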

November 24, 2022 MR

Pythonic way to create dataset for multilabel text classification

I have a text dataset that looks like this. import pandas as pd df = pd.DataFrame({'Sentence': ['Hello World', 'The quick brown fox jumps over the lazy dog.', 'Just some text to make third sentence!'], 'label': ['greetings', 'dog,fox', 'some_class,someother_class']}) I want to transform this data into something like this. Is there a pythonic way…
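The target layout is cut off in the excerpt, so the following is only a guess at a common one: one 0/1 indicator column per label (a multi-hot encoding). Assuming that is what is wanted, and that pandas is installed, `Series.str.get_dummies` can split the comma-separated labels directly. The function name `to_multi_hot` is hypothetical.

```python
import pandas as pd


def to_multi_hot(df, text_col="Sentence", label_col="label", sep=","):
    """Return the text column plus one 0/1 indicator column per label."""
    # Assumption: labels are separated by `sep` with no surrounding spaces.
    indicators = df[label_col].str.get_dummies(sep=sep)
    return pd.concat([df[[text_col]], indicators], axis=1)


# Data frame from the question.
df = pd.DataFrame(
    {
        "Sentence": [
            "Hello World",
            "The quick brown fox jumps over the lazy dog.",
            "Just some text to make third sentence!",
        ],
        "label": ["greetings", "dog,fox", "some_class,someother_class"],
    }
)
encoded = to_multi_hot(df)
```

If a different target format is needed (for example a list of labels per row), `df["label"].str.split(",")` would be the starting point instead.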

November 15, 2022 MR

Python if/elif statement not working correctly

Please help me understand what is wrong with the code below. The code works fine if I pass values up to 34. Once I pass 35 or higher, the output is incorrect. tuk=0 if tuk <= 24: print('The text is very easy to read.') elif tuk >= 25 & tuk <= 34: print('The text is…
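The likely culprit in the visible part of the snippet is operator precedence: in Python `&` is bitwise AND and binds more tightly than the comparison operators, so `tuk >= 25 & tuk <= 34` is evaluated as `tuk >= (25 & tuk) <= 34`. The sketch below shows that effect and a version using a chained comparison. The function names and the returned labels are placeholders, since the original messages are truncated.

```python
def buggy_branch_taken(tuk):
    """Reproduce the condition from the question's elif branch."""
    # Parsed as: tuk >= (25 & tuk) <= 34, because & binds tighter than >= and <=.
    return tuk >= 25 & tuk <= 34


def readability_band(tuk):
    """Same branching with a chained comparison (labels are placeholders)."""
    if tuk <= 24:
        return "very easy"
    elif 25 <= tuk <= 34:  # equivalent to: tuk >= 25 and tuk <= 34
        return "band 25-34"
    else:
        return "35 or higher"


# For 35 the buggy condition still holds, so the elif branch runs by mistake.
print(buggy_branch_taken(35))
print(readability_band(35))
```

Replacing `&` with `and`, or using the chained form `25 <= tuk <= 34`, lets values of 35 and above fall through to the later branches.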

October 22, 2022 MR

Why does it print the same log twice?

I have to apply an .upperCase() through a formatter, but I don’t understand why it also prints the same message without the upper-casing, since I have set it up to use only that formatter. I am using java util test. public DatabaseAccessProxy(String pass, DatabaseAccess database) throws SecurityException, IOException { this.logged = false; this.pass = pass; this.database =…

May 26, 2022 MR

How can I fix Table Row Insertion of the following code?

I’m new to HTML and JS. Whenever I run the code and try to add a new row, it does not work. I am using VS Code and the HTML page runs perfectly, but the "add new row" function is not working. Can anybody help me? Here is the code segment of that particular HTML page,…

May 21, 2022 MR

Why is a column going into the next row when there are only 12 columns?

I have a row with 4 columns that are each 3 wide, which adds up to the 12 that are supposed to be in a row, but for some reason the last one is still going onto a new row. Any idea how to fix this? Here’s my code: <div class="container text-center"> <h3>Welcome to “Ender’s…

May 20, 2022; May 27, 2022 MR

Creating a new column in a pandas dataframe based on multiple different conditions

I found a similar question here, but it did not help me because my case is different. I have a huge dataframe that looks more or less like the example below: x y st mt ast sr c7 z w 0 mt 2 1 4 2 2 a yes 1 b 3 3 3 3…

May 19, 2022 MR

Dev solutions

Solutions for development problems

Tag: nlp