Questions tagged [python]

Use for data science questions related to the programming language Python. Not intended for general coding questions (which should be asked on Stack Overflow).

0 votes
0 answers
5 views

I've read somewhere (I forget the source) that we perform two types of EDA. Light EDA: see the shape of the data with df.shape; see nulls with df.isna().mean() or df.isna().sum(); see duplicates with df.duplicated(...
michella's user avatar
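
The light EDA checks named in the excerpt above can be sketched as follows. This is only a minimal illustration on a made-up toy DataFrame (the column names and values are assumptions, not from the question), and since the excerpt truncates the `df.duplicated(` call, the no-argument form is used here.

```python
import pandas as pd

# Hypothetical toy data; column names and values are illustrative only.
df = pd.DataFrame(
    {
        "customer_id": [1, 2, 2, 4],
        "amount": [10.0, 5.0, 5.0, None],
    }
)

n_rows, n_cols = df.shape             # (number of rows, number of columns)
null_share = df.isna().mean()         # fraction of missing values per column
null_count = df.isna().sum()          # count of missing values per column
n_duplicates = df.duplicated().sum()  # rows identical to an earlier row

print(n_rows, n_cols)
print(null_share)
print(null_count)
print(n_duplicates)
```

Note that `df.isna().sum` without parentheses returns the bound method rather than the counts, so the call form `df.isna().sum()` is what produces the per-column totals.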
0 votes
0 answers
11 views

I'm working on a project using the Elo Merchant Category Recommendation dataset (Kaggle). My goal is to perform customer segmentation based on their transactions by combining RFM metrics with Customer ...
Thị Mến Nguyễn's user avatar
0 votes
0 answers
11 views

I am trying to make a Convolutional Neural Network (CNN) using only numpy and other basic libraries. As a first step, though, I needed to test the regular NN layers, so I currently have a regular ...
a dude's user avatar
  • 1
7 votes
1 answer
619 views

I'm currently working with a dataset in which profanity is censored. Basically, "fuck" would be 4 heart emojis. Considering I'm trying to run topic modelling with BERTopic, what kind of preprocessing would ...
Gabriel Fagundes's user avatar
1 vote
0 answers
20 views

I'm training a CSDI model and the output looks very suspicious. Too few diffusion steps? Too high a learning rate? Nothing seems to change this behavior. Some normalization issue that I'm not accounting for? I'...
cdt123's user avatar
  • 151
5 votes
1 answer
204 views

I am working with multiple data files in a folder where some files are in CSV format and others are in JSON format. I want to combine all of them into a single DataFrame for further analysis. Here is ...
Iformatio's user avatar
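
One way to approach the task described in the excerpt above is to pick a pandas reader by file extension and concatenate the results. The sketch below is not the asker's code (the excerpt is truncated); the helper name `load_folder`, the file names, and the assumption that each JSON file holds a list of records with the same columns as the CSV files are all hypothetical.

```python
from pathlib import Path
import tempfile

import pandas as pd


def load_folder(folder):
    """Read every .csv and .json file in `folder` into one DataFrame."""
    # Map each supported extension to the pandas reader that handles it.
    readers = {".csv": pd.read_csv, ".json": pd.read_json}
    frames = []
    for path in sorted(Path(folder).iterdir()):
        reader = readers.get(path.suffix.lower())
        if reader is not None:  # silently skip unsupported file types
            frames.append(reader(path))
    # ignore_index=True gives the combined frame a fresh 0..n-1 index.
    return pd.concat(frames, ignore_index=True)


# Demo with two hypothetical files written to a temporary folder.
with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "a.csv").write_text("id,value\n1,10\n2,20\n")
    Path(tmp, "b.json").write_text('[{"id": 3, "value": 30}]')
    combined = load_folder(tmp)

print(combined)
```

If the files do not share the same columns, `pd.concat` fills the gaps with NaN, and it raises a `ValueError` when the folder contains no supported files, so real data may need extra handling.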
2 votes
0 answers
15 views

The documentation is a little bit confusing, so I thought I would ask here to make sure. I'm using: ...
cdt123's user avatar
  • 151
5 votes
1 answer
55 views

I'm implementing this paper and trying to train it on generated data, returning full ground truths and a single forecast, but the forecasts my model is producing are not anchored to the past series ...
cdt123's user avatar
  • 151
7 votes
4 answers
339 views

If I have a binary matrix that looks something like this (these are only 10 rows; my dataset has a million rows, so this is just to show what the binary matrix looks like): ...
machine_learning_student's user avatar
2 votes
0 answers
46 views

I am a 6th-semester student and my mini project is IR pedestrian detection using YOLOv8. My job is to train a GAN (Pix2Pix-based) to generate synthetic IR images. I will give my code below. My 99th ...
Oh na na's user avatar
5 votes
0 answers
32 views

I have built and trained an NMT model using an RNN in Google Colab, and now when I try to predict on my test data, my Google Colab session keeps crashing. The shape of my test data is 47838×55 ...
swar_codes's user avatar
6 votes
1 answer
131 views

I'm learning about classification using decision trees. I'm using the DecisionTreeClassifier class in the scikit-learn library in Python to train the model on training data (which yields a fully grown tree), ...
aweller3's user avatar
2 votes
0 answers
33 views

Given a text resource (corpus, novel, ...), I want to 1) find pairs of words that appear together with statistical significance and 2) extract contextual knowledge from these pairs. For simplicity I'm ...
MissingSemikolonAt63's user avatar
5 votes
1 answer
120 views

I am working on a movie recommendation problem where I get multiple files from the source, and the total data size is around 900 MB. I am using the ...
RushHour's user avatar
  • 259
5 votes
1 answer
82 views

In an organization, IT has deployed a strict proxy policy (no special rights for anyone). Windows 11 is the OS installed for everyone, with strict restrictions. To run data science tasks using Python, in ...
an ch's user avatar
  • 51