225 questions
1
vote
0
answers
32
views
How to use Spark cluster with Featuretools
I am relatively new to Featuretools but I have ran a few projects locally with Featuretools and I think it is a great tool for ML. I want to introduce this to my team at work. My workplace has our own ...
0
votes
0
answers
31
views
grouping column is dropped in later versions of featuretools. Can it be retained?
In former versions of Featuretools, the column that was used to group by and calculate the primitives was retained in the final output. Check out for example the output of this notebook, cell 8, which ...
1
vote
0
answers
57
views
How to use featuretools at the test time?
I would demonstrate the issue with an example:
Let us say we want to use the primitive 'PERCENTILE'
Imports:
import pandas as pd
import featuretools as ft
For training (create a simple data with one ...
1
vote
0
answers
36
views
How to create an advanced primitive (value count of top n most frequent categories) with featuretools
I want to create a primitive that would count the number of occurrences of the top n categories in a column
The following is my code:
import featuretools as ft
from featuretools.tests.testing_utils ...
0
votes
0
answers
135
views
Anaconda suggesting file path that does not exist
I am importing package A which is dependent on package B and getting an import error for something within package B. Despite lots of trying, I can't get the dependencies to match, so I want to just ...
0
votes
1
answer
96
views
Featuretools failed to load plugin tsfresh from library featuretools_tsfresh_primitives.__init__
I'm trying to make featuretools and featuretools_tsfresh_primitives in my Jupyter notebook environment.
I installed both library using conda
conda install -c conda-forge featuretools
conda ...
0
votes
1
answer
69
views
AttributeError: Cutoff time DataFrame must contain a column with either the same name as the target dataframe index or a column named "instance_id"
I'm learning how to use Featuretools with this tutorial and I've made it to a snippet which is right below this paragraph:
from featuretools.tsfresh import CidCe
import featuretools as ft
fm, ...
0
votes
1
answer
64
views
featuretools basic aggegration on time measures
I am using featuretools (1.1x version), I read the docs,and also searched here
but still struggle to find how to do simple things like SELECT MIN(datetime_field_1)..
I also checked list_primitives() ...
1
vote
1
answer
133
views
How to show every primitives in featuretools
I want to list every built-in primitive in Featuretool without skip("...").
I know I can use list_primitives() but I don't know how to show everything.
import featuretools as ft
print(ft....
3
votes
1
answer
87
views
Legacy Problems trying FeatureTools exercises
I was trying this exercise in GitHub:
https://github.com/WillKoehrsen/automated-feature-engineering/blob/master/walk_through/Automated_Feature_Engineering.ipynb
but the code is too old to run on the ...
1
vote
1
answer
37
views
Can we use Feature Engineering tools without any IDENTIFIER?
My target feature(frame strength) is not an unique value. I have train and test dataset. How can I approach to use Ft? My datasets feature are temperature, hive size, some percentile values, some ...
1
vote
1
answer
71
views
Featuretools group by issue
I have a set of dataframes/entity set for rugby league/sports data: players, teams, venues, games, team_stats and player_stats
players: player_id, player_name
teams: team_id, team_name
games: game_id, ...
0
votes
1
answer
107
views
How to implement Featuretools into my ML Process?
I am exploring the possibility of implementing Featuretools into my pipeline, to be able to create new features from my Df.
Currently I am using a GridSearchCV, with a Pipeline embedded inside it. ...
0
votes
0
answers
77
views
Featuretools deep feature synthesis doesn't generate features
I'm using 3 datasets to create EntitySet using featuretools and use deep feature synthesis to generate additional features:
entity_set = ft.EntitySet("basketball_players")
entity_set....
0
votes
1
answer
86
views
IndexError: Index contains null values when adding dataframe to featuretools EntitySet
I have my dataframe which I want to add to EntitySet:
Unnamed: 0 Year name Pos Age Tm G GS \
24672 24672 2017.0 Troy Williams SF 22.0 TOT 30.0 16.0 ...