Monthly Archives: April 2013
Data Normalization
In the last post, on nearest neighbors classification, we used the “distance” between different pairs of points to decide which class each new data point should be placed into. The problem is that there are different ways to calculate distance … Continue reading
Posted in Normalization/Kernels
3 Comments
Nearest Neighbors Classification
Before we dive into nearest neighbor classification, I want to point out a subtle difference between the regression algorithm that I discussed a few posts back and what I will write about today. The goal of regression was to find … Continue reading
Posted in Classification
14 Comments
Visualization and Projection
One of the common themes that I’ve emphasized so far on this blog is that we should try to analyze high-dimensional data sets without being able to actually “see” them. However, it is often useful to visualize the data … Continue reading
Posted in Visualization
3 Comments
Principal Component Analysis
Now that we’ve gotten a taste of the curse of dimensionality, let’s look at another potential problem with the basic form of regression we discussed a few posts back. Notice that linear/least squares regression always gives you an answer, whether or … Continue reading
Posted in Modeling
22 Comments
The curse of dimensionality
Now that we’ve had a glimpse of what it means to analyze data sets in different dimensions, we should take a little detour to consider really high-dimensional data. In the discussion of regression, I suggested using your intuition about … Continue reading
Posted in Modeling
10 Comments