Comments about ‘U. researchers meeting with NSA on data mining work’
What You May Have Missed
Most Popular
Across Site
In Utah
- Bottom 30 elementary schools in Utah by test...
- Top 30 elementary schools in Utah by test scores
- Growing pains: Rate of young men struggling...
- BYU student killed after falling 70 feet in...
- New president to lead Mormon Tabernacle Choir
- Glenn Beck unleashes his dogs of war
- Gail Miller gets engaged to Salt Lake attorney
- Charges: Runaway teen caused accident that...
Most Commented
Across Site
In Utah
- Make it a small: N.Y.'s ban on large...
37 - Glenn Beck unleashes his dogs of war
34 - Cottonwood High School football coach...
25 - Rep. Jim Matheson favors getting rid of...
15 - Idaho awaits No Child Left Behind waiver
14 - Poll shows Utahns think Legislature's...
14 - Man shot brother while showing him...
13 - Jon Huntsman Jr. is done pulling punches
12






For the past five semesters I've been working with Dr. Richard Wellman of the Math department at Westminster College on an undergraduate research project focused on machine learning and the Netflix dataset. The details are certainly lacking here, but custom computer programs shouldn't generally struggle with 5000 data points; our fairly standard program can calculate and sort 100 million distances in 200 dimensional space in about a half hour on a typical workstation. I'm very interested to see Venkatasubramanian's work, but the concept of reducing the dimensionality of data isn't new. Indeed, the favored approach for the Netflix data is matrix factorization by descending the gradient of the error function. As a result we compress a matrix with ~8.5 billion entries into ~50 million entries, with the added bonus of predictions given by projecting the user/movie features that were factorized. All of this doesn't take more than a few hours to compute. There are quite a few other ways to compress data (SVD, PCA, FFT) and it will be interesting to see another.
DeseretNews.com encourages a civil dialogue among its readers. We welcome your thoughtful comments.
— About comments