
CS246: Mining Data Sets, Hacker News

        

            

Content

            

What is this course about? [Info Handout]

            

The course will discuss data mining and machine learning algorithms for analyzing very large amounts of data. The emphasis will be on MapReduce and Spark as tools for creating parallel algorithms that can process very large amounts of data.
Topics include: Frequent itemsets and Association rules, Near Neighbor Search in High Dimensional Data, Locality Sensitive Hashing (LSH), Dimensionality reduction, Recommendation Systems, Clustering, Link Analysis, Large-scale Supervised Machine Learning, Data streams, Mining the Web for Structured Data, Web Advertising.
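As a rough illustration of the MapReduce pattern mentioned above, the following is a minimal single-machine sketch in plain Python. It is not taken from the course materials: the function names (map_phase, shuffle_phase, reduce_phase, word_count) are hypothetical, and a real job would run on a framework such as Hadoop or Spark, which distributes the same three steps across many machines.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Group emitted values by key, as a framework would between map and reduce."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Combine all values for one key into a single count."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce in sequence over a list of documents."""
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle_phase(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


if __name__ == "__main__":
    docs = ["big data big ideas", "data streams"]
    print(word_count(docs))
```

Each map call depends only on its own document and each reduce call only on its own key, so both phases could in principle run in parallel.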

            

Previous offerings

            

The previous version of the course is CS A: Data Mining, which also included a course project. CS 823 A has now been split into two courses: CS (Winter, 3-4 Units, homework, final, no project) and CS 345 (Spring, 3 Units, project-focused). You can access class notes and slides of previous versions of the course here:

                         

            

Students are expected to have the following background:             

Knowledge of basic computer science principles and skills, at a level sufficient to write a reasonably non-trivial computer program (e.g., CS (or CS) or equivalent are recommended).

                        

          Good knowledge of Java and Python will be extremely helpful since most assignments will require the use of Spark.

Familiarity with basic probability theory (CS (or Stat) or equivalent is sufficient but not necessary).

                                

Familiarity with writing rigorous proofs (at a minimum, at the level of CS ).

Familiarity with basic linear algebra (e.g., any of Math , Math , Math , CS , or EE would be much more than necessary).
