Approximate credit structure:
| Midsem exam | 20% | Mar 6 |
| Endsem exam | 30% | Apr 20-30 |
| Project | 25% | Announced last week of feb, design report: 7th Mar, final report and demo: end-april |
| Three short surprise quizzes | 10% | best two of three quizzes used for grading. All quizzes will be surprise and there will be no compensation for missed quizzes except under very special circumstances. |
| Two homeworks | 15% | Announced mid-jan and first-week-feb |
| Topic | Classes | Dates |
| Overview: Data warehousing, OLAP and Data mining [ FPSSU96, CD97] | 1 | Jan |
| Classification: | 7 | |
| Decision tree learning: construction, performance [chapter 3Mit97 ,SAM96] | 1 | Jan |
| Issues: tree pruning methods, missing values, continuous classes etc [chapter 3,Mit97] Domingos' paper on Occam Razor | 1 | 2 Feb |
| Instance-based learning [chapter 8,Mit97]: K-NN, Similarity indexing | 1 | |
| Baysesian learning (Text classification) [chapter 6, Mit97] | 1 | |
| Support Vector Machines, SVM Applet | 1 | |
| Neural networks: Tom Mitchell's book Chapter 4 | 1 | Feb |
| Meta learning: Dietterich's survey article, Bagging, | 1 | 9 Feb |
| Discussion of Projects | 1 | 1 Feb |
| Automatic Information extraction: Rule-based methods | 1 | 16 Feb |
| Evaluating learning methods, choosing between different models (Lift curves, cost-based etc) | 1 | |
| Clustering: | 3 | |
| Clustering methods (Chapter from book in IT folder and 6.11.6 from Mitchell's book for the EM algorithm) | 1 | |
| Ways of scaling clustering algorithms [BFR98], Birch Clustering paper | 1 | |
| Semi-supervised and active learning | <1> | |
| Case study for classification learning: KDD Cup | 1 | Feb |
| Interesting itemset mining: | 2 | |
| Basic framework and algorithms [AMS+] | 16 March | |
| Variants for sequential : [ SA96, CSD98 Temporal mining] | 21 March | |
| Interesting case study: Intrusion detection [LSM99], Mining telecommunications data | 1 | 27 March |
| Warehousing: data warehousing overviewCD97 web warehousing | 1 | 28 March |
| Schema integration and data cleaning, Deduplication, Merge/Purge paper, class notes , active learning in deduplication | 1.5 | 28 Mar/3 Apr |
| Data marts: Multidimensional databases (OLAP) | 1 | 4 Apr |
| Advanced topics: Integrating OLAP and mining [Sar99], Online aggregation | 1 | 4/10 Apr |
| Recap, future and visions. | 1 | 17 April |
| Project presentations | - | 18 April |
| Project demos: | - | April 20 |
List of papers: