Syllabus — Machine Learning and Data Analytics
Year I, Part I — MSNCS, IOE Pulchowk, Tribhuvan University. 4 credits.
Chapter 1 — Basics of Machine Learning (7 marks)
- 1.1 History of machine learning, definition of learning, types of learning, importance of ML
- 1.2 Statistics review: Min, Max, Mean, Mode, Median, Standard deviation, MSE
- 1.3 ML terminology: class, pattern, feature, training, validation, test data
- 1.4 Feasibility of learning — error and noise — training vs testing
- 1.5 Generalization tradeoff — bias and variance — learning curve
- 1.6 Overfitting and Underfitting
Chapter 2 — Data Analytics Process (9 marks)
- 2.1 Process of data analytics
- 2.2 Data types and attributes
- 2.3 Data pre-processing
- 2.4 Visualization and exploring data
- 2.5 Descriptive, diagnostic, predictive, prescriptive analytics
- 2.6 Architectural design patterns and stack for handling Big Data
Chapter 3 — Supervised Learning (9 marks)
- 3.1 Definition and classification problem
- 3.2 Classifiers and discriminant functions
- 3.3 Linear supervised learning models: linear regression, Perceptron
- 3.4 Learning neural network structures
- 3.5 Decision tree representation model, basic decision tree algorithm, applications
- 3.6 Support vector machines and applications
Chapter 4 — Bayesian Decision Based Learning (9 marks)
- 4.1 Bayes probability theory and conditional probability
- 4.2 Decision surfaces and classifying with Bayes decision theory
- 4.3 Bayesian belief network and applications
- 4.4 Gradient descent method
- 4.5 K-nearest neighbor
Chapter 5 — Unsupervised Learning and Dimensionality Reduction (9 marks)
- 5.1 Introduction to clustering, criterion function for clustering
- 5.2 Algorithms for clustering: K-means, hierarchical, and other methods
- 5.3 Dimensionality reduction techniques and need
- 5.4 Principal component analysis (PCA)
- 5.5 Linear discriminant analysis (LDA)
Chapter 6 — Measures for Performance Evaluation (8 marks)
- 6.1 Classification accuracy
- 6.2 Confusion matrix
- 6.3 Misclassification costs
- 6.4 Sensitivity and specificity, recall, precision, F1-score
- 6.5 ROC curve, box plot, confidence interval
- 6.6 Cross-validation
Chapter 7 — Deep Learning Basics (9 marks)
- 7.1 Definition of deep networks
- 7.2 Feed-Forward and backpropagation
- 7.3 Activation functions: Sigmoid, Tanh, ReLU, Softmax
- 7.4 Convolutional neural networks (CNN): CNN architectures
- 7.5 Recurrent neural networks (RNN): RNN architectures
- 7.6 ML applications in security: anomaly/intrusion detection, malware/phishing/fraud detection
· min read