@xdev-asia/xdev-knowledge-mcp 1.0.42 → 1.0.44

This diff represents the content of publicly available package versions that have been released to one of the supported registries. The information contained in this diff is provided for informational purposes only and reflects changes between package versions as they appear in their respective public registries.
Files changed (32)
  1. package/content/pages/xoa-du-lieu-nguoi-dung.md +68 -0
  2. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/01-phan-1-data-engineering/lessons/01-bai-1-data-repositories-ingestion.md +198 -0
  3. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/01-phan-1-data-engineering/lessons/02-bai-2-data-transformation.md +183 -0
  4. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/01-phan-1-data-engineering/lessons/03-bai-3-data-analysis.md +159 -0
  5. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/02-phan-2-modeling/lessons/04-bai-4-sagemaker-built-in-algorithms.md +186 -0
  6. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/02-phan-2-modeling/lessons/05-bai-5-training-hyperparameter-tuning.md +159 -0
  7. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/02-phan-2-modeling/lessons/06-bai-6-model-evaluation.md +169 -0
  8. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/03-phan-3-implementation-operations/lessons/07-bai-7-model-deployment.md +193 -0
  9. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/03-phan-3-implementation-operations/lessons/08-bai-8-model-monitoring-mlops.md +184 -0
  10. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/03-phan-3-implementation-operations/lessons/09-bai-9-security-cost.md +166 -0
  11. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/04-phan-4-on-tap/lessons/10-bai-10-bai-toan-thuong-gap.md +181 -0
  12. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/04-phan-4-on-tap/lessons/11-bai-11-cheat-sheet.md +110 -0
  13. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/chapters/04-phan-4-on-tap/lessons/12-bai-12-chien-luoc-thi.md +113 -0
  14. package/content/series/luyen-thi/luyen-thi-aws-ml-specialty/index.md +1 -1
  15. package/content/series/luyen-thi/luyen-thi-cka/index.md +217 -0
  16. package/content/series/luyen-thi/luyen-thi-ckad/index.md +199 -0
  17. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/01-phan-1-problem-framing/lessons/01-bai-1-framing-ml-problems.md +136 -0
  18. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/01-phan-1-problem-framing/lessons/02-bai-2-gcp-ai-ml-ecosystem.md +160 -0
  19. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/02-phan-2-data-engineering/lessons/03-bai-3-data-pipeline.md +174 -0
  20. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/02-phan-2-data-engineering/lessons/04-bai-4-feature-engineering.md +156 -0
  21. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/03-phan-3-model-development/lessons/05-bai-5-vertex-ai-training.md +155 -0
  22. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/03-phan-3-model-development/lessons/06-bai-6-bigquery-ml-tensorflow.md +141 -0
  23. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/04-phan-4-deployment-mlops/lessons/07-bai-7-model-deployment.md +134 -0
  24. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/04-phan-4-deployment-mlops/lessons/08-bai-8-vertex-ai-pipelines-mlops.md +149 -0
  25. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/05-phan-5-responsible-ai/lessons/09-bai-9-responsible-ai.md +128 -0
  26. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/chapters/05-phan-5-responsible-ai/lessons/10-bai-10-cheat-sheet-chien-luoc-thi.md +108 -0
  27. package/content/series/luyen-thi/luyen-thi-gcp-ml-engineer/index.md +1 -1
  28. package/content/series/luyen-thi/luyen-thi-kcna/index.md +168 -0
  29. package/data/quizzes/aws-ai-practitioner.json +362 -0
  30. package/data/quizzes/aws-ml-specialty.json +200 -0
  31. package/data/quizzes/gcp-ml-engineer.json +200 -0
  32. package/package.json +1 -1
@@ -0,0 +1,186 @@
1
+ ---
2
+ id: 8d704042-9cc5-478e-b198-d80ea70c22c5
3
+ title: 'Lesson 4: SageMaker Built-in Algorithms'
4
+ slug: bai-4-sagemaker-built-in-algorithms
5
+ description: >-
6
+ XGBoost, Linear Learner, Random Cut Forest, K-Means, KNN.
7
+ BlazingText, Seq2Seq, DeepAR, Object Detection, Semantic Segmentation.
8
+ When to use which algorithm: a detailed decision table.
9
+ duration_minutes: 90
10
+ is_free: true
11
+ video_url: null
12
+ sort_order: 4
13
+ section_title: "Part 2: Modeling (36%)"
14
+ course:
15
+ id: 019c9619-lt02-7002-c002-lt0200000002
16
+ title: 'Luyện thi AWS Certified Machine Learning - Specialty'
17
+ slug: luyen-thi-aws-ml-specialty
18
+ ---
19
+
20
+ <div style="text-align: center; margin: 2rem 0;">
21
+ <img src="/storage/uploads/2026/04/aws-mls-bai4-sagemaker-algorithms.png" alt="SageMaker Built-in Algorithms" style="max-width: 800px; width: 100%; border-radius: 12px;" />
22
+ <p><em>SageMaker Built-in Algorithms: from XGBoost and Linear Learner to DeepAR and Image Classification</em></p>
23
+ </div>
24
+
25
+ <h2 id="overview"><strong>1. SageMaker Built-in Algorithms Overview</strong></h2>
26
+
27
+ <p>SageMaker provides 18+ <strong>built-in algorithms</strong> that are optimized to run distributed on AWS infrastructure. This is an <strong>extremely important</strong> topic on the MLS-C01 exam, typically worth 8-12 questions.</p>
28
+
29
+ <blockquote>
30
+ <p><strong>Exam tip:</strong> Memorize the "Problem Type → Algorithm" table. The exam always presents a scenario and asks for the appropriate algorithm. Key patterns: time series → DeepAR; anomaly → Random Cut Forest; NLP classification → BlazingText; tabular → XGBoost.</p>
31
+ </blockquote>
32
+
33
+ <h2 id="supervised-table"><strong>2. Supervised Learning Algorithms</strong></h2>
34
+
35
+ <table>
36
+ <thead><tr><th>Algorithm</th><th>Problem Type</th><th>Input</th><th>Key Trait</th></tr></thead>
37
+ <tbody>
38
+ <tr><td><strong>XGBoost</strong></td><td>Classification, Regression</td><td>Tabular (CSV/LibSVM)</td><td>Top performer for tabular data, gradient boosting</td></tr>
39
+ <tr><td><strong>Linear Learner</strong></td><td>Binary/Multiclass classification, Regression</td><td>RecordIO, CSV</td><td>Fast, scalable, regularization built-in</td></tr>
40
+ <tr><td><strong>Factorization Machines</strong></td><td>Binary classification, Regression</td><td>RecordIO-protobuf (sparse)</td><td>Sparse data, recommendation systems, CTR prediction</td></tr>
41
+ <tr><td><strong>KNN (k-Nearest Neighbors)</strong></td><td>Classification, Regression</td><td>RecordIO-protobuf</td><td>Instance-based, no training, lazy learner</td></tr>
42
+ <tr><td><strong>DeepAR</strong></td><td>Time series forecasting</td><td>JSON Lines</td><td>Multiple related time series, probabilistic forecasts</td></tr>
43
+ <tr><td><strong>Object2Vec</strong></td><td>Embeddings</td><td>Paired sequences</td><td>Learns embeddings for words, products, users</td></tr>
44
+ </tbody>
45
+ </table>
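+
+ <p>To make the table concrete, below is a minimal sketch of launching the built-in <strong>XGBoost</strong> algorithm with the SageMaker Python SDK. The IAM role, S3 paths, and hyperparameters are placeholders for illustration, and the container version string may vary by region and release.</p>
+
+ <pre><code class="language-python">import sagemaker
+ from sagemaker.estimator import Estimator
+ from sagemaker.inputs import TrainingInput
+
+ session = sagemaker.Session()
+ region = session.boto_region_name
+ role = "arn:aws:iam::123456789012:role/SageMakerExecutionRole"  # placeholder role
+
+ # Resolve the managed container image for the built-in XGBoost algorithm
+ image_uri = sagemaker.image_uris.retrieve("xgboost", region, version="1.7-1")
+
+ estimator = Estimator(
+     image_uri=image_uri,
+     role=role,
+     instance_count=1,
+     instance_type="ml.m5.xlarge",
+     output_path="s3://my-bucket/models/",  # placeholder bucket
+     hyperparameters={"objective": "binary:logistic", "num_round": 100},
+ )
+
+ # Built-in XGBoost expects the label in the first CSV column, no header row
+ estimator.fit({
+     "train": TrainingInput("s3://my-bucket/train/", content_type="text/csv"),
+     "validation": TrainingInput("s3://my-bucket/validation/", content_type="text/csv"),
+ })
+ </code></pre>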
46
+
47
+ <h2 id="nlp-algorithms"><strong>3. NLP Algorithms</strong></h2>
48
+
49
+ <table>
50
+ <thead><tr><th>Algorithm</th><th>Output</th><th>Use Case</th></tr></thead>
51
+ <tbody>
52
+ <tr><td><strong>BlazingText</strong></td><td>Word vectors or text classification</td><td>Sentiment analysis, spam detection, entity classification</td></tr>
53
+ <tr><td><strong>Seq2Seq</strong></td><td>Sequence → Sequence</td><td>Machine translation, summarization, Q&amp;A</td></tr>
54
+ <tr><td><strong>LDA (Latent Dirichlet Allocation)</strong></td><td>Topics per document</td><td>Topic modeling, document categorization</td></tr>
55
+ <tr><td><strong>NTM (Neural Topic Model)</strong></td><td>Latent representations</td><td>Topic modeling with neural networks</td></tr>
56
+ </tbody>
57
+ </table>
58
+
59
+ <blockquote>
60
+ <p><strong>Exam tip:</strong> <strong>BlazingText</strong> has 2 modes: (1) <code>Word2Vec</code> mode: unsupervised, generates word embeddings; (2) <code>Text Classification</code> mode: supervised, like FastText. Distinguish the two carefully when reading the question.</p>
61
+ </blockquote>
62
+
63
+ <h2 id="unsupervised-algorithms"><strong>4. Unsupervised Learning Algorithms</strong></h2>
64
+
65
+ <table>
66
+ <thead><tr><th>Algorithm</th><th>Problem Type</th><th>Use Case</th></tr></thead>
67
+ <tbody>
68
+ <tr><td><strong>K-Means</strong></td><td>Clustering</td><td>Customer segmentation, document grouping</td></tr>
69
+ <tr><td><strong>PCA (Principal Component Analysis)</strong></td><td>Dimensionality reduction</td><td>High-dimensional data, feature compression</td></tr>
70
+ <tr><td><strong>Random Cut Forest (RCF)</strong></td><td>Anomaly detection</td><td>Fraud detection, IoT anomaly, time series anomaly</td></tr>
71
+ <tr><td><strong>IP Insights</strong></td><td>Anomaly detection</td><td>Detect unusual IP-entity relationships, security</td></tr>
72
+ </tbody>
73
+ </table>
74
+
75
+ <h2 id="computer-vision"><strong>5. Computer Vision Algorithms</strong></h2>
76
+
77
+ <table>
78
+ <thead><tr><th>Algorithm</th><th>Task</th><th>Output</th></tr></thead>
79
+ <tbody>
80
+ <tr><td><strong>Image Classification</strong></td><td>Multi-class classification</td><td>Class label + confidence</td></tr>
81
+ <tr><td><strong>Object Detection</strong></td><td>Locate + classify objects</td><td>Bounding boxes + labels</td></tr>
82
+ <tr><td><strong>Semantic Segmentation</strong></td><td>Pixel-level classification</td><td>Segmentation mask</td></tr>
83
+ </tbody>
84
+ </table>
85
+
86
+ <h2 id="algorithm-decision"><strong>6. Algorithm Selection Decision Tree</strong></h2>
87
+
88
+ <pre><code class="language-text">What is the problem type?
89
+
90
+ ├── Tabular data, classification/regression?
91
+ │ └── XGBoost (best general choice)
92
+
93
+ ├── Sparse features, recommendation, ad CTR?
94
+ │ └── Factorization Machines
95
+
96
+ ├── Time series forecasting (multiple related series)?
97
+ │ └── DeepAR
98
+
99
+ ├── Anomaly detection on time series / IoT?
100
+ │ └── Random Cut Forest (RCF)
101
+
102
+ ├── Text classification / sentiment?
103
+ │ └── BlazingText (supervised mode)
104
+
105
+ ├── Sequence-to-sequence (translation / summarization)?
106
+ │ └── Seq2Seq
107
+
108
+ ├── Topic modeling?
109
+ │ └── LDA or NTM
110
+
111
+ ├── Clustering?
112
+ │ └── K-Means
113
+
114
+ ├── Dimensionality reduction?
115
+ │ └── PCA
116
+
117
+ └── Image tasks?
118
+ ├── Classification only → Image Classification
119
+ ├── Locate objects → Object Detection
120
+ └── Pixel mask → Semantic Segmentation
121
+ </code></pre>
122
+
123
+ <h2 id="training-modes"><strong>7. Training Input Modes</strong></h2>
124
+
125
+ <table>
126
+ <thead><tr><th>Mode</th><th>How It Works</th><th>Best For</th></tr></thead>
127
+ <tbody>
128
+ <tr><td><strong>File Mode</strong></td><td>Downloads entire dataset to training instance before starting</td><td>Small to medium datasets</td></tr>
129
+ <tr><td><strong>Pipe Mode</strong></td><td>Streams data directly from S3 during training</td><td>Very large datasets — no disk bottleneck</td></tr>
130
+ <tr><td><strong>FastFile Mode</strong></td><td>Access S3 as if local file system (via FUSE)</td><td>Random access patterns</td></tr>
131
+ </tbody>
132
+ </table>
133
+
134
+ <blockquote>
135
+ <p><strong>Exam tip:</strong> When the question asks how to "reduce training time for a large dataset", the answer is usually to switch to <strong>Pipe Mode</strong> with the <strong>RecordIO format</strong>. Pipe Mode does not download the entire dataset; it streams it directly from S3.</p>
136
+ </blockquote>
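+
+ <p>As a sketch of how the input mode is chosen in the SageMaker Python SDK (the S3 path is a placeholder; converting the data to RecordIO-protobuf is not shown):</p>
+
+ <pre><code class="language-python">from sagemaker.inputs import TrainingInput
+
+ # Stream the channel from S3 instead of downloading it first (Pipe Mode)
+ train_input = TrainingInput(
+     "s3://my-bucket/train-recordio/",  # placeholder path to RecordIO-protobuf data
+     content_type="application/x-recordio-protobuf",
+     input_mode="Pipe",
+ )
+
+ # Passed to estimator.fit({"train": train_input}); the Estimator-level
+ # input_mode parameter can likewise be set to "Pipe" or "FastFile".
+ </code></pre>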
137
+
138
+ <h2 id="cheat-sheet"><strong>8. Cheat Sheet — Quick Reference</strong></h2>
139
+
140
+ <table>
141
+ <thead><tr><th>Keyword in Question</th><th>Algorithm</th></tr></thead>
142
+ <tbody>
143
+ <tr><td>"tabular data", "structured data"</td><td>XGBoost</td></tr>
144
+ <tr><td>"time series", "forecast"</td><td>DeepAR</td></tr>
145
+ <tr><td>"anomaly detection"</td><td>Random Cut Forest</td></tr>
146
+ <tr><td>"recommendation", "sparse features"</td><td>Factorization Machines</td></tr>
147
+ <tr><td>"text classification", "sentiment"</td><td>BlazingText (supervised)</td></tr>
148
+ <tr><td>"word embeddings"</td><td>BlazingText (Word2Vec mode)</td></tr>
149
+ <tr><td>"translation", "summarization"</td><td>Seq2Seq</td></tr>
150
+ <tr><td>"topic modeling"</td><td>LDA or NTM</td></tr>
151
+ <tr><td>"clustering", "segmentation"</td><td>K-Means</td></tr>
152
+ <tr><td>"dimensionality reduction"</td><td>PCA</td></tr>
153
+ <tr><td>"bounding boxes", "object detection"</td><td>Object Detection</td></tr>
154
+ <tr><td>"pixel-level", "segmentation mask"</td><td>Semantic Segmentation</td></tr>
155
+ <tr><td>"IP address anomaly", "fraud login"</td><td>IP Insights</td></tr>
156
+ </tbody>
157
+ </table>
158
+
159
+ <h2 id="practice"><strong>9. Practice Questions</strong></h2>
160
+
161
+ <p><strong>Q1:</strong> A retail company wants to forecast product demand for the next 30 days across 5,000 product categories. Which SageMaker algorithm is BEST suited?</p>
162
+ <ul>
163
+ <li>A) K-Means</li>
164
+ <li>B) Linear Learner</li>
165
+ <li>C) DeepAR ✓</li>
166
+ <li>D) Seq2Seq</li>
167
+ </ul>
168
+ <p><em>Explanation: DeepAR is specifically designed for time series forecasting across multiple related time series. It learns global patterns from all 5,000 series simultaneously, providing probabilistic forecasts. This is exactly the use case it's optimized for.</em></p>
169
+
170
+ <p><strong>Q2:</strong> An IoT system monitors server CPU usage. The team wants to detect unusual spikes automatically. Which SageMaker built-in algorithm should be used?</p>
171
+ <ul>
172
+ <li>A) XGBoost</li>
173
+ <li>B) Random Cut Forest ✓</li>
174
+ <li>C) BlazingText</li>
175
+ <li>D) PCA</li>
176
+ </ul>
177
+ <p><em>Explanation: Random Cut Forest (RCF) is SageMaker's built-in anomaly detection algorithm. It assigns an anomaly score to each data point and works well for time series anomaly detection, such as CPU usage spikes.</em></p>
178
+
179
+ <p><strong>Q3:</strong> A data scientist is training a model on a 500 GB dataset. Training is very slow because downloading data to the training instance takes too long. Which change will MOST improve performance?</p>
180
+ <ul>
181
+ <li>A) Switch from CSV to JSON format</li>
182
+ <li>B) Increase the training instance size</li>
183
+ <li>C) Switch to Pipe Mode with RecordIO-protobuf format ✓</li>
184
+ <li>D) Add more training epochs</li>
185
+ </ul>
186
+ <p><em>Explanation: Pipe Mode streams data directly from S3 during training without downloading it first, eliminating the I/O bottleneck for large datasets. Combined with RecordIO-protobuf format, it dramatically reduces startup time.</em></p>
@@ -0,0 +1,159 @@
1
+ ---
2
+ id: 8a7a5367-e4a4-4796-8aab-68326c1dc574
3
+ title: 'Lesson 5: Training & Hyperparameter Tuning'
4
+ slug: bai-5-training-hyperparameter-tuning
5
+ description: >-
6
+ SageMaker Training Jobs: instance types, Pipe Mode vs File Mode.
7
+ Distributed training: data parallelism vs model parallelism.
8
+ Automatic Model Tuning (HPO): Bayesian vs Random vs Grid search.
9
+ Spot Instance Training to reduce costs.
10
+ duration_minutes: 60
11
+ is_free: true
12
+ video_url: null
13
+ sort_order: 5
14
+ section_title: "Part 2: Modeling (36%)"
15
+ course:
16
+ id: 019c9619-lt02-7002-c002-lt0200000002
17
+ title: 'Luyện thi AWS Certified Machine Learning - Specialty'
18
+ slug: luyen-thi-aws-ml-specialty
19
+ ---
20
+
21
+ <div style="text-align: center; margin: 2rem 0;">
22
+ <img src="/storage/uploads/2026/04/aws-mls-bai5-training-hpo.png" alt="SageMaker Training & Hyperparameter Tuning" style="max-width: 800px; width: 100%; border-radius: 12px;" />
23
+ <p><em>SageMaker Training Jobs & Hyperparameter Tuning: distributed training, Spot Instances, and HPO strategies</em></p>
24
+ </div>
25
+
26
+ <h2 id="training-jobs"><strong>1. SageMaker Training Jobs</strong></h2>
27
+
28
+ <p><strong>SageMaker Training Jobs</strong> run ML training code on managed compute infrastructure. Training happens on ephemeral instances; you pay only while the job runs.</p>
29
+
30
+ <pre><code class="language-text">Training Job Lifecycle:
31
+
32
+ Submit Job ──→ Provision Instances ──→ Download Data
33
+
34
+ Run Training Code
35
+
36
+ Save Model to S3
37
+
38
+ Terminate Instances
39
+ </code></pre>
40
+
41
+ <h2 id="instance-types"><strong>2. Instance Types cho Training</strong></h2>
42
+
43
+ <table>
44
+ <thead><tr><th>Instance Family</th><th>Hardware</th><th>Best For</th></tr></thead>
45
+ <tbody>
46
+ <tr><td><strong>ml.c5</strong></td><td>CPU optimized</td><td>Tabular ML, XGBoost, sklearn</td></tr>
47
+ <tr><td><strong>ml.m5</strong></td><td>General purpose CPU</td><td>Light training, data processing</td></tr>
48
+ <tr><td><strong>ml.p3</strong></td><td>V100 GPU</td><td>Deep learning training</td></tr>
49
+ <tr><td><strong>ml.p4d</strong></td><td>A100 GPU (8x)</td><td>Large-scale DL, distributed training</td></tr>
50
+ <tr><td><strong>ml.g4dn</strong></td><td>T4 GPU (cost-effective)</td><td>Small-medium DL models</td></tr>
51
+ <tr><td><strong>ml.trn1</strong></td><td>AWS Trainium</td><td>LLM training, cost optimization</td></tr>
52
+ </tbody>
53
+ </table>
54
+
55
+ <h2 id="distributed-training"><strong>3. Distributed Training</strong></h2>
56
+
57
+ <p>When the model or the dataset is too large for a single instance, you need <strong>distributed training</strong> across multiple instances.</p>
58
+
59
+ <table>
60
+ <thead><tr><th>Strategy</th><th>How It Works</th><th>When to Use</th></tr></thead>
61
+ <tbody>
62
+ <tr><td><strong>Data Parallelism</strong></td><td>Each instance holds a full copy of the model, trains on a subset of the data, and syncs gradients</td><td>Dataset too large; model still fits on 1 GPU</td></tr>
63
+ <tr><td><strong>Model Parallelism</strong></td><td>The model is split across instances; each instance holds one part</td><td>Model too large for 1 GPU (LLMs)</td></tr>
64
+ </tbody>
65
+ </table>
66
+
67
+ <pre><code class="language-text">Data Parallelism:
68
+
69
+ Instance 1 [Full Model] ──→ Train on data shard A ──→ ↓
70
+ Instance 2 [Full Model] ──→ Train on data shard B ──→ ↓ AllReduce
71
+ Instance 3 [Full Model] ──→ Train on data shard C ──→ ↓ (sync gradients)
72
+
73
+ Updated Model Weights
74
+
75
+ Model Parallelism:
76
+
77
+ Instance 1 [Layers 1-4] ──→ forward pass ──→
78
+ Instance 2 [Layers 5-8] ──→ forward pass ──→
79
+ Instance 3 [Layers 9-12] ──→ forward pass ──→ output
80
+ </code></pre>
81
+
82
+ <blockquote>
83
+ <p><strong>Exam tip:</strong> SageMaker provides the <strong>SageMaker Distributed</strong> library with 2 modules: (1) <code>smdistributed.dataparallel</code>: optimized AllReduce; (2) <code>smdistributed.modelparallel</code>: automatic pipeline parallelism. When the question mentions "large model training" → model parallelism.</p>
84
+ </blockquote>
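+
+ <p>As an illustration, the data-parallel module is typically switched on through the estimator's <code>distribution</code> argument. A sketch using the PyTorch estimator follows; the training script, instance choice, and version strings are placeholders, and the exact configuration keys should be confirmed against the current SDK documentation.</p>
+
+ <pre><code class="language-python">from sagemaker.pytorch import PyTorch
+
+ estimator = PyTorch(
+     entry_point="train.py",  # placeholder training script
+     role="arn:aws:iam::123456789012:role/SageMakerExecutionRole",
+     framework_version="2.0",
+     py_version="py310",
+     instance_count=2,
+     instance_type="ml.p4d.24xlarge",
+     # Enable the SageMaker data parallel library (smdistributed.dataparallel)
+     distribution={"smdistributed": {"dataparallel": {"enabled": True}}},
+ )
+
+ estimator.fit("s3://my-bucket/train/")  # placeholder S3 input
+ </code></pre>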
85
+
86
+ <h2 id="hpo"><strong>4. Automatic Model Tuning (HPO)</strong></h2>
87
+
88
+ <p><strong>Hyperparameter Optimization (HPO)</strong> automatically finds the best hyperparameters by running many training jobs with different configurations.</p>
89
+
90
+ <table>
91
+ <thead><tr><th>Strategy</th><th>How It Works</th><th>Tradeoff</th></tr></thead>
92
+ <tbody>
93
+ <tr><td><strong>Random Search</strong></td><td>Randomly samples hyperparameters from the range</td><td>Fast, good baseline</td></tr>
94
+ <tr><td><strong>Grid Search</strong></td><td>Try all combinations</td><td>Exhaustive, expensive, bad for large spaces</td></tr>
95
+ <tr><td><strong>Bayesian Optimization</strong></td><td>Builds a probabilistic model of the outcome and suggests the best next config</td><td>Efficient, learns from previous trials; SageMaker default</td></tr>
96
+ <tr><td><strong>Hyperband</strong></td><td>Early-stop poorly performing trials</td><td>Resource-efficient, fast</td></tr>
97
+ </tbody>
98
+ </table>
99
+
100
+ <blockquote>
101
+ <p><strong>Exam tip:</strong> SageMaker AMT (Automatic Model Tuning) uses <strong>Bayesian Optimization</strong> by default. It LOOKS AT THE RESULTS of previous jobs to suggest the next hyperparameter set: an intelligent search, not brute force.</p>
102
+ </blockquote>
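+
+ <p>A minimal Automatic Model Tuning sketch with the SageMaker Python SDK. It assumes an existing <code>estimator</code> and input channels; the objective metric and ranges shown match the built-in XGBoost algorithm and are illustrative only.</p>
+
+ <pre><code class="language-python">from sagemaker.tuner import HyperparameterTuner, ContinuousParameter, IntegerParameter
+
+ tuner = HyperparameterTuner(
+     estimator=estimator,  # an already configured SageMaker Estimator
+     objective_metric_name="validation:auc",  # emitted by built-in XGBoost
+     objective_type="Maximize",
+     hyperparameter_ranges={
+         "eta": ContinuousParameter(0.01, 0.3),
+         "max_depth": IntegerParameter(3, 10),
+     },
+     strategy="Bayesian",  # the default strategy
+     max_jobs=20,
+     max_parallel_jobs=2,
+ )
+
+ # train_input / validation_input are assumed TrainingInput channels
+ tuner.fit({"train": train_input, "validation": validation_input})
+ </code></pre>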
103
+
104
+ <h2 id="spot-training"><strong>5. Spot Instance Training</strong></h2>
105
+
106
+ <p>SageMaker supports <strong>EC2 Spot Instances</strong> for training jobs, saving up to <strong>90% in cost</strong> compared to On-Demand.</p>
107
+
108
+ <table>
109
+ <thead><tr><th>Feature</th><th>Detail</th></tr></thead>
110
+ <tbody>
111
+ <tr><td><strong>MaxWaitTimeInSeconds</strong></td><td>Maximum time to wait for Spot capacity</td></tr>
112
+ <tr><td><strong>Checkpointing</strong></td><td>Saves the model to S3 periodically; the job can resume after an interruption</td></tr>
113
+ <tr><td><strong>use_spot_instances=True</strong></td><td>Parameter in the SageMaker Estimator</td></tr>
114
+ </tbody>
115
+ </table>
116
+
117
+ <blockquote>
118
+ <p><strong>Exam tip:</strong> When the question asks how to "reduce training costs", the answer is usually <strong>Spot Instances with checkpointing</strong>. Checkpointing matters so you don't lose progress when a Spot instance is terminated.</p>
119
+ </blockquote>
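+
+ <p>A sketch of a managed Spot training configuration on a generic Estimator; <code>image_uri</code> and <code>role</code> are assumed from earlier, the checkpoint path is a placeholder, and the time limits are illustrative.</p>
+
+ <pre><code class="language-python">from sagemaker.estimator import Estimator
+
+ estimator = Estimator(
+     image_uri=image_uri,
+     role=role,
+     instance_count=1,
+     instance_type="ml.p3.2xlarge",
+     use_spot_instances=True,   # request Spot capacity for the training job
+     max_run=3600,              # maximum training seconds
+     max_wait=7200,             # maximum total seconds, including waiting for Spot capacity
+     checkpoint_s3_uri="s3://my-bucket/checkpoints/",  # resume point after interruption
+     output_path="s3://my-bucket/models/",
+ )
+ </code></pre>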
120
+
121
+ <h2 id="bias-variance"><strong>6. Bias-Variance Tradeoff</strong></h2>
122
+
123
+ <table>
124
+ <thead><tr><th>Issue</th><th>Symptom</th><th>Cause</th><th>Solution</th></tr></thead>
125
+ <tbody>
126
+ <tr><td><strong>High Bias (Underfitting)</strong></td><td>High train error, high test error</td><td>Model too simple</td><td>Increase model complexity, add features, reduce regularization</td></tr>
127
+ <tr><td><strong>High Variance (Overfitting)</strong></td><td>Low train error, high test error</td><td>Model too complex</td><td>Add data, dropout, regularization, feature selection</td></tr>
128
+ <tr><td><strong>Balanced</strong></td><td>Low train error, low test error (close together)</td><td>Good fit</td><td>Deploy model</td></tr>
129
+ </tbody>
130
+ </table>
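+
+ <p>In practice, the quickest way to read this table is to compare training and test scores. A small scikit-learn sketch on synthetic data:</p>
+
+ <pre><code class="language-python">from sklearn.datasets import make_classification
+ from sklearn.ensemble import RandomForestClassifier
+ from sklearn.model_selection import train_test_split
+
+ X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
+ X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.3, random_state=0)
+
+ model = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_train, y_train)
+
+ train_acc = model.score(X_train, y_train)
+ test_acc = model.score(X_test, y_test)
+
+ # A large gap (e.g., 0.99 train vs 0.80 test) points to overfitting / high variance;
+ # both scores low points to underfitting / high bias.
+ print(f"train={train_acc:.3f} test={test_acc:.3f}")
+ </code></pre>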
131
+
132
+ <h2 id="practice"><strong>7. Practice Questions</strong></h2>
133
+
134
+ <p><strong>Q1:</strong> A company is training a large deep learning model that doesn't fit on a single GPU instance. Which SageMaker distributed training strategy should they use?</p>
135
+ <ul>
136
+ <li>A) Data parallelism</li>
137
+ <li>B) Model parallelism ✓</li>
138
+ <li>C) Pipeline parallelism only</li>
139
+ <li>D) Increase batch size</li>
140
+ </ul>
141
+ <p><em>Explanation: Model parallelism splits the model itself across multiple GPU instances, allowing training of models too large to fit in a single GPU's memory. Data parallelism keeps a full model copy on each instance, which doesn't help when the model itself is too large.</em></p>
142
+
143
+ <p><strong>Q2:</strong> A team wants to minimize the cost of running 500 hyperparameter tuning jobs. Training can tolerate interruptions. What is the MOST cost-effective approach?</p>
144
+ <ul>
145
+ <li>A) Use larger instances to run jobs faster</li>
146
+ <li>B) Use Spot Instances with checkpointing enabled ✓</li>
147
+ <li>C) Use Grid Search instead of Bayesian Optimization</li>
148
+ <li>D) Reduce the number of epochs</li>
149
+ </ul>
150
+ <p><em>Explanation: Spot Instances can save up to 90% compared to On-Demand pricing. With checkpointing enabled, interrupted jobs save their state to S3 and can resume, making Spot Instances practical for long HPO jobs.</em></p>
151
+
152
+ <p><strong>Q3:</strong> A model achieves 95% accuracy on training data but only 62% on the test set. What problem does this indicate?</p>
153
+ <ul>
154
+ <li>A) Underfitting / High bias</li>
155
+ <li>B) Overfitting / High variance ✓</li>
156
+ <li>C) Data leakage</li>
157
+ <li>D) Class imbalance</li>
158
+ </ul>
159
+ <p><em>Explanation: The large gap between training accuracy (95%) and test accuracy (62%) is a classic sign of overfitting (high variance). The model memorized the training data but fails to generalize. Solutions: more data, regularization (L1/L2, dropout), reduce model complexity.</em></p>
@@ -0,0 +1,169 @@
1
+ ---
2
+ id: 53fa302d-d4b6-483f-af7d-5c9b26bbf21e
3
+ title: 'Lesson 6: Model Evaluation & Validation'
4
+ slug: bai-6-model-evaluation
5
+ description: >-
6
+ Metrics: Accuracy, Precision, Recall, F1, AUC-ROC, RMSE, MAE, R².
7
+ Confusion Matrix. Cross-validation strategies.
8
+ SageMaker Clarify cho bias detection & explainability.
9
+ A/B testing với Production Variants.
10
+ duration_minutes: 60
11
+ is_free: true
12
+ video_url: null
13
+ sort_order: 6
14
+ section_title: "Part 2: Modeling (36%)"
15
+ course:
16
+ id: 019c9619-lt02-7002-c002-lt0200000002
17
+ title: 'Luyện thi AWS Certified Machine Learning - Specialty'
18
+ slug: luyen-thi-aws-ml-specialty
19
+ ---
20
+
21
+ <div style="text-align: center; margin: 2rem 0;">
22
+ <img src="/storage/uploads/2026/04/aws-mls-bai6-model-evaluation.png" alt="Model Evaluation Metrics" style="max-width: 800px; width: 100%; border-radius: 12px;" />
23
+ <p><em>Model Evaluation: classification metrics (AUC-ROC, F1), regression metrics (RMSE, MAE), and the Confusion Matrix</em></p>
24
+ </div>
25
+
26
+ <h2 id="classification-metrics"><strong>1. Classification Metrics</strong></h2>
27
+
28
+ <p>Choosing the right metric is one of the most important skills of an ML Engineer. The MLS-C01 exam typically presents a scenario and asks for the appropriate metric.</p>
29
+
30
+ <h3 id="confusion-matrix"><strong>1.1. Confusion Matrix</strong></h3>
31
+
32
+ <pre><code class="language-text"> Predicted
33
+ Positive Negative
34
+ Actual Positive │ TP │ FN │ ← Recall = TP / (TP + FN)
35
+ Negative │ FP │ TN │
36
+
37
+ Precision = TP / (TP + FP) ← of all predicted positive, how many are correct?
38
+ Recall = TP / (TP + FN) ← of all actual positive, how many did we catch?
39
+ F1 Score = 2 × (P × R) / (P + R) ← harmonic mean
40
+ Accuracy = (TP + TN) / Total
41
+ </code></pre>
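+
+ <p>The same quantities map directly to scikit-learn functions; a small sketch with dummy labels:</p>
+
+ <pre><code class="language-python">from sklearn.metrics import (accuracy_score, confusion_matrix, f1_score,
+                              precision_score, recall_score, roc_auc_score)
+
+ y_true = [1, 0, 1, 1, 0, 0, 1, 0]    # actual labels
+ y_pred = [1, 0, 0, 1, 0, 1, 1, 0]    # predicted labels
+ y_score = [0.9, 0.2, 0.4, 0.8, 0.1, 0.6, 0.7, 0.3]  # predicted probabilities
+
+ tn, fp, fn, tp = confusion_matrix(y_true, y_pred).ravel()
+ print("precision:", precision_score(y_true, y_pred))  # TP / (TP + FP)
+ print("recall:   ", recall_score(y_true, y_pred))     # TP / (TP + FN)
+ print("f1:       ", f1_score(y_true, y_pred))
+ print("accuracy: ", accuracy_score(y_true, y_pred))
+ print("auc-roc:  ", roc_auc_score(y_true, y_score))   # uses scores, not hard labels
+ </code></pre>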
42
+
43
+ <table>
44
+ <thead><tr><th>Metric</th><th>Optimize When</th><th>Real-World Example</th></tr></thead>
45
+ <tbody>
46
+ <tr><td><strong>Precision</strong></td><td>FP cost is high — don't want false alarms</td><td>Spam filter (don't block legitimate email)</td></tr>
47
+ <tr><td><strong>Recall (Sensitivity)</strong></td><td>FN cost is high — don't miss positives</td><td>Cancer detection (find all cancer patients)</td></tr>
48
+ <tr><td><strong>F1 Score</strong></td><td>Balance Precision and Recall, imbalanced data</td><td>Fraud detection</td></tr>
49
+ <tr><td><strong>Accuracy</strong></td><td>Balanced classes only</td><td>Multi-class, balanced datasets</td></tr>
50
+ <tr><td><strong>AUC-ROC</strong></td><td>Ranking quality, threshold-independent</td><td>Credit scoring, ad ranking</td></tr>
51
+ <tr><td><strong>PR-AUC</strong></td><td>Imbalanced, care about minority class</td><td>Fraud, medical diagnoses</td></tr>
52
+ </tbody>
53
+ </table>
54
+
55
+ <blockquote>
56
+ <p><strong>Exam tip:</strong> Common scenarios: "Medical diagnosis, missing cancer is worse than a false positive" → optimize <strong>Recall</strong>. "Spam detector, blocking good emails is bad" → optimize <strong>Precision</strong>. Imbalanced data → use <strong>F1 or AUC-ROC</strong>, not Accuracy.</p>
57
+ </blockquote>
58
+
59
+ <h2 id="regression-metrics"><strong>2. Regression Metrics</strong></h2>
60
+
61
+ <table>
62
+ <thead><tr><th>Metric</th><th>Formula</th><th>Sensitivity to Outliers</th><th>Use Case</th></tr></thead>
63
+ <tbody>
64
+ <tr><td><strong>RMSE</strong></td><td>√(mean(errors²))</td><td>High — penalizes large errors</td><td>When large errors are unacceptable (price prediction)</td></tr>
65
+ <tr><td><strong>MAE</strong></td><td>mean(|errors|)</td><td>Low — weights all errors equally</td><td>Robust to outliers, demand forecasting</td></tr>
66
+ <tr><td><strong>R² (R-squared)</strong></td><td>1 - SS_res/SS_tot</td><td>Medium</td><td>Proportion of variance explained (0–1)</td></tr>
67
+ <tr><td><strong>MAPE</strong></td><td>mean(|error/actual|×100)</td><td>High when actuals near 0</td><td>Percentage error, easy business interpretation</td></tr>
68
+ </tbody>
69
+ </table>
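+
+ <p>For reference, the regression metrics above also map to scikit-learn one-liners; a minimal sketch with one deliberately large error to show how RMSE and MAE diverge:</p>
+
+ <pre><code class="language-python">import numpy as np
+ from sklearn.metrics import mean_absolute_error, mean_squared_error, r2_score
+
+ y_true = np.array([200_000, 310_000, 150_000, 420_000])
+ y_pred = np.array([210_000, 290_000, 160_000, 500_000])  # one outlier-like 80k error
+
+ rmse = np.sqrt(mean_squared_error(y_true, y_pred))  # penalizes the large error heavily
+ mae = mean_absolute_error(y_true, y_pred)           # weights all errors equally
+ r2 = r2_score(y_true, y_pred)
+
+ print(f"RMSE={rmse:.0f}  MAE={mae:.0f}  R2={r2:.3f}")
+ </code></pre>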
70
+
71
+ <h2 id="cross-validation"><strong>3. Cross-Validation</strong></h2>
72
+
73
+ <table>
74
+ <thead><tr><th>Strategy</th><th>How It Works</th><th>Best For</th></tr></thead>
75
+ <tbody>
76
+ <tr><td><strong>Hold-out Split</strong></td><td>Train/Val/Test split (e.g., 70/15/15)</td><td>Large datasets, fast evaluation</td></tr>
77
+ <tr><td><strong>K-Fold CV</strong></td><td>K subsets, train on K-1, evaluate on 1, repeat K times</td><td>Medium datasets, robust estimate</td></tr>
78
+ <tr><td><strong>Stratified K-Fold</strong></td><td>Same as K-Fold but maintains class proportions each fold</td><td>Imbalanced classification</td></tr>
79
+ <tr><td><strong>Leave-One-Out (LOOCV)</strong></td><td>N-fold (each sample is test once)</td><td>Very small datasets</td></tr>
80
+ <tr><td><strong>Time-Series Split</strong></td><td>Training window grows forward — no future data in training</td><td>Time series data</td></tr>
81
+ </tbody>
82
+ </table>
83
+
84
+ <blockquote>
85
+ <p><strong>Exam tip:</strong> Time series data MUST use <strong>time-based splits</strong>; you cannot shuffle and then use ordinary K-Fold, because that leaks future data into training.</p>
86
+ </blockquote>
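+
+ <p>A sketch contrasting Stratified K-Fold for imbalanced classification with a time-ordered split, using scikit-learn:</p>
+
+ <pre><code class="language-python">import numpy as np
+ from sklearn.model_selection import StratifiedKFold, TimeSeriesSplit
+
+ X = np.arange(100).reshape(-1, 1)
+ y = (np.arange(100) % 10 == 0).astype(int)  # imbalanced labels
+
+ # Stratified K-Fold keeps the class ratio in every fold
+ skf = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
+ for train_idx, val_idx in skf.split(X, y):
+     pass  # train on train_idx, evaluate on val_idx
+
+ # Time series split: each validation fold comes strictly after its training window
+ for train_idx, val_idx in TimeSeriesSplit(n_splits=5).split(X):
+     pass  # no future data leaks into training
+ </code></pre>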
87
+
88
+ <h2 id="clarify"><strong>4. SageMaker Clarify — Bias & Explainability</strong></h2>
89
+
90
+ <p><strong>SageMaker Clarify</strong> detects bias in data and models and provides model explainability using <strong>SHAP values</strong>.</p>
91
+
92
+ <table>
93
+ <thead><tr><th>Feature</th><th>What It Does</th><th>Output</th></tr></thead>
94
+ <tbody>
95
+ <tr><td><strong>Pre-training bias detection</strong></td><td>Analyzes raw data before training</td><td>Bias metrics: CI, DPL, KL, JS</td></tr>
96
+ <tr><td><strong>Post-training bias detection</strong></td><td>Evaluates model predictions for bias</td><td>Metrics: DPPL, DI, DCO, RD</td></tr>
97
+ <tr><td><strong>Model Explainability</strong></td><td>SHAP values for feature importance</td><td>Feature weight contribution per prediction</td></tr>
98
+ </tbody>
99
+ </table>
100
+
101
+ <pre><code class="language-text">SHAP Explainability Example (Loan Approval):
102
+
103
+ Feature SHAP Value Contribution
104
+ ─────────────────────────────────────────────
105
+ credit_score +0.42 ↑ approval
106
+ income +0.28 ↑ approval
107
+ debt_ratio -0.35 ↓ approval
108
+ employment_years +0.15 ↑ approval
109
+ age -0.02 minimal impact
110
+ </code></pre>
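+
+ <p>Clarify computes these attributions inside a processing job; conceptually, the same numbers can be reproduced with the open-source <code>shap</code> package, shown here purely as an illustration (the model and features are dummies, and this is not the Clarify API):</p>
+
+ <pre><code class="language-python">import shap
+ import xgboost as xgb
+ from sklearn.datasets import make_classification
+
+ X, y = make_classification(n_samples=500, n_features=5, random_state=0)
+ model = xgb.XGBClassifier(n_estimators=50).fit(X, y)
+
+ explainer = shap.TreeExplainer(model)
+ shap_values = explainer.shap_values(X)  # one attribution per feature per prediction
+
+ # Positive values push the prediction toward the positive class,
+ # negative values push it away, as in the loan-approval table above.
+ print(shap_values[0])
+ </code></pre>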
111
+
112
+ <h2 id="production-variants"><strong>5. A/B Testing với Production Variants</strong></h2>
113
+
114
+ <p>SageMaker Endpoints support <strong>Production Variants</strong>: running multiple model versions at the same time with traffic splitting.</p>
115
+
116
+ <pre><code class="language-text">Endpoint with A/B Testing:
117
+
118
+ ┌──────────────────────────────┐
119
+ Request ─→ SageMaker Endpoint │
120
+ │ │
121
+ │ Variant A (v1): 80% traffic │──→ Model v1 (current)
122
+ │ Variant B (v2): 20% traffic │──→ Model v2 (candidate)
123
+ └──────────────────────────────┘
124
+
125
+ Compare metrics, shift traffic gradually
126
+ </code></pre>
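+
+ <p>A minimal boto3 sketch of an endpoint config with two production variants and a gradual traffic shift; model, config, and endpoint names are placeholders:</p>
+
+ <pre><code class="language-python">import boto3
+
+ sm = boto3.client("sagemaker")
+
+ sm.create_endpoint_config(
+     EndpointConfigName="demand-model-ab-config",  # placeholder name
+     ProductionVariants=[
+         {"VariantName": "variant-a", "ModelName": "model-v1",  # current model
+          "InstanceType": "ml.m5.large", "InitialInstanceCount": 1,
+          "InitialVariantWeight": 0.8},
+         {"VariantName": "variant-b", "ModelName": "model-v2",  # candidate model
+          "InstanceType": "ml.m5.large", "InitialInstanceCount": 1,
+          "InitialVariantWeight": 0.2},
+     ],
+ )
+
+ # Shift traffic toward the candidate once its metrics look good
+ sm.update_endpoint_weights_and_capacities(
+     EndpointName="demand-model-endpoint",  # placeholder name
+     DesiredWeightsAndCapacities=[
+         {"VariantName": "variant-a", "DesiredWeight": 0.5},
+         {"VariantName": "variant-b", "DesiredWeight": 0.5},
+     ],
+ )
+ </code></pre>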
127
+
128
+ <h2 id="cheat-sheet"><strong>6. Cheat Sheet — Evaluation Metrics</strong></h2>
129
+
130
+ <table>
131
+ <thead><tr><th>Scenario</th><th>Best Metric</th></tr></thead>
132
+ <tbody>
133
+ <tr><td>Medical diagnosis (FN is critical)</td><td>Recall (Sensitivity)</td></tr>
134
+ <tr><td>Spam filter (FP is critical)</td><td>Precision</td></tr>
135
+ <tr><td>Imbalanced fraud detection</td><td>F1 Score, AUC-ROC</td></tr>
136
+ <tr><td>House price prediction (outliers matter)</td><td>RMSE</td></tr>
137
+ <tr><td>Demand forecasting (robust)</td><td>MAE</td></tr>
138
+ <tr><td>Explain individual prediction</td><td>SHAP (via SageMaker Clarify)</td></tr>
139
+ </tbody>
140
+ </table>
141
+
142
+ <h2 id="practice"><strong>7. Practice Questions</strong></h2>
143
+
144
+ <p><strong>Q1:</strong> A hospital wants to build a model to detect early-stage cancer. Missing an actual cancer case is more dangerous than a false positive. Which metric should be OPTIMIZED?</p>
145
+ <ul>
146
+ <li>A) Precision</li>
147
+ <li>B) Recall ✓</li>
148
+ <li>C) Accuracy</li>
149
+ <li>D) RMSE</li>
150
+ </ul>
151
+ <p><em>Explanation: Recall = TP / (TP + FN). Optimizing Recall minimizes False Negatives (missed cancer cases), which is the critical concern here. Precision optimizes against False Positives, Accuracy is misleading for imbalanced medical data, and RMSE is for regression.</em></p>
152
+
153
+ <p><strong>Q2:</strong> A company wants to gradually test a new model version in production while keeping the existing model as fallback. Which SageMaker feature provides this capability?</p>
154
+ <ul>
155
+ <li>A) SageMaker Experiments</li>
156
+ <li>B) SageMaker Pipelines</li>
157
+ <li>C) Production Variants on SageMaker Endpoints ✓</li>
158
+ <li>D) SageMaker Model Monitor</li>
159
+ </ul>
160
+ <p><em>Explanation: SageMaker Endpoints support Production Variants, allowing multiple model versions to run simultaneously with configurable traffic weights. This enables A/B testing and canary deployments without downtime.</em></p>
161
+
162
+ <p><strong>Q3:</strong> A model for predicting house prices has RMSE=50,000 and MAE=20,000. What does this indicate?</p>
163
+ <ul>
164
+ <li>A) High bias</li>
165
+ <li>B) Data leakage</li>
166
+ <li>C) Outliers driving up RMSE ✓</li>
167
+ <li>D) Underfitting</li>
168
+ </ul>
169
+ <p><em>Explanation: When RMSE is significantly higher than MAE, it indicates outliers — since RMSE squares errors, it penalizes large errors much more than MAE. The gap (50k vs 20k) suggests some predictions have very large errors (outliers in target variable).</em></p>