What is ML, types of learning, the ML pipeline, and setting up your Python environment
Machine Learning (ML) is a subset of Artificial Intelligence that enables computers to learn from data without being explicitly programmed. Instead of writing rules, you provide examples and let the algorithm discover patterns.
"Machine learning is the field of study that gives computers the ability to learn without being explicitly programmed." — Arthur Samuel, 1959
The algorithm learns from labelled data — each example has an input and a known correct output.
| Task | Example | Algorithm |
|---|---|---|
| Classification | Email spam detection | Logistic Regression, SVM |
| Regression | House price prediction | Linear Regression, Random Forest |
The algorithm finds hidden patterns in unlabelled data.
An agent learns by trial and error, receiving rewards for good actions.
Data Collection → Data Cleaning → Feature Engineering
→ Model Selection → Training → Evaluation → Deployment
Data Collection → Data Cleaning → Feature Engineering
→ Model Selection → Training → Evaluation → Deployment
| Library | Purpose |
|---|---|
| NumPy | Numerical computing, arrays |
| Pandas | Data manipulation, DataFrames |
| Matplotlib/Seaborn | Data visualisation |
| scikit-learn | ML algorithms, preprocessing |
| TensorFlow/Keras | Deep learning |
Run the code below to see a simple pattern recognition:
to use AI code explanations