Machine Learning Fundamentals
Core Components of Machine Learning
Every ML system has these essential parts:
- Data: Raw information to learn from
- Model: System to transform data
- Objective Function: Measures performance
- Algorithm: Optimizes model parameters
1. Data Science Fundamentals
Data Components
- Features (inputs/covariates)
- Labels (targets for prediction)
Data Types Examples
Images
- RGB values (200×200×3 = 120,000 values)
- Variable resolutions
Healthcare
- Patient vitals
- Medical history
- Treatment records
Data Quality
- Quality > Quantity
- "Garbage in, garbage out"
- Must check for bias
- Need representative samples
2. Models
Types
- Simple models for basic tasks
- Deep learning for complex problems
- Multiple transformations chained together
Selection Criteria
- Problem complexity
- Data availability
- Resource constraints
3. Objective Functions
Characteristics
- Measure model performance
- Lower values = better
- Called "loss functions"
Common Types
- Regression: Squared error
- Classification: Error rate
Data Split
- Training set: Learning
- Test set: Evaluation
- Avoid overfitting
4. Optimization
Gradient Descent
- Iterative parameter updates
- Minimizes loss function
- Small step-by-step improvements
ML Problem Types
1. Supervised Learning
- Regression
- Classification
- Tagging
- Search
- Recommender Systems
- Sequence Learning
2. Unsupervised Learning
- Clustering
- Subspace estimation
3. Environmental Interaction
- Offline learning
- Distribution shift challenges
4. Reinforcement Learning
- Agent-environment interaction
- Action-based learning
Historical Background
Early Foundations
- Bernoulli distribution (1655-1705)
- Gaussian distribution (1777-1855)
- Early statistical methods
Key Contributors
Ronald Fisher
- Statistical foundations
- Iris dataset
Claude Shannon
- Information theory
Alan Turing
- Computation theory
- AI testing concepts
Donald Hebb
- Neural learning principles
Neural Networks Evolution
- Biological inspiration
- Alternating layer structure
- Backpropagation importance
Key Takeaways from Day 1
- ML requires quality data, appropriate models, clear objectives, and efficient optimization
- Different problems need different approaches
- Historical foundations inform modern methods
- Balance between theory and practical application is crucial
- Ethics and bias awareness are essential