Applied Data Science with AI/ML
Duration – 20 Days.
Program Structure
- Data Analysis
- Exploratory Data Analysis using Pandas
- Machine Learning
- Python (Pre-recorded video)
- Deep Learning and NLP
- Deep Learning
- Natural Language Processing (NLP)
Project Stream
- Machine Learning and Deep Learning Deploy using Streamlit or Hugging Face
Program Outcomes
- Perform exploratory data analysis (EDA) using Pandas, including data cleaning with proficiency in handling large datasets.
- Build and evaluate machine learning models, including regression, classification, and clustering algorithms, and apply them to solve practical business problems.
- Develop projects that demonstrate the ability to apply learned concepts.
Experiential Project Based Learning
- End-to-End AI Project Experience
Tools / Platform:
- Google Colab / Jupyter Notebook
| Data Analysis | ||
|---|---|---|
| Exploratory Data Analysis with Pandas | ||
| NumPy | Vectorization | Broadcasting |
| Slicing of Matrices | Filtering | Array Creation Functions |
| NumPy Functions across axis | Stacking of arrays | Matrix Calculation |
| Pandas Series | Data Cleaning | Handling Missing Data |
| Pandas Data frame | Selection Data (loc, iloc) | Filtering Data Frames |
| Sorting Data Frames | Importing Data | Data Visualization |
| Working with Categorical Data | Grouping & Aggregation | Merging Data Frame (concat, merge) |
| Machine Learning | ||
| Understand what is Machine Learning | Regression | Logistic regression |
| Supervised machine learning | Simple linear regression | Naïve Bayed Classification |
| Unsupervised machine learning | Multiple linear regression | Decision tress and its types |
| Train test split the data | Performance measure for regression | K Nearest Neighbor Classification |
| ML Workflow for project implementation | MSE, R-Squared, MAE, SSE | Performance Measure for Classification |
| Classification | Various types of classification | Accuracy, Recall, Precision, F1measure |
| Sorting Data Frames | Importing csv files | Importing Excel Files |
| Deep Learning and NLP | ||
|---|---|---|
| Deep Learning | ||
| What is Deep Learning | Hidden Layers | Building project based on CNN |
| Deep Learning Methods | Activation Function | Tensorflow, pytorch, Keras |
| Deep Learning Application | Forward and Backward propagation | Batch Normalization, dropout |
| Artificial Neural Network | Deep Learning Libraries | Performance measure for ANN |
| CNN architecture | CNN for computer vision | Object detection |
| Computer vision basics | OpenCV | Working with Images |
| Natural Language Processing | ||
| Natural Language processing | NLP Applications | Regular Expression |
| Tokenization, stopwords | Stemming, Lemmatization | Word Embeddings |
| NLTK | POS Tagging NER | bag of words, TF-IDF, unigrams, bigrams |
| RNN, RNN architecture | Bidirectional LSTM – Encoders and Decoders | Text classification using ML |
| Project: Machine Learning and Deep Learning | ||
| Apply statistical methods to make decision in various business problems, including bank, stock market | Apply regression to predict future flight price | Apply classification to classify customer churn |
