NASSCOM Futureskills Prime Certified Advanced Data Science with Gen AI including Free Internship & 100% Placement Assistance
Transform Your Career with Data Science + Generative AI
This advanced NASSCOM FutureSkills Prime Certified Data Science course in Hyderabad guarantees career transformation. Here’s a one-time opportunity to learn with the best Data Science training in Hyderabad. Gain knowledge of data analytics, tools, and operations for data science certification and meet the massive demand for these skills.
Here you will learn to read, analyze, clean, engineer, and present data in a way that promotes the growth of your business. To drive data and extract significant results, this Data Science course can help you progress in leaps and bounds. This NASSCOM FutureSkills Prime Certified Data Science training will accelerate your career as it covers relevant topics & pushes you to work on real-time scenarios.
Artificial Intelligence and Machine Learning in Data Science technology are constantly revolutionizing the industry by innovating and solving complex business problems.
Our principle of holistic development lies in the strong bedrock that believes in the amalgamation of theoretical knowledge along with practical training.
PREREQUISITE: The candidate must be pursuing a Bachelor’s Degree.
Previous coding experience is an added benefit.
NASSCOM FutureSkills Prime Certified Advanced Data Science with Gen AI Curriculum (Syllabus)
Your Title Goes Here
Your content goes here. Edit or remove this text inline or in the module Content settings. You can also style every aspect of this content in the module Design settings and even apply custom CSS to this text in the module Advanced settings.
Module 1 : Python Programming
Introduction to Python Programming
- What is Python programming language ?
- Why Python programming language required for Data Science ?
- What is Anaconda ?
- Installation of Anaconda
- Understanding Jupyter notebook
- Basic commands in Jupyter notebook
- Understanding Python Syntax
- Data types in python broadly discussed
Literals, Keywords and Data Types
- What is an Identifier?
- Rules for Identifier Naming
- Inbuilt Keywords
- Variables and Data Types
- print() and input()
Python Operators
- Arithmetic operators
- Assignment operators
- Comparison operators (relational)
- Logical operators
- Identity operators
Conditional Statement
- What are conditional statements ?
- if Statement
- if else Syntax
- if elif else
- nested if and if else
- elif ladder
While Loops & While-Else
- What is loop ?
- Why loops are used ?
- while loop syntax
- Control Flow
- Solving Pattern Problems
Lists
- What is a list ?
- How to define a list ?
- Accessing list elements using indexing and slicing
- Builtin methods
- Basic examples using lists to print elements at even index, etc
Tuple & Set
- What is tuple ?
- How to create a tuple ?
- Accessing tuple elements using indexing and slicing
- Differences between lists and tuples
- What is Set ?
- How to create a set ?
- Builtin methods
- Set operations
- Manipulating and accessing sets
Dictionary
- What is Dictionary ?
- How to create a dictionary ?
- Adding, Modifying and Retrieving values from a dictionary
- Getting all methods
- Builtin methods
- Differences between sets and dictionaries
- Mutable vs Immutable Data Types
Strings
- What is string ?
- Create a string
- Accessing a string using indexing and slicing
- Built-in methods
- format() method
- User input / Dynamical entry
- Type-casting of string
For Loop, For-else & Comprehension
- Introduction to for loop, range(), enumarate(),
- For-else
- List Comprehension & Dictionary Comprehension
- Problem Solving wrt to String Methods utilizing for loops & conditional Statements makingf use
- of chr() and ord()
- Problem Solving with List and Dictionary
- Comprehension
Functions
- What is function ?
- Why functions are used ?
- Terminologies used in functions ?
- How to define a function ?
- How to call a function ?
- Positional and Keyword arguments
Higher Order Functions
- Functions with/without return values
- lambda functions
- Map, filter & reduce
- Namespaces
- Modules and Package
- Using conditional statements
- Using Loops
- Using Functions
Module & Packages
- Modules and Package
- Datetime
- Random
- math,os
- Creating custom module
OOPS – Classes and Objects
- What is a Class ?
- How to create a class ?
- What are the properties and methods of a class ? What is an Object ?
- How to create an Object ?
- How to access the properties and methods of a class using an Object ?
- What is a Constructor ?
- What are the Types of variables ? What are the Types of methods ? Access modifiers
Inheritance & Abstraction
- Inheritance:
- Single Inheritance
- Multilevel Inheritance
- Multiple Inheritance
- Hierarchical Inheritance
- Hybrid Inheritance and MRO Abstraction
Polymorphism and Encapsulation
- Polymorphism:
- Dynamically Typed or Duck Typing
- Operator, Method and Constructor Overloading
- Method and Constructor Overriding Encapsulation
- Setter and Getter
File Handling & Exceptional Handling
- What is a file ?
- How to create a file ? How to open a file ? How to close a file ? Writing to a file Appending to a
- file Reading from a file Testing file’s existence
- What is an exception ? Raising exceptions try finally
- Custom exceptions
VS Code & Streamlit Deployment
- What is streamlit
- Components of Streamlit
- Creating an application with Streamlit
- VS Code, Setting Environment
- Executing programs in VS Code.
Backend with FastAPI
- Client Sever Architecture
- Introduction to Fast API
- Request Methods
- Hands on FastAPI
- Application building
- Integration of Fast API with streamlit
- Case Study
Module 2 : EDA with Applied Statistics
Introduction to EDA
- How is EDA different from python programming?
- EDA vs Python with case study.
- Types of data (Numerical, Categorical)
- Types of Analysis (Nonvisual and visual, Univariate and Bivariate, Descriptive, Inferential & Probabilistic)
Descriptive Statistics
- What and why Statistics?
- Data and its Measures
- Measures of central tendency (Univariate Analysis)
- Measures of dispersion (Bi Variate)
Core Numpy Operations
- Generating Random Numbers – np.random.rand, np.random.randn, np.random.randint, np.random.uniform
- Indexing and Slicing of Arrays (1D & 2D)
- Indexing with Boolean Arrays
- Updating Numpy Array
- Insert, Append and Delete
- Array Shape Manipulations
– reshape
– ravel and flatten
– transpose - Problem Solving
Mathematical & Statistical Functions
- Iterating over a 1D and 2D numpy array
- Iteration using np.nditer()
- Python Operators on Numpy Array
- Numpy Maths – sqrt(), exp(), sin(), cos(), add(), subtract(), multiply(), divide(), dot(), @operator
- Numpy Statistics – sum(), min(), max(), mean(), median(), var(), std(), corrcoef()
Vectorization and Advanced Numpy Functions
- Concatenate, vstack, hstack, column_stack, hsplit, vsplit
- Sorting the Data in an array
– sort
– argsort - Broadcasting Arrays
- Numpy where, any
Introduction to Pandas
- Introduction to Pandas
- Pandas Series
- Pandas DataFrame
- Difference between Series and Dataframe
- Creating Series and Dataframes
Data Exploration and Understanding
- Loading data from csv and excel files
- Understanding the dataframe structure
- Dataframe non indexing attributes
- Dataframe utility methods
- Dataframe iteration methods
- Mathematical functions on whole dataframe
- Getting column index/label
- Changing column labels
- Selecting, swapping columns
- Adding new columns/dropping existing columns
- Vectorized arithmetic operations
- Datatype conversions
- Math and statistical functions on columns
Groupby and Pivot on Data Frame
- Summarizing data using groupby()
- Multidimensional aggregation using pivot table()
- Categorical analysis using crosstab
Working With Multiple Tables Using Joins (Merge Operation)
- Concatenation – vertical and horizontal stacking
- Merging (merge) – inner, outer, left and right joins
- Index based merging
Handling Missing Values, String and Datetime Manipulations
- Types of missing data
- Identifying and visualizing missing data
- Missing value imputation
- String operations
- Working with datetime”
Advanced Data Transformations
- apply() and map()
- WIndow functions (rolling(), expanding())
- Handling JSON files
- JSON record_path_normalization
Univariate and Bivariate Statistical Analysis (Non Visual)
- Measures of shape
- Measure of relationship
- With a Case Study Descriptive stats
Introduction to Data Visualization
- Understanding visualization
- Basic structure of a plot – figure, axis, title, legends, ticks, labels, subplots
- Creating basic plots with raw data
- Customizing the plots – colormaps, axis limits, markers, linestyles, colors
Univariate analysis with Matplotlib/Seaborn
- Univariate plots for numeric data
– Histogram, kde, box plot, violin plot - Univariate plots for categorical data
– bar plot, count plot
Bivariate and Multivariate analysis with Matplotlib/Seaborn
- Plots between Numerical variables
– Scatter plot, heat map, line plot - Plots between Categorical variables
– kde plot, box plot, histogram, boxen plot, violin plot - Plots between a numeric and a categorical variable
– count plot, stacked bar plot - Plots for multivariate analysis
– pairplot, heatmap
Introduction to Probability
- Random Experiment
- Sample Space
- Event
- Axioms of Probability
- Basic Probability Examples
Frequency Tables
- Introduction to Frequency Tables
- Joint Probability
- Marginal Probability
- Conditional Probability
Introduction to Data Distributions, Discrete Data Distributions
- What is a Data Distribution?
- Why Study Data Distribution?
- Key Characteristics
- Types of Distributions
- Introduction to Scipy Library
- Bernaulli, Binomial, and Poisson
Continuous Data Distribution
- Uniform, Normal
- Exponential
- Log Normal
- Pareto Distribution
- 68-95-99.7% Rule
- Verify the Distribution using QQ Plot”
Feature Engineering
- Feature Scaling Normalization and Standardization
- Data Transformation
- Log Transformation and Box-Cox
Introduction to Inferential Statistics
- Inferential Statistics
- Population vs Sample
- Why Inferential Statistics?
- Sampling Techniques
– Convenience Sampling
– Systematics Sampling
– Simple Random Sampling
– Stratified Sampling
– Cluster Sampling - Sampling in Pandas DataFrame
Central Limit Theorem and Point Estimate
- Performing Estimations
- Point Estimate using single sample
- Central Limit Theorem
- Point Estimate using m samples
Confidence Interval Estimate
- Confidence Interval Estimates using 1 Sample (Population Std is known and unknown)
- Confidence Level
- Significance Level
- Critical Value – z score and t -score
Hypothesis Testing and Advanced Statistical Tests
- What is Hypothesis Testing?
- What is p-value?
- Important Hypothesis Tests
- Type I and Type II Errors
- Parametric and Non Parametric Tests
– chisquare test
– ANOVA
Regular Expressions
- Pattern Matching Basics
- Email, Phone Number & URL Extraction
- Text Cleaning & Data Validation
- Removing Special Characters
- Regex with Python (re module)
Ethical Web Scraping
- Requests Library
- BeautifulSoup
- HTML Parsing
- CSS Selectors & Inspect Element
- Extracting Tables, Text & Links
- Pagination Handling
Data Cleaning & Preprocessing
- Handling Missing Values
- Removing Duplicates
- String Cleaning
- Formatting & Standardization
- Data Transformation using Pandas
Exploratory Data Analysis (EDA)
- Data Understanding & Profiling
- Statistical Summary
- Data Visualization using Matplotlib & Seaborn
- Correlation & Trend Analysis
- Business Insights Generation
Project on EDA
- Collect real-time data from websites ethically
- Clean and preprocess raw datasets
- Apply Regex for data extraction & validation
- Perform EDA and generate insights
- Build industry-oriented projects
Module 3 : Database (MySQL)
Introduction to SQL
- Data
- What is Database
- DBMS
- RDBMS
- SQL vs MYSQL
- SQL vs NoSQL
- CRUD operations
- Pandas vs SQL
Data Exploration and Data Filtering (DQL and OPERATORS)
- Client Server Architecture
- Workbench introduction
- Select (retrieve)
- Data Exploration
- Selecting columns
- Performing (limit, distinct, aggregation values, indexing and slicing using offset)
- Data Filtering
- Filtering data based of conditions (with all operators(Like,Regexp,Between))
Clauses
- GROUP BY (Aggregate Function)
- HAVING (correlating with get_group in pandas)
- ORDER BY
- CASE
- Order of execution
- Case Study: A Case Study of Clauses
Multiple Tables
- Multiple Tables
- Primary key
- Composite key
- Foreign key
- Types of relationships in SQL
- ER diagram
Joins, Unions And Subquery
- Types of Joins
- Inner join
- Outer join
- Left
- Right
- Cross Join
- Self join
- Set operations
- UNION
- UNION ALL
- Subquery
- Scalar Subquery
- Multiple Subquery
- Correlated Subquery
Temporary Tables
- Derived table
- CTE
- Inbuilt Functions
- Window Functions
- Case Study: A Case Study of Temporary Tables
SQL Fundamentals
- Types of SQL Commands
- Data types
- Constrains (PRIMARY KEY/auto_increment, NOT NULL, UNIQUE, DEFAULT, CHECK)
- Creating table with constraints
- DDL (CREATE, ALTER, DROP, TRUNCATE)
- DML (Insert, Update, Delete)
SQL Database Objects
- Views
- Stored Procedure
- Functions
Advance Topics
- Transaction Control Language
- ACID properties
- (COMMIT, ROLLBACK, Savepoint)
- Triggers
Project on MySQL
- Analyze normalized relational databases and ER diagrams based on real-world business requirements.
- Import and manage CSV datasets in SQL databases.
- Write SQL queries using filters, aggregations, sorting, and joins.
- Develop advanced SQL solutions using CTEs, Views, Stored Procedures, and Window Functions.
- Solve domain-specific business problems and generate actionable insights.
Module 4: Reporting Tool (Power Bi)
Introduction To Power BI
- What is Business Intelligence?
- Power BI Introduction
- Quadrant report
- Comparison with other BI tools
- Power BI Desktop overview
- Power BI workflow
Data Import And Data Visualizations
- Data import options in Power BI
- Import from Web (hands on)
- Why Visualization?
- Visualization types
- Categorical data visualization
- Trend Data viz
Power Queries
- Power Query Introduction
- Data Transformation – its benefits
- Introducing ribbons
- Queries panel
- M Language briefing
- Power BI Datatypes
- Changing Datatypes of columns
- Filtering
- Inbuilt column Transformations
- Inbuilt row Transformations
- Combine Queries
- Merge Queries
Power Pivot And Introduction To Dax
- Power Pivot
- Intro to Data Modeling
- Relationship and Cardinality
- Relationship view
- Calculated Columns vs Measures
- DAX Introduction and Syntax
Data Analysis Expressions
- DAX logical functions
- DAX text functions
- DAX math and statistical Functions
- DAX aggregation function
- DAX filter function
- DAX time intelligent function
- Creating a Date Dimension table
- Related aspects with tables
Login, Publish To Web And RLS
- Power BI services
- Dashboard creation
- Web Content, Image, Text Box
- Dashboard formatting
- Sharing your dashboard
- RLS introduction
Miscellaneous Topics
- Visual Interactions
- Drill Through
- Drilldown
- Conditional Formatting
- Creating buttons in Power BI reports
- Creating Python Script Visuals
Project on POWER BI
- Understand business problems and define reporting objectives.
- Load data from multiple sources into Power BI.
- Clean and transform data using Power Query.
- Build data models and establish relationships.
- Create DAX measures and calculated columns.
- Design interactive dashboards and reports.
- Generate insights and business recommendations across multiple industry domains.
Module 5 : Applied Machine Learning
ML vs DL vs AI, Reg vs Class
- Why learn AI/ML?
- AI vs ML vs DL
- Supervised vs Unsupervised Learning
- Various Machine Algorithms
- Classification Task
- Regression Task
Model Building Pipeline
- Understanding the Model Building Phase,Splitting data into train and test
- Data Preprocessing Pipelines
- Intro to Numerical Data Preprocessing
- Intro to Categorical Data Preprocessing
Data Preprocessing – Numerical & Categorical Features
- Introducing sklearn module
- Preprocessing Numerical Data
- Preprocessing Categorical Data
- Nominal Encoding
- Ordinal Encoding
Case Study
- Building Models
- Making Prediction and Evaluation using sklearn
- Case Study
Text Preprocessing
- Text Data – Introduction to NLP
- Why Text data is hard to work with
- Cleaning Text Data
- Tokenisation
- Stop Words
- Lemmatization
- Stemming
Text Transformation
- Bag of Word
- Term Frequency Inverse Document
- Frequency,Spam Ham Detection
Working with Image Data
- Image Data – Understanding Images
- RGB channels
- Images as 3D numpy arrays
- Image Data – Understanding Images
- RGB channels
- Images as 3D numpy arrays
Image Case Study
Mathematical Foundations
- Introduction and why Linear Algebra
- Fundamentals of Vectors and Matrices
- Unit Vector
- Matrices operations
- Dot Product of vectors
- Angle between two vectors
- Projection of a vector onto another vector
- Length of projection
- Distances and Dot Products
kNN for Classification & Regression
- Intuition behind KNN
- Developing the algorithm for KNN
- Solving Classification Problems with KNN
- Code Implementation for KNN
Classification Evaluation Metrics
- Accuracy
- Confusion matrix
- Precision and Recall
- F1 Score
- ROC AUC
- Log Loss
kNN for Regression & Regression Evaluation Metrics
- KNN for Regression
- Regression Metrics
– MAE
– MSE
– R square
-Adjusted R square
Introduction to Model Selection
- Occam’s Razor
- What is model complexity?
- Complex vs Simple Models
Train and Test Errors
- Overtting and Undertting
- Bias and Variance
- Model Complexity vs Train Test Error
- Parametric vs Non Parametric Models
- Hyperparameter vs Parameters
Hyperparameters and Cross Validation
- Hyperparameter for ML Algorithms
- Regularization
- Hyper Parameter Tuning
- Hyperparameter for ML Algorithms
- Regularization
- Hyper Parameter Tuning
Hyperparameter Tuning with Lab
- GridSearch CV
- RandomizedSearch CV
- Hands on
Naive Bayes Derivation, Solving an Example with NB
- Derivation of Naive Bayes Algorithm
- Solving an example
- Code implementation,NB Example
Decision Tree(DT) Algorithm
- Introduction to rule based approach to solve classification and regression problem
- Building Decision Trees
- ID3, C4.5, C5.0 algorithms to build decision trees
- Concept of Entropy, Information Gain and Gini Impurity
- Python Code Sample
DT for Classification and Regression
- Working of DT with numerical input data
- DT for Regression
- Reduction of Variance
Solving an Example with DT
Equation of Line and Linear Regression, Multiple Linear Regression
- Understanding equation of a Line and Hyperplane
- Intuition behind Linear Regression
- Building a cost function for Regression and Classification Problem
- Mathematical Formulation of Linear Regression
- Calculating the errors
- Python Implementation,Simple Linear Regression vs Multiple Linear Regression
- Issues with Multiple Linear Regression
- Python Implementation
- Issue of Multicollinearity
- Detecting Multicollinearity with VIF
Linear Regression Edge Cases
- Linear Regression with Polynomial Features
- Assumptions of Linear Regression
- Hands on
Gradient Descent
- Understanding Differentiation of a Function
- Slope of Tangent
- Computing Derivative
- Computing Maxima and Minima
- Iterative algorithm to solve the problem of Maxima and Minima
- Update function of Gradient Descent
Logistic Regression
- Geometric Intuition behind Logistic Regression
- Mathematical Formulation of Logistic Regression
- Signed Distance Formulation
- Sigmoid Function
- Numerical Instability Issue
- Assumptions of Logistic Regression
- Code Sample
Support Vector Machines
- Intro to SVM
- Solving classification problem using SVM
- Logistic Regression vs SVC
- Margin Maximization
- Hard Margin Support Vector Classier,Soft Margin Support Vector Classier
- How kernel trick works?
- Pros and Cons of SVC
Parallel Ensembles
- Introducing Model Ensembles
- Voting Ensembles
- Stacking
- Bagging
– Random Forest-case study
Sequential Ensembles
- Cascading
- Boosting
- ADABoost
- GBDT
- XGBoost
-case study
Feature Engineering – Transformation, Feature Engineering – Selection
- Revision on Feature Transformation
- Feature Selection Introduction
- Variance Thresholding,Variance Ignition Factor
- Recursive Feature Elimination
- Lasso Regularization
- Decision Tree for Feature Importance
- Implementation using Python
Introduction to Unsupervised Learning and KMeans, K Means ++
- Understanding Unsupervised Learning
- Clustering
- Applications of Clustering
- K-Means Algorithm
- Python Code Sample,K Means++ initialisation
Hierarchical Clustering, Customer Segmentation
- Intro to Hierarchical Clustering
- Merging
- Linkage – Single, Centroid, Complete
- Dendrogram
- Cut Tree, Solving Customer Segmentation using Clustering
PCA and Use Case
- Introduction to Dimensionality Reduction
- Introduction to Principle Component Analysis
- Eigenvalues and Eigen Vectors
- Orthogonality
- Transforming Eigenvalues
- Proportion of Variance Explained in PCA
- Code Implementation
Project on Machine Learning
- Understand business problems and dene machine learning objectives.
- Collect datasets from multiple sources or work with publicly available datasets.
- Perform data cleaning, preprocessing, and feature engineering.
- Conduct Exploratory Data Analysis (EDA) to identify trends and patterns.
- Split data into training, validation, and testing datasets.
- Build and train supervised and unsupervised machine learning models.
- Evaluate model performance using appropriate metrics.
- Optimize models using hyperparameter tuning and cross-validation.
- Implement MLOps workflows using MLow for experiment tracking and model management.
- Track model metrics, parameters, versions, and training workflows.
- Interpret model outputs and generate actionable business insights.
- Deploy machine learning applications using Streamlit.
- Create end-to-end ML pipelines for scalable real-world applications.
Your Title Goes Here
Your content goes here. Edit or remove this text inline or in the module Content settings. You can also style every aspect of this content in the module Design settings and even apply custom CSS to this text in the module Advanced settings.
Module 6 : Deep Learning ANN
Introduction to Deep Learning and Artificial Neural Networks
- Understanding Biological Neuron
- Analogy between Biological and Artificial Neuron
- Single Layer Perceptron Model
- Understanding Weights, Bias and Activation
Perceptron Model
- Perceptron learning rule
- Artificial Neuron
- MLP/FCNN/ANN Model
- Nomenclature of ANN
Training Neural Network
- Forward pass with formulation
- Backward Pass (Without Formulation)
- Understanding backpropagation algorithm (Using AutoGrad methods like, Gradient Tape in Tensorow)
ANN Model Building
- Classification Model Building Using Tensorow/Keras
- Regression Model Building Using Tensorow/Keras (As an assignment)
- Evaluating Model
Neuron Activation Function and Output Functions
- Linear Activation
- SIgmoid Function
- Tanh Function
- ReLU Function (Leaky ReLU, Parametric ReLU)
- Softmax Function
Optimizers
- Gradient Descent
- Mini Batch Gradient Descent
- Momentum Based Gradient Descent
- Adaptive Gradient Descent
- Adam
- RMS Prop
Overfitting in ANN
- L1 and L2 Regularization
- Dropout Regularizer
- Early Stopping
- Batch Normalization
Hyperparameter Tuning in ANN
- Introduction to Optuna
- Creating a Study in Optuna
- Finding Hyperparameters using Optuna
- Hyperparameter Importance Using Optuna
Hyperparameter Tuning in ANN
- Introduction to Optuna
- Creating a Study in Optuna
- Finding Hyperparameters using Optuna
- Hyperparameter Importance Using Optuna
Project on ANN
- Understand business problems and define machine learning objectives.
- Collect datasets from multiple sources or work with publicly available datasets.
- Perform data cleaning and preprocessing
- Split data into training, validation, and testing datasets.
- Build supervised ANN models.
- Evaluate model performance using appropriate metrics.
- Optimize models using hyperparameter tuning.
- Track model metrics, parameters, versions and training workflows.
- Interpret model outputs and generate actionable business insights.
- Deploy machine learning applications using Streamlit.
- Create end-to-end ML pipelines for scalable real-world applications.
Module 7: NLP with Transformers
Introduction to Text and its Preprocessing with nltk or spacy
- Intro to NLP
- Text Preprocessing Steps
- BOW, TF-IDF
- Coding for BOW and TF-IDF using nltk/spacy
Word Embeddings – Word2Vec
- Word2Vec
- Autoencoders
- How Word2Vec algorithm works (CBOW, Skip-gram)
Case Study – Text Classification
- Text Data Classification Case Study
Sequence modelling with RNN
- Intro to RNN
- Training of RNN
- Types of RNN
- Text Classication
LSTM, GRU
- Limitations of RNN
- Idea of LSTM, GRU
- POS Tagging
Self Attention and Transformers
Seq2Seq Architecture
- Attention Mechanism
- Implementing Attention Layers
- Hands-on: RNN/LSTM + Attention
Transformer Architecture
- Positional Encoding
- Self and Multihead Attention
- Masked Multihead Attention
- GPT with Autoregressive Language Modelling
- BERT withAuto-encoding Language Modelling
HuggingFace API
- Introduction to HuggingFace
- HuggingFace Embeddings
- Case Study: Sentiment Analysis, Named Entity Recognition, Parts of Speech Tagging, Question & Answering
NLP Tasks using AutoClasses
- Auto classes – Cong, Tokenization and Models
Text Generation & Question Answering
- Text Summarization
- Text Translation
Evaluation Metrics
- Introduction evaluation Metrics
- BLEU, ROUGE
- METEOR, Perplexity
HuggingFace for Image and Audio Data
- Image classification
- Image detection
- Audio Classication
Project on Natural Language Processing (NLP)
- Understand real-world text analytics problems and dene NLP objectives.
- Collect textual datasets from multiple sources or use publicly available datasets.
- Perform text cleaning, tokenization, stemming, and lemmatization.
- Convert text into numerical representations using TF-IDF, Word Embeddings, and basic transformer pipelines.
- Build NLP models for tasks such as Sentiment Analysis, Text Classification, Spam Detection, and Chatbots.
- Train and evaluate NLP models using appropriate performance metrics.
- Extract insights from unstructured textual data.
- Deploy NLP applications using Streamlit.
- Build scalable NLP solutions for real-world business use cases.
Module 8 : Advanced Gen AI & Agentic AI
Introduction to Generative AI and LLMs
- What is GenAI?
- Large Language Models and its capabilities
- Open AI / Gemini API / Groq API authentication
Prompt Engineering
- Temperature, top-p
- Prompting
- User and system prompts
Introduction to LangChain
- Why Langhain?
- Import Chat Models
- Integrating with OpenAI/Gemini/Groq API
- LCEL Chain
Components of a Chain
- Prompt Template
- Chat Prompt Template
- System Message, Human Message & AI Messages
LCEL Chain
- Output Parser
– StroutputParsers, pydantic Parser - Build a Basic ChatBo
Runnables
- Runnable Passthrough
- Runnable Parallel
- Runnable
- Case study
Conversation Memory
- Adding memory to langchain applications
- Context Window Optimization
- Conversation Summarization Techniques
Monitoring and Observability
- Introduction to Monitoring and Observability
- Observability tools
- Tracing Implementation
RAG Fundamentals
- What is RAG? and Why RAG?
- Document loaders (PDF, web, CSV)
- Text splitters & chunking
Embeddings and Vector Database
- Vector stores
- Vector embeddings
- Retrieval chain
Hybrid RAG
- Keyword, Semantic and Hybrid Search
- Re-ranking
RAG Evaluation
- RAGAS evaluation
- LLM-as-a-judge
Tool Calling & Agents
- ReAct loop
- Build a ReAct Agent using Create Agent
- Add custom tools (like search and wikipedia)
- Response Formatting
- Error Handling
- Memory with Checkpointer
- PII middleware
- Tool Retry and Model Retry
Monitoring and Observability
- MCP Client Server Architecture
- Transport Protocol – stdio and http
- JSON-RPC Protocol
- Creating MCP Host with MCP Client in Langchain
Introduction to LangGraph
- Langchain vs LangGraph
- What is the state?
- StateSchema – TypedDict
Building workflows with LangGraph
- Building a Graph
- StateGraph – adding nodes and edges
- Sequential, Parallel and Conditional Workflows
Conversational Chatbot with Memory and Tool
- Adding memory to graph
- InMemorySaver
- Add tools to chatbot
Streaming and Human In The Loop
- Streaming and its types
- Human in the Loop (HITL)
LangGraph application Deployment
- LangGraph Agent deployment
- AgentChatUI
FineTuning Transformers for NLP Task
- TrainerAPI, Training Arguments
- Sequence-to-Sequence vs Causal SFT
- Case Study Text Summarization
PEFT with LoRA and QLoRA
- PEFT
- Low-Rank Adaptation (LoRA)
- Bits and bytes integration
- QLoRA
Project on Gen AI
- Understand real-world business problems and dene Generative AI objectives.
- Collect datasets from multiple sources or use publicly available datasets.
- Preprocess and prepare textual data for Generative AI applications.
- Work with Large Language Models (LLMs) and prompt engineering techniques.
- Build applications AI Chatbots, Document Q&A Systems, Text Summarization, & Content Generation tools.
- Integrate vector databases and embeddings for semantic search and retrieval.
- Implement Retrieval-Augmented Generation (RAG) pipelines.
- Evaluate Generative AI applications using relevant qualitative and quantitative metrics.
- Implement observability, tracing, and monitoring using tools such as Langfuse.
- Track prompts, responses, token usage, latency, and application performance.
- Deploy Generative AI applications using FastAPI and Streamlit.
- Build scalable end-to-end Generative AI solutions for real-world business use cases
Project on Agentic AI
- Understand real-world business problems and dene Agentic AI objectives.
- Design AI agents capable of reasoning, planning, and task execution.
- Integrate Large Language Models (LLMs) with external tools and APIs.
- Build multi-step workflows using agent frameworks and orchestration techniques.
- Develop AI agents for tasks such as Automation, Research Assistance, Data Analysis, & Workforce Management.
- Implement memory, context handling, and conversational capabilities in AI agents.
- Build Retrieval-Augmented Generation (RAG) and tool-calling pipelines for intelligent task execution.
- Evaluate agent performance based on accuracy, efficiency, and task completion.
- Implement observability, tracing, and monitoring using tools such as Langfuse.
- Monitor agent workflows, reasoning steps, latency, and tool usage.
- Deploy Agentic AI applications using FastAPI and Streamlit.
- Build scalable autonomous AI systems for real-world business applications.
Module 9: Computer Vision
Intro to Images and Image Preprocessing with OpenCV
- Introduction to an Image
- How Images are formed and stored in machines
- Introduction To OpenCV – Read, Write and Save images
- Converting to Color Spaces (RGB, BGR, HLS, HSV)
- Bitwise Operators on Images
Image Preprocessing with OpenCV
- Drawing on images
- Ane and Non-Ane Transformation
- Building Histograms for Images
- Edge detection and Blurring
- Read videos
- Capturing images with web camera
- Manipulating videos with previous operations on images
Intro to Convolutional Neural Networks
- Introduction to CNN
- Why CNN over MLP
- How does Convolution works on images
- How does Color image and grayscale image work over convolutional
- Padding, Stride, Maxpooling Operations
- Convolution Arithmetic
Image Classification Case Study
- Image classification on HandWritten Dataset
- Face Mask Detection
CNN Architecture
- Pre trained Model Introduction
- Alex net, Vgg 16 and Inception
- Resnet and Skip Connections
Case Study with Transfer Learning
- Plant Diseases Prediction using Transfer Learning
- Cifar using Transfer Learning
- Improving Face Mask Detection Model using Transfer Learning
Object Detection
- Intro To object Detection
- R-CNN
- Fast R-CNN
- Faster R-CNN
YOLO Algorithm
- Intro to Yolo Algorithm
- How Yolo works?
- Introduction to Roboow
YOLO Casestudy
- Helmet Detection using Yolo
Image Segmentation with Case Study
- Introduction to Image Segmentation
- A case study on Image Segmentation
Project on Computer Vision
- Understand image-based business problems and dene computer vision objectives.
- Collect and preprocess image and video datasets.
- Perform image augmentation and normalization techniques.
- Build computer vision models using OpenCV, CNNs, and Transfer Learning.
- Develop applications such as Object Detection, Face Recognition, Image Classication, & Pose Estimation.
- Train and evaluate deep learning models using appropriate CV metrics.
- Optimize model performance for real-time inference.
- Integrate computer vision pipelines with live webcam and video streams.
- Deploy computer vision applications using Streamlit.
- Build end-to-end AI-powered vision systems for real-world applications.
Languages & Tools Covered in Data Science course

Why Innomatics Stands Out the Best!
Why Data Science at Innomatics Research Labs?
- 500+ Industry experts from Fortune 500 companies
- Dedicated In-house data scientist team accessible round the clock
- 200+ Hours of intensive practical-oriented training
- Flexible Online and Classroom training sessions
- 5+ Parallel Data science batches running currently on both weekdays & weekends
- Backup Classes and Access to the Learning Management System (LMS)
- One-to-one mentorship and Free Technical Support
- FREE Data science internship on our projects & products
- Projects and use cases derived from businesses
- 30+ POCs and use cases to work, learn, and experiment
- Biweekly industry connections from industry experts from various sectors
- Opportunity to participate in Meet-ups, Hackathons, and Conferences
- Dedicated training programs for NON-IT professionals
- 100% Placement Assistance
- Globally Recognized Certification from NASSCOM FutureSkills Prime
What is the scope of Certified Data Scientists in India?
Data Science Continues to be One of the Most In-Demand Careers of The Digital Era
Data Science remains one of the most attractive and high-growth career options globally. With the rapid adoption of Artificial Intelligence, Machine Learning, Generative AI, and Big Data technologies, the demand for skilled Data Scientists continues to surge across industries.
According to recent industry reports, India is among the fastest-growing AI and data analytics markets in the world. Organizations across sectors such as IT, Healthcare, Finance, E-commerce, EdTech, and Manufacturing are heavily investing in data-driven decision-making, creating massive career opportunities for certified Data Science professionals.
Over the past few years, India has witnessed significant growth in AI adoption, with businesses increasingly leveraging data to enhance customer experience, optimize operations, and drive innovation. The integration of Generative AI and advanced analytics has further increased the demand for professionals who can extract meaningful insights from complex datasets.
The salary prospects for Data Scientists in India remain highly competitive. The average salary of a Data Scientist in major tech hubs like Hyderabad, Bangalore, and Pune ranges between ₹8–12 LPA. Entry-level Data Scientists typically earn between ₹6–10 LPA, while experienced professionals and AI specialists can earn ₹25 LPA or higher. In top-tier organizations and leadership roles, compensation packages can reach crores depending on expertise and domain specialization.
With continuous learning, hands-on project experience, and industry-recognized certifications, aspiring professionals can build a strong and future-proof career in Data Science. As organizations continue their digital transformation journey, the demand for skilled Data Scientists is expected to grow exponentially in the coming years.
Job opportunities (Careers) in Data Science Technology
Data Scientists are needed for businesses in every Industry. Even tech giants such as Google, Amazon, Apple, Facebook, Microsoft are constantly in need of Data Science experts who have in-depth knowledge in data extraction, data mining, visualization, and more. Here are some leading careers in Data Science
Business Intelligence Developer
With an average salary of $89,333, they design and develop business strategies for quick decision-making and growth.
Data Scientist
With an average salary of $139,480, they explore, analyze, visualize, and organize data for the companies. They analyze the complex data sets and processes to find patterns for decision making and predicting the business and drive strategies.
Applications Architect
With an average salary of around $134,520, they track applications behavior and applied in the business to analyze the way they interact with the user.
Industry Architect
With an average salary of $126,353, they analyze the business system and optimize accordingly to support the development of updated technologies and system requirements.
Enterprise Architect
With an average salary of $161,323 they work with stakeholders, including management and subject matter experts (SME), to develop a view of an organization’s strategy, information, processes, and IT assets.
Data Architect
With an average salary is $137,630, they build data solutions that can be applied on multiple platforms.
Data Analyst
With an average salary of $83,989, they transform and manipulate large sets of data, which incorporate web analytics tracking and testing.
Data Engineer
With an average salary of $151,498, they perform real-time processing on data that is visualized and stored.
Success Stories of Innomatics Alumni

































Here are the Success Stories of our Innominions
Frequently Asked Questions (FAQs)
Your Title Goes Here
Your content goes here. Edit or remove this text inline or in the module Content settings. You can also style every aspect of this content in the module Design settings and even apply custom CSS to this text in the module Advanced settings.
What will I learn in NASSCOM FutureSkills Prime certified Data Science?
In Data Science, you will learn how to find valuable data, analyze and apply mathematical skills to it to use in business for making great decisions, developing a product, forecasting, and building business strategies.
What is the average salary of a Data Scientist?
In India, Data Scientist salaries vary widely, typically ranging from ₹3 LPA to ₹20 LPA, depending on skills and experience. Here’s a quick breakdown:
Data Analyst: ₹3–7 LPA
Junior Data Scientist: ₹6–9 LPA
Data Scientist: ₹10–15 LPA
Are there any prerequisites to learn the Data Science course?
One need not have any major knowledge in Data Science. A basic understanding of technology is all enough to get started. It is better to possess knowledge of mathematical and communication skills, Python, R, and SAS tools.
What are my takeaways after completion of the Data Science course?
Based on the program you choose, you will get a course completion certificate from Innomatics. Mastery-level certification from NASSCOM FutureSkills Prime.
What are the career opportunities in Data Science Technology?
As data has become the never-ending part of this world, businesses need people to work with data for effective business processing. Organizations are ready to recruit and pay top dollars to the right dollars, which can leverage the business.
Here are some of the roles you can find in Data Science
- Research Analyst
- Data Scientist
- Data Analyst
- Big Data Analytics Specialist
- Business Analyst Consultant / Manager
- Data analyst
If I study Data Science course in Hyderabad, is placement guaranteed?
Apart from the training, we do provide placement and career assistance with capstone projects and hands-on training after completing the course successfully. We do offer internship programs, mockup interviews, hackathons to gain more knowledge and explore a wide range of job opportunities.
Will I get any career support after the Data Science training?
All our trainees will have access to the Learning Management System (LMS), where they can get the backup classes and stay updates, 1-1 interviews, continuous updates on placements, and hackathons.
What is the eligibility criteria to learn Data Science course?
Anyone who has a bachelor’s degree, a passion for data science, and little knowledge of it are eligibility criteria for the Data Science Course.





