Upcoming Batches
Data Science Live Training | 04/07/2022 | 7:00 am | Enroll |
About Course
Why Choose Us?
OnlineITvidhya institute is giving the best data science course online training in the business. considering the huge and misty ideas in data science managing a diverse assortment of subjects, for example, measurements, data science, information investigation, and AI, we are resolved to give the greatest measure of information proficiently to our students with the best workforce accessible in the field of data science. Our teaching staff will facilitate the ideas to make you alright with the subject and addition standard data and conceptualize each module with much attention. Our staff will consistently be accessible to help you with explanations regarding the course and give you the best involvement with learning this course. Complete help will be given by us to you to be an expert Data Scientist.
Who Can Learn Data Science?
Data Science Online Training course by OnlineITvidhya can be picked by any it professional, data analyst, developer, information technology professional, statistician, graduate, or any individual who wants to be a data scientist. Data Science has a business of 30 billion in the following 10 years, the normal bundle a data scientist procures into his pocket is around $1,25,000 per annum. The development rate is going at a high pace at present in the market, openings for work for a data scientist is improving even rapidly.
Improve Your Career
Taking Data Science Online Training with OnlineITvidhya institute will guarantee your career growth. Our data scientist certification you get after the completion of the data science course is substantially valid all around the globe for a lifetime, our classroom and online practice sessions will reinforce you to confront the circumstances in the associations or innovation industry. The capacities that you need, to be a decent data scientist around the world are being instructed by our staff significantly.
- Recap of Demo
- Introduction to Types of Analytics
- Project life cycle
- An introduction to our E learning platform
- Data Types
- Measure Of central tendency
- Measures of Dispersion
- Graphical Techniques
- Skewness & Kurtosis
- Box Plot
- R
- R Studio
- Descriptive Stats in R
- Python (Installation and basic commands) and Libraries
- Jupyter note book
- Set up Github
- Descriptive Stats in Python
- Pandas and Matplotlib / Seaborn
- Random Variable
- Probability
- Probability Distribution
- Normal Distribution
- SND
- Expected Value
- Sampling Funnel
- Sampling Variation
- CLT
- Confidence interval
- Assignments Session-1 (1 hr)
- Introduction to Hypothesis Testing
- Hypothesis Testing with examples
- 2 proportion test
- 2 sample t test
- Anova and Chisquare case studies
- Visualization
- Data Cleaning
- Imputation Techniques
- Scatter Plot
- Correlation analysis
- Transformations
- Normalization and Standardization
- Principles of Regression
- Introduction to Simple Linear Regression
- Multiple Linear Regression
- Multiple Logistic Regression
- Confusion matrix
- False Positive, False Negative
- True Positive, True Negative
- Sensitivity, Recall, Specificity, F1 score
- Receiver operating characteristics curve (ROC curve)
- R shiny
- Streamlit
- Supervised vs Unsupervised learning
- Data Mining Process
- Hierarchical Clustering / Agglomerative Clustering
- Measure of distance
- Numeric – Euclidean, Manhattan, Mahalanobis
- Categorical – Binary Euclidean, Simple Matching Coefficient, Jaquard’s Coefficient
- Mixed – Gower’s General Dissimilarity Coefficient
- Types of Linkages
- Single Linkage / Nearest Neighbour
- Complete Linkage / Farthest Neighbour
- Average Linkage
- Centroid Linkage
- Measure of distance
- Visualization of clustering algorithm using Dendrogram
- Non-Hierarchial
- Measurement metrics of clustering – Within Sum of Squares, Between Sum of Squares, Total Sum of Squares
- Choosing the ideal K value using Scree plot / Elbow Curve
- A geneal intuition for DBSCAN
- Different parameters in DBSCAN
- Metrics used to evaluate the performance of model
- Pro’s and Con’s of DBSCAN
- PCA and tSNE
- Why dimension reduction
- Advantages of PCA
- Calculation of PCA weights
- 2D Visualization using Principal components
- Basics of Matrix algebra
- What is Market Basket / Affinity Analysis
- Measure of association
- Support
- Confidence
- Lift Ratio
- Apriori Algorithm
- User-based collaborative filtering
- Measure of distance / similarity between users
- Driver for recommendation
- Computation reduction techniques
- Search based methods / Item to item collaborative filtering
- Vulnerability of recommender systems
- Workflow from data to deployment
- Data nuances
- Mindsets of modelling
- Elements of Classification Tree – Root node, Child Node, Leaf Node, etc.
- Greedy algorithm
- Measure of Entropy
- Attribute selection using Information Gain
- Implementation of Decision tree using C5.0 and Sklearn libraries
- Encoding Methods
- OHE
- Label Encoders
- Outlier detection-Isolation Fores
- Predictive power Score
- Recurcive Feature Elimination
- PCA
- Splitting data into train and test
- Methods of cross validation
- Accuracy methods
- Bagging
- Boosting
- Random Forest
- XGBM
- LGBM
- Deciding the K value
- Building a KNN model by splitting the data
- Understanding the various generalization and regulation techniques to avoid overfitting and underfitting
- Kernel tricks
- Lasso Regression
- Ridge Regression
- Artificial Neural Network
- Biological Neuron vs Artificial Neuron
- ANN structure
- Activation function
- Network Topology
- Classification Hyperplanes
- Best fit “boundary”
- Gradient Descent
- Stochastic Gradient Descent Intro
- Back Propogation
- Intoduction to concepts of CNN
- Sources of data
- Bag of words
- Pre-processing, corpus Document-Term Matrix (DTM) and TDM
- Word Clouds
- Corpus level word clouds
- Sentiment Analysis
- Positive Word clouds
- Negative word clouds
- Unigram, Bigram, Trigram
- Vector space Modelling
- Word embedding
- Document Similarity using Cosine similarity
- Sentiment Extraction
- Lexicons and Emotion Mining
- Probability – Recap
- Bayes Rule
- Naive Bayes Classifier
- Text Classification using Naive Bayes
- Introduction to time series data
- Steps of forecasting
- Components of time series data
- Scatter plot and Time Plot
- Lag Plot
- ACF – Auto-Correlation Function / Correlogram
- Visualization principles
- Naive forecast methods
- Errors in forecast and its metrics
- Model Based approaches
- Linear Model
- Exponential Model
- Quadratic Model
- Additive Seasonality
- Multiplicative Seasonality
- Model-Based approaches
- AR (Auto-Regressive) model for errors
- Random walk
- ARMA (Auto-Regressive Moving Average), Order p and q
- ARIMA (Auto-Regressive Integrated Moving Average), Order p, d and q
- Data-driven approach to forecasting
- Smoothing techniques
- Moving Average
- Simple Exponential Smoothing
- Holts / Double Exponential Smoothing
- Winters / HoltWinters
- De-seasoning and de-trending
- Forecasting using Python and R
- Concept with a business case
- End to End project Description with deployment using R and Python
Lifetime Access
You will be provided with lifetime access to presentations, quizzes, installation guides and notes.
Assessments
After each training module there will be a quiz to assess your learning.
24*7 Support
We have a lifetime 24*7 Online Expert Support to resolve all your Technical queries.
Forum
We have a community forum for our learners that facilitates further learning through peer interaction and knowledge sharing.
Chaitu

I thought it was very difficult but after attending these classes this is very easy for me. Thank U so much onlineITvidhya... Staff is very good... Good response for
Rishab

It's a great opportunity for me to learn DataScience as a fresher. Thank you very much for the training. The management is good. Thank You.
Darshan

Excellent Training from the trainer and support from the management.