Artificial Intelligence Demystified
Machine Learning
Introduction Artificial Intelligence has become a very popular term today. There is sure to be at least one article in the newspaper daily on …
Solutions for Skilltest Machine Learning : Revealed
Machine Learning
Python
R
Introduction Automation and intelligence have always been driving forces for technological advancement. Techniques like machine learning enable these advancements in every domain possible. …
Skilltest Statistics II – Solutions
Business Analytics
Introduction Statistics is one of the key ingredients any data scientist must know to have a long, successful career in the data science industry. After the …
Beginners Guide to Topic Modeling in Python
Business Analytics
Python
Introduction The analytics industry is all about obtaining “information” from data. With the growing amount of data in recent years, that too mostly …
The Ultimate Plan to Become a Data Scientist in 2016
Business Analytics
Infographics
Machine Learning
Python
Qlikview
R
Introduction Data Scientist is one of the hottest jobs of this decade. The demand for data scientists is much higher than available candidates (Source). …
New Year Resolutions for a Data Scientist
Business Analytics
Machine Learning
Python
R
Introduction New Year is not just replacing your table calendar with a new one or waking up next morning rubbing your eyes. It’s celebrating …
7 Important Ways to Summarise Data in R
Business Analytics
R
Introduction People remain confused when it comes to summarizing data real quick in R. There are various options. But, which one is the best …
Simple Guide to Logistic Regression in R
Business Analytics
R
Introduction Every machine learning algorithm works best under a given set of conditions. Making sure your algorithm fits the assumptions / requirements ensures superior …
Hacks to perform faster Text Mining in R
Big data
Business Analytics
R
Introduction Data science demands versatility. Move away from your regular methods, challenge your ways of working, explore new ways of doing things more efficiently. …
R-analyst Cheat sheet: Data Visualization in R
Business Analytics
Infographics
R
Introduction Data visualization has become an integral part of the data science workflow. Hence, your main tool needs to have strong capabilities on both …
7 most commonly asked questions on Correlation
Business Analytics
Introduction The natural trajectory of learning statistics begins with measures of central tendency, followed by correlation and regression, and then other advanced concepts. Amongst these initial concepts, …
Test your fit as a Data Scientist
Business Analytics
While there has been a lot of buzz lately around the demand for data scientists, there are limited resources which provide a clear answer …
Machine Learning basics for a newbie
Business Analytics
Introduction There has been a renewed interest in machine learning in the last few years. This revival seems to be driven by strong fundamentals – …
Cheat Sheet for Exploratory Data Analysis in Python
Business Analytics
Infographics
Python
Introduction The secret behind creating powerful predictive models is to understand the data really well. Therefore, it is advisable to work through the essential steps of data …
Data visualization guide for SAS
Business Intelligence
SAS
Introduction A picture is worth a thousand words! In today’s competitive environment, companies want a faster decision-making process, thus ensuring they stay ahead in the race. …
Cheat sheet: Data Visualisation in Python
Business Analytics
Infographics
Python
Introduction It is said, ‘Visually presented data speaks for itself’. Data, served in the right visual form, brings out hidden trends and insights …
All out beginner’s guide to MongoDB
Big data
Business Intelligence
Introduction Necessity is the mother of innovation! This is an old proverb, but it still holds damn good! The last decade has pushed the boundaries …
k-Fold Cross Validation made simple
Business Analytics
Python
R
Does your high-performing model degrade or perform poorly on an out-of-time sample? Has your Kaggle Private score come down from your Public score significantly? Not …
PageRank explained in simple terms!
Big data
Business Analytics
In my previous article, we talked about information retrieval. We also talked about how a machine can read context from free text. Let’s talk about …
7 Steps of Data Exploration & Preparation – Part 2
Business Analytics
Introduction In Part-1 of this series, we looked at the first three steps of Data Exploration & Preparation, namely Variable identification, Univariate and Bivariate analysis. In this …
How to avoid Over-fitting using Regularization?
Business Analytics
Occam’s Razor, a problem-solving principle, states that “Among competing hypotheses, the one with the fewest assumptions should be selected. Other, more complicated solutions may ultimately …
How to create Box-Plot chart in Qlikview?
Business Intelligence
Qlikview
The use of this article is best illustrated by a case study. So let’s dive straight in. Business Situation: Recently, we entered 2015 and …
Decision Tree Algorithms – Simplified
Business Analytics
In the last article, we looked at the basics of decision trees and how they help in classification. We also looked at advantages and disadvantages …
Decision Tree – Simplified!
Business Analytics
I started working as a business analyst in my previous organisation. I transitioned from a Business Intelligence (BI) Analyst to become a Business Analyst. …
Comprehensive Introduction to merging in SAS
Business Analytics
SAS
In my previous article, “Combining data sets in SAS – Simplified”, we discussed three methods to combine data sets – appending, concatenating and interleaving. …
Basics of Image Processing in Python
Big data
Business Analytics
Python
Writing today’s article was a fascinating experience for me, and reading it should be one for the readers of this blog too. What’s so different? Two things: firstly the …
Combining datasets in SAS – simplified!
Business Analytics
SAS
One of the most common tasks every analyst performs multiple times in a project is combining data sets. There are various ways to combine …
NoSQL Databases : Simplified
Big data
Business Intelligence
My father always hesitates while making big-ticket transactions online. He is always scared of a machine making an error. Just imagine that you transfer …
Introduction to PIG Latin
Big data
In the previous article, we discussed the Hadoop ecosystem (link). We also spoke about the two most heavily used Hadoop tools, i.e. PIG and …
Comprehensive guide to SAS PROC Format
Business Analytics
Business Intelligence
SAS
I have spent a significant part of my career as a data visualization guy. I am very particular about the formatting and presentation of …
Hadoop beyond traditional MapReduce – Simplified
Big data
Business Analytics
Business Intelligence
In previous articles on Hadoop, our focus has been on MapReduce routines. MapReduce routines are the basic functional unit of a Hadoop system. Following are …
Synthetic Keys in Qlikview – simplified!
Business Intelligence
Qlikview
Before I discuss Synthetic Keys, let’s look at a typical QV data model (in the diagram on the right). Here, we can see three …
Next step in the world of SAS IML
Big data
Business Analytics
SAS
In the last article on IML (here), we introduced you to the world of Matrix language on SAS. We also talked about some …
Index page to learn everything about Analytics
Big data
Business Analytics
Business Intelligence
Python
Qlikview
R
SAS
Web Analytics
Analytics Vidhya has been a tremendous journey for us. Today, when we look back at the journey we have covered so far – it …
Introduction to SAS Macros – Functions
Business Analytics
SAS
In the last 2 articles, we looked at the basic concept of SAS Macros and how they become useful to accomplish repetitive tasks easily. We …
Support Vector Machine – Simplified
Business Analytics
The first time I heard the name “Support Vector Machine”, I felt that if the name itself sounds so complicated, the formulation of the concept …
Introduction to SAS Macros
Business Analytics
Business Intelligence
SAS
A quick example: Let’s look at the following SAS program: The above SAS code is written to extract policy-level details for 09-Sep-14 and let …
An exciting update from us – hopefully a learning aid for you!
One of the most common queries we receive through several forums is: Which is the best training for me? OR What is the right …
Market mix modeling – Simplified!
Business Analytics
SAS
The US market spends, on average, more than $140 billion on marketing alone every year. Given that marketing is such an important component of …
Introduction to Markov chain : simplified!
Big data
Business Analytics
A Markov chain is a simple concept which can explain the most complicated real-time processes. Speech recognition, text identifiers, path recognition and many other artificial intelligence …
Introduction to Random forest – Simplified
Big data
Business Analytics
With the increase in computational power, we can now choose algorithms which perform very intensive calculations. One such algorithm is “Random Forest”, which we will discuss …
Introduction to MapReduce
Big data
Business Analytics
MapReduce is a programming model for processing large data sets with a parallel, distributed algorithm on a cluster (source: Wikipedia). MapReduce, when coupled with HDFS …
What is Hadoop? – Simplified!
Big data
Business Analytics
Scenario 1: Any global bank today has more than 100 million customers doing billions of transactions every month. Scenario 2: Social network websites or eCommerce …
4 Tricky R interview questions
Business Analytics
R
SAS
The analytics industry in India is currently dominated by SAS. But it would be too optimistic to hope that this remains so in years to …
Analytics events in 2014 – India and abroad
Big data
Business Analytics
Business Intelligence
One of the queries I frequently get on my blog is: Which events / conferences are happening in India and are they worth attending? …
Is survival analysis the right model for you?
Big data
Business Analytics
I was a post-graduate in Mechanical Engineering when I joined the analytics industry as a fresher. The only background I had in the analytics industry was …
We just turned 1!
Big data
Business Analytics
Business Intelligence
Yes, that’s right! The first article on Analytics Vidhya went live exactly a year ago (20th April 2013). In less than a year, Analytics …
The importance of context for an analyst!
Business Analytics
Business Intelligence
We analysts enjoy crisp, objective and to the point conversations. An ideal conversation for us is when we come straight to the point, discuss …
Tricky Base SAS interview questions : Part-II
Big data
Business Analytics
SAS
SAS is the largest market-share holder for advanced analytics. If you are going to work in the analytics industry, it is impossible to escape from the …
8 rules for new age analytics learning!
Big data
Business Analytics
Business Intelligence
Data science has become one of the most dynamic fields. Every alternate month I hear about a startup coming up with next gen …
Freelancing consultant – SAS, India’s leading travel portal
Jobs - Business Intelligence
We are looking for a consultant who would help in setting up & integrating SAS for us. This would be a 2-week role and …
How to Use AGGR () function in Qlikview?
Business Intelligence
Qlikview
The main purpose behind creating any dashboard is to summarize and aggregate information in a manner that can be communicated visually and clearly. Traditionally, …
An analytics interview case study
Business Analytics
The case study is the most important round for any analytics hiring. However, a lot of people feel nervous at the mention of undergoing a …
Tips for creating a winning dashboard
Business Intelligence
Qlikview
Recently I came across this article from Software Advice, a website that reviews business intelligence tools, called “Winning Dashboard Creation Tips from the Qlikview Open Data …
Set Analysis in QlikView – simplified!
Business Intelligence
Qlikview
One of the best practices I follow while preparing any report / dashboard is to provide a lot of context. This typically makes a …
Boon from big data or loss of privacy?
Big data
Business Analytics
Today’s post is going to be different. There is no technical subject matter I am going to talk about. But the article is far …
Customized Reporting in Qlikview
Business Intelligence
Qlikview
As a BI professional, I am used to receiving ad-hoc reporting requirements from business users which need a fast turn-around (sometimes under the name …
My resolutions for 2014
Business Analytics
In my last post, I mentioned how 2013 has been a phenomenal year for me. I can’t wait to continue the momentum in 2014. …
Highlights of 2013
Big data
Business Analytics
Business Intelligence
Qlikview
SAS
Web Analytics
2013 has been an outstanding year for me personally. Among other things, there have been 2 key highlights for this year: Becoming a father …
Being paranoid about data accuracy!
Big data
Business Analytics
Business Intelligence
SAS
As the day was coming to a close, I thought of fitting in another meeting. Two analysts in my team had been working for …
Review: Tableau 8.1
Business Intelligence
Qlikview
As a Business Analyst, I have been a predictive modeler for most of my career. The majority of this time was spent on SAS along …
Getting your clustering right (Part II)
Big data
Business Analytics
SAS
I was staring at the computer screen for the final clustering result. Finally, I opened the output file and found the first cluster with …
Getting your clustering right (Part I)
Business Analytics
SAS
Web Analytics
Clustering is one of the toughest modelling techniques. It takes not only sound technical knowledge, but also a good understanding of the business. We have split …
Trick to enhance power of Regression model
Business Analytics
SAS
We, as analysts, specialize in optimization of already optimized processes. As the optimization gets finer, the opportunity to make the process better gets thinner. One …
Must read books (and blogs) on Web Analytics
Business Analytics
Business Intelligence
Web Analytics
I love reading! By reading something every day before sleeping, I not only continue my learning, but also end my day on a fulfilling …
Five habits of highly successful analysts
Big data
Business Analytics
Business Intelligence
I have interacted with various successful analysts over the last 7 years. During these interactions, I found out some common habits in them. After observing these …
Must read books on data visualization
Big data
Business Analytics
Business Intelligence
It is not a coincidence that all highly successful analysts have excellent data visualization skills. As a matter of fact, I think data visualization …
Upcoming trends in data visualization
Big data
Business Analytics
Business Intelligence
Recently, we were blessed with a baby girl. Among a lot of other things, one thing which keeps mesmerizing me is the continuous change …
A small break to celebrate!
Let me not give it away simply…. Assuming that today’s date is T and T – 1 represents yesterday. Further, if I …
How to become an analytics rockstar?
Business Analytics
Business Intelligence
I still remember the first day of my first job. I walked into the office with high ambitions and little understanding of what it …
How to start a career in Business Analytics?
Business Analytics
Business Intelligence
SAS
Web Analytics
Every time I attend any analytics forum or interact with students, two questions stand out on account of the number of times they are asked: …
How to create Financial models flawlessly?
Business Analytics
Business Intelligence
Qlikview
Recently, I met one of my friends, who works in the strategy team of a bank, over lunch. I felt bad for something which he mentioned …
Basics of Predictive modeling
Business Analytics
Imagine how the world would change when any advertisement you receive is only about a product you are interested in. How beautiful it would …
Welcome to Analytics Vidhya!
Welcome to Analytics Vidhya! For those of you who are wondering what “Analytics Vidhya” is, “Analytics” can be defined as the science of extracting …