Posts

Some Common Useful DAX Functions for Beginners

Welcome back, guys. As I said in my previous post, getting started with Data Analysis Expressions (DAX) can be intimidating, but becoming knowledgeable will help you unlock new insights into your data. I believe the following DAX functions can get you started on the right path. Let’s work through some common business scenarios. 1. FILTER: The FILTER function returns a subset of a table or expression, as shown below. Let’s say that you want a count of items sold within a specific amount range, e.g. an Amount between 150 and 250. We will use the COUNTROWS function (as the name suggests, it counts records), which counts the number of rows in the specified table, along with the FILTER function to achieve this: Count of sales between 150 and 250 = COUNTROWS(FILTER('Sales', 'Sales'[Sales...
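For readers who think more easily in general-purpose code, the filter-then-count idea in this excerpt can be sketched in Python. This is only an analogy, not the post's DAX formula: the `sales` records, the `amount` field, and the helper `count_sales_between` are all hypothetical, and the range is assumed to include both endpoints.

```python
# Hypothetical sales records; the field names are assumptions for illustration.
sales = [
    {"item": "A", "amount": 120.0},
    {"item": "B", "amount": 150.0},
    {"item": "C", "amount": 199.5},
    {"item": "D", "amount": 250.0},
    {"item": "E", "amount": 300.0},
]


def count_sales_between(rows, low, high):
    """Keep only rows whose amount is in [low, high], then count them."""
    # Step 1: filter the table down to the matching rows.
    in_range = [row for row in rows if low <= row["amount"] <= high]
    # Step 2: count the rows that survived the filter.
    return len(in_range)


count_150_to_250 = count_sales_between(sales, 150, 250)
print(count_150_to_250)
```

The two steps map loosely onto the roles the excerpt gives to FILTER (select a subset of rows) and COUNTROWS (count the rows in that subset).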

Introduction to DAX (Quick Overview)

This is just an overview of the DAX language. We will begin calculations and more extensive analysis in my next post. Enjoy. What is DAX?: Data Analysis Expressions (DAX) is a functional language currently used in Power BI Desktop, Analysis Services Tabular, and Power Pivot for Excel. It is a collection of functions that can be used to calculate and return one or more values. In DAX, functions can contain other, nested functions and value references, depending on your model and the key business questions to be answered. DAX includes some of the functions used in Excel formulas plus other functions designed specifically to work with relational data and perform dynamic aggregations. Why DAX?: The difficult questions facing a business or an organization can be simplified using DAX, shedding more light on your data. Learning how to create effective DA...

Fundamentals of MapReduce (New to MapReduce?)

Image
People have been asking me for some details on the MapReduce concept. This is a very interesting topic to write about. If you have read my previous post, you will have seen my introduction to Big Data and Hadoop. Now I am going to talk about MapReduce as the heart of Hadoop. Some of you might be new to this, but do not worry: it will be described in a way you can quickly understand. For Java developers it might be much easier, but if you do not have Java experience, you can still learn some basic Java and master MapReduce. MapReduce is a programming framework for distributed, parallel processing of large data sets. I am talking about massive scalability across hundreds or thousands of servers in a Hadoop cluster. Just imagine that for a second. As you can see in the diagram above, we have the “Input, Map task, Reduce task ...
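The map and reduce tasks mentioned in this excerpt can be illustrated with a small, single-machine word count in Python. This is a sketch of the programming model only, not Hadoop's Java API: the function names `map_phase`, `shuffle`, and `reduce_phase` and the sample input are assumptions made for illustration, and a real cluster would run these steps in parallel across many servers.

```python
from collections import defaultdict


def map_phase(line):
    """Map task: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]


def shuffle(pairs):
    """Group all emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce task: combine all values for one key into a single result."""
    return key, sum(values)


def word_count(lines):
    # Run the map task over every input line and collect the pairs.
    mapped = [pair for line in lines for pair in map_phase(line)]
    # Group the pairs by word, then reduce each group to a total.
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input standing in for a large distributed data set.
sample_lines = ["big data big ideas", "data moves fast"]
counts = word_count(sample_lines)
print(counts)
```

Because each line is mapped independently and each word is reduced independently, both phases can be split across machines, which is what makes the model scale.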

Scala and Python for Apache Spark

What is Scala?: Scala combines object-oriented and functional programming in one concise, high-level language. Scala's static types help avoid bugs in complex applications, and its JVM and JavaScript runtimes let you build high-performance systems with easy access to huge ecosystems of libraries. What is Python?: Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built-in data structures, combined with dynamic typing and dynamic binding, make it very attractive for Rapid Application Development, as well as for use as a scripting or glue language to connect existing components together. Both Python and Scala offer programmers a lot of productivity, and both are useful tools for data scientists. Most learn both languages for Apache Spark. However, the majority prefer Scala to ...

Introduction to BIG DATA: Examples, Types & Characteristics

Big Data! Do you really know exactly what it is, and its influence on the world today? In order to understand the term 'Big Data', we first need to know what 'data' is. The Oxford dictionary defines 'data' as: "The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media." Now imagine what ‘Big Data’ is. Big Data is a term for a collection of data sets that are so large and complex that they are difficult to store and process using available database management tools or traditional data processing applications. Examples of Big Data: The following are some examples of 'Big Data': The New York Stock Exchange g...

Super Store (Tableau Data Visualization)

Image

Growth and Revenue (Tableau Data Visualization)

Image