Posts

Fundamentals of MapReduce (New to MapReduce?)

Image
So people have been asking me to give some details on MapReduce concept. This is a very interesting topic to write about. If you have read my previous post, you would have seen my introduction to Big Data and Hadoop. Now I am going to talk about MapReduce has the heart of Hadoop. Some of you might be new to this, but do not worry, it is going to be described in a way you will quickly understand. To Java developers, it might be much easier, but if you do not have experience in java skills, you can still learn some basic java and master MapReduce. MapReduce is a programming framework that allows performance of distributed and parallel processing on large data sets in a distributed environment. I am talking massive scalability across hundreds or thousands of servers in a Hadoop cluster. Just imagine that for a second. If you see in the diagram above, we have the “Input, Map task, Reduce task ...

Scala and Python for Apache Spark

Image
What is Scala?: Scala combines object-oriented and functional programming in one concise, high-level language. Scala's static types help avoid bugs in complex applications, and its JVM and JavaScript runtimes let you build high-performance systems with easy access to huge ecosystems of libraries. What is Python?: Python is an interpreted, object-oriented, high-level programming language with dynamic semantics. Its high-level built in data structures, combined with dynamic typing and dynamic binding, make it very attractive for Rapid Application Development, as well as for use as a scripting or glue language to connect existing components together. Both Python and Scala programming languages offer a lot of productivity to programmers. They are useful tools among data scientists. Most learn both languages for Apache Spark. However, majority prefer Scala to ...

Introduction to BIG DATA: Examples, Types & Characteristics

Image
Big Data! Yes, do you really know what exactly it is, and its influence to the world today?. In order to understand the term 'Big Data' , we first need to know what 'Data' is. Oxford dictionary defines 'data' as - "The quantities, characters, or symbols on which operations are performed by a computer, which may be stored and transmitted in the form of electrical signals and recorded on magnetic, optical, or mechanical recording media. " Now imagine what ‘Big Data’ is. Big Data is a term used for a collection of data sets that are large and complex, which are difficult to store and process using available database management tools or traditional data processing applications. Examples of Big Data: The following are some of the examples of 'Big Data'- The New York Stock Exchange g...

Super Store (Tableau Data Visualization)

Image

Growth and Revenue (Tableau Data Visualization)

Image

How to set SQL Server connection string in ASP.Net?

Firstly;  .NET DataProvider -- Standard Connection with username and password: Put this at the top of your code: using System.Data.SqlClient; Put this  in your code body: SqlConnection conn = new SqlConnection(); conn.ConnectionString = "Data Source=ServerName;" + "Initial Catalog=DataBaseName;" + "User id=UserName;" + "Password=UserPassword;"; conn.Open(); Secondly;  .NET DataProvider -- Trusted Connection: Put this at the top of your code: using System.Data.SqlClient; Put this  in your code body: SqlConnection conn = new SqlConnection(); conn.ConnectionString = "Data Source=ServerName;" + "Initial Catalog=DataBaseName;" + "Integrated Security=SSPI;"; conn.Open(); Thirdly; .NET Configuration manager (Web configuration); Put this at the top of your code: using System.Web.Configuration; using System.Data.SqlClient; Put this in Web.Config: <connectionStrings > <add name="myCo...

Tuples and Lists Type Conversions (Python)

Image
Why would you convert lists to tuples? Let’s say you have such data which you never want to change, then you should use tuple. (tuples == immutable). They work like the array object in JavaScript. You can add items, delete items from a list; but you cannot do that to a tuple, tuples have a fixed size. Why would you convert tuples to lists? Let’s say you want to make changes to the initial tuple created, even though you know you can't modify the data directly. Therefore, you can convert them to lists and then make the change, then convert them back to tuples. (list == mutable). Examples bellow shows conversion between tuple and list and vice versa. Here is an example demonstrating the mutable nature of lists in Python: An example showing the immutable nature of tuples in Python: An example of converting between tuples and lists to edit data: Tip:  When you create a variable, some fixed memory is assigned to the variable. If it i...