Hi Guys,
This blog is for guys who are interested in words like BigData,Hadoop,Spark etc ..
Let me tell you one thing, i may be wrong on many steps .but this is what i like about myself and i am happy with it..
There are no rules and no start or stop point to learn Hadoop,BigData Spark etc..but one of the main rule is to unlearn everything if you are starting from scratch..
It would be good to have java knowledge but not mandatory.
Why Hadoop ,Spark ,etc
There are many definitions but i like simple and short ...
Hadoop is open source platform,to handle bigdata,its processing,analytics on commodity hardware, we need tools like Hadoop,Spark ,etc.
running application on commodity hardware is the main advantage it reduces cost by 50 times....
(BIGDATA>>>>>>>>>>>1GB)
When Hadoop..
You can say that ok i need hadoop to analyse my logs...for learning purposes..(don't do this activity)
use hadoop and its ecosystem for bigger things like Trade analytics,analysis of browsing data on a particular website,make a recommender system,analyse twitter feeds etc...
How Hadoop.
There are many vendors who provide Hadoop Ecosystem.
Cloudera,Hortonworks,MapR,apacheBigTop,IBM BigInsights etc...there is one from microsoft but it is paid...
you need to understand what u want to b . a bigdata analyst , hadoop administrator,or a tester.
I choose Analyst part...for that you need to install cloudera quickstart or hotonworks sandbox or any other hadoop platform on your laptop(>8GB).for installation plz follow below link.or see you tube videos.
This blog is for guys who are interested in words like BigData,Hadoop,Spark etc ..
Let me tell you one thing, i may be wrong on many steps .but this is what i like about myself and i am happy with it..
There are no rules and no start or stop point to learn Hadoop,BigData Spark etc..but one of the main rule is to unlearn everything if you are starting from scratch..
It would be good to have java knowledge but not mandatory.
Why Hadoop ,Spark ,etc
There are many definitions but i like simple and short ...
Hadoop is open source platform,to handle bigdata,its processing,analytics on commodity hardware, we need tools like Hadoop,Spark ,etc.
running application on commodity hardware is the main advantage it reduces cost by 50 times....
(BIGDATA>>>>>>>>>>>1GB)
When Hadoop..
You can say that ok i need hadoop to analyse my logs...for learning purposes..(don't do this activity)
use hadoop and its ecosystem for bigger things like Trade analytics,analysis of browsing data on a particular website,make a recommender system,analyse twitter feeds etc...
How Hadoop.
There are many vendors who provide Hadoop Ecosystem.
Cloudera,Hortonworks,MapR,apacheBigTop,IBM BigInsights etc...there is one from microsoft but it is paid...
you need to understand what u want to b . a bigdata analyst , hadoop administrator,or a tester.
I choose Analyst part...for that you need to install cloudera quickstart or hotonworks sandbox or any other hadoop platform on your laptop(>8GB).for installation plz follow below link.or see you tube videos.
Books to follow:
Definitive Guide from Tom White and Hadoop in action.
No comments:
Post a Comment