Hadoop
4.6K views | +0 today
Follow
Hadoop
Everything around Hadoop
Your new post is loading...
Your new post is loading...
Scooped by Sylvain Kalache
Scoop.it!

Intro To Hadoop

These are slides from a lecture given at the UC Berkeley School of Information for the Analyzing Big Data with Twitter class.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

storm at twitter

Talk given at facebook's analytics@webscale conference. Covers storm basics, system overview, architecture at twitter and current use-cases. Featuring Hadoop

No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

HBase Consistency and Performance Improvements

The latest Apache HBase releases, 0.92 and 0.94, contain many improvements over prior releases in terms of correctness and performance improvements. We discuss a couple of these improvements from a development and operations perspective.

No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Apache HBase on Amazon EMR - Real-time Access to Your Big Data

Apache HBase on Amazon EMR - Real-time Access to Your Big Data | Hadoop | Scoop.it
All Your Base AWS has already given you a lot of storage and processing options to choose from, and today we are adding a really important one. You can now use Apache HBase to store and process extremely large amounts...
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Get Hadoop skills, get a job.

Get Hadoop skills, get a job. | Hadoop | Scoop.it
Research mongodb, postgresql, couchdb, redis, hbase, cassandra, riak, hadoop job trends and the demand for mongodb, postgresql, couchdb, redis, hbase, cassandra, riak, hadoop jobs at Indeed.com.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Realtime Analytics for Big Data: A Facebook Case Study

Knowing what your users are doing on your site in real time and matching what they do with more targeted information transforms into better conversion rate and better user satisfaction, which means more money in the end.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

What it really means when someone says 'Hadoop'

What it really means when someone says 'Hadoop' | Hadoop | Scoop.it

Big data is among the hottest trends in IT right now, and Hadoop stands front and center in the discussion, but many people don’t seem to know exactly what it means when somebody says “Hadoop.”

No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Download Hadoop: The Definitive Guide ebook

Download Hadoop: The Definitive Guide ebook | Hadoop | Scoop.it
Hadoop: The Definitive Guide book download (Download Hadoop: The Definitive Guide ebook: Hadoop: The Definitive Guide book download
Tom White


Download Had...)...
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

We are the big data problem

We are the big data problem | Hadoop | Scoop.it
Like a kid waiting for Christmas to come, I have been watching the Mahout project with great anticipation. When you toss around the concepts of map/reduce and machine learning, there's an awful lot of potential for radical ninjas to ensue. While the magical science-fiction world of artificial intelligence is still in a fetal state within our reality, it is, nonetheless, a growing science
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

2010 Cloudera Apache Hadoop Webinars « Cloudera » Apache Hadoop for the Enterprise

Cloudera offers enterprises a powerful new data platform built on the popular Apache Hadoop open-source software package. (2010 Cloudera Apache Hadoop Webinars: Cloudera produced several webinars in 2010 providing attendees with insigh... http://bit.ly/gCjXEz)
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Looking at the code behind our three uses of Apache Hadoop | Facebook

Looking at the code behind our three uses of Apache Hadoop | Facebook | Hadoop | Scoop.it
Facebook est un réseau social qui vous relie à des amis, des collègues de travail, des camarades de classe ou d’autres personnes qui ont quelque chose à partager avec vous. Grâce à Facebook, vous pourrez rester en contact avec vos amis, charger un nombre illimité de photos, publier des liens et des vidéos… et faire plus ample connaissance avec les personnes que vous rencontrez. (Facebook Blog | Looking at the code behind our three uses of Apache Hadoop http://on.fb.me/eIbJer)
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Download Now [HF] Hadoop: The Definitive Guide, 2nd Edition Only In 2Down.us

Download Now [HF] Hadoop: The Definitive Guide, 2nd Edition Only In 2Down.us | Hadoop | Scoop.it
2down.us download NowBook DescriptionDiscover how Apache Hadoop can unleash the power of your data. This comprehensive resource Shows you how to build and maintain reliable, scalable, distributed systems with the Hadoop framework - an open source implementation of MapReduce, the algorithm on which Google built its empire. Programmers will find details for analyzing datasets of any size, and administrators will learn how to set up and run Hadoop clusters.This revised edition covers recent changes to Hadoop, including new features such as Hive, Sqoop, and Avro. It also provides illuminating case studies that illustrate how Hadoop is used to solve specific problems. Looking to get the most out of your data? This is your book. * Use the Hadoop Distributed File System (HDFS) for storing large datasets, then run distributed computations over those datasets with MapReduce * Become familiar with Hadoop's data and I/O building blocks for compression, data integrity, serialization, and persistence * Discover common pitfalls and advanced features for writing real-world MapReduce programs * Design, build, and administer a dedicated Hadoop cluster, or run Hadoop in the cloud * Use Pig, a high-level query language for large-scale data processing * Analyze datasets with Hive, Hadoop's data warehousing system * Take advantage of HBase, Hadoop's database for structured and semi-structured data * Learn ZooKeeper, a toolkit of coordination primitives for building distributed systems"Now you have the opportunity to learn about Hadoop from a master - not only of the technology, but also of common sense and plain talk."-Doug Cutting, ClouderaAbout the AuthorTom White has been an Apache Hadoop committer since February 2007, and is a member of the Apache Software Foundation. He works for Cloudera, a company set up to offer Hadoop support and training. Previously he was as an independent Hadoop consultant, working with companies to set up, use, and extend Hadoop. He has written numerous articles for O'Reilly, java.net and IBM's developerWorks, and has spoken at several conferences, including at ApacheCon 2008 on Hadoop. Tom has a Bachelor's degree in Mathematics from the University of Cambridge and a Master's in Philosophy of Science from the University of Leeds, UK.Book Details * Paperback: 624 pages * Publisher: O'Reilly Media; 2 edition (October, 2010) * Language: English * ISBN-10: 1449389732 * ISBN-13: 978-1449389734Download files from Hotfile : Code: http://hotfile.com/dl/73147882/2bd91....2010.rar.htmlDownload files from Fileserve : Code: http://www.fileserve.com/file/K6m3zsp2down.us Download Now
No comment yet.
Suggested by promptcloud
Scoop.it!

Big Data, the dark knight we all need?

Big Data, the dark knight we all need? | Hadoop | Scoop.it
Over the last few years, state-sponsored data collection has come to the fore thanks to whistle-blowers and ex-spies. Since then, the clamor for calling the line between private and public data for...
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

HBase @ Twitter

A presentation given at an HBase meetup hosted at Twitter on 7/16/2013.

Kun Le's curator insight, July 19, 2013 12:01 AM

add your insight...

Scooped by Sylvain Kalache
Scoop.it!

Hadoop Successes and Failures to Drive Deployment Evolution

Hadoop Successes and Failures to Drive Deployment Evolution...
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Hadoop Distributed File System Reliability and Durability at Facebook

HDFS data, architecture and issues at Facebook

No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Hadoop and Vertica: The Data Analytics Platform at Twitter

Hadoop Summit 2012 - Hadoop and Vertica: The Data Analytics Platform at Twitter

No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Storage Infrastructure Behind Facebook Messages

Storage Infrastructure Behind Facebook Messages HBase/HDFS/ZK/MapReduce

No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

HBase Schema Design - Things you need to know

When designing schemas for HBase, be it from scratch or porting an existing application over from a relational database for example, there are a set of architectural constraints that a user should be aware of to avoid common pitfalls.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Why big data means a big year for Hadoop

Why big data means a big year for Hadoop | Hadoop | Scoop.it
Companies from IBM to Amazon are turning to Hadoop to manage the surge in data that needs storing.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

What is Hadoop? | Java,maven,Hadoop,Pig,Hive tutorials with examples

What is Hadoop? | Java,maven,Hadoop,Pig,Hive tutorials with examples | Hadoop | Scoop.it
Hadoop analyze and process large amount of data i.e peta bytes of data in parallel with less time located in distributed environment.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

What Factors Justify the Use of Apache Hadoop?

What Factors Justify the Use of Apache Hadoop? | Hadoop | Scoop.it
The question posed at this week’s San Francisco Hadoop User Group is a common one: “what factors justify the use of an Apache Hadoop cluster vs. traditional approaches?” The answer you receive depends on who you ask.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

Companies which run Hadoop

Companies which run Hadoop | Hadoop | Scoop.it
This page documents an alphabetical list of institutions that are using Hadoop for educational or production uses.
No comment yet.
Scooped by Sylvain Kalache
Scoop.it!

HBase/Hadoop on Mac OS X (Pseudo-Distributed) « Cognizant Transmutaion

HBase/Hadoop on Mac OS X (Pseudo-Distributed) « Cognizant Transmutaion | Hadoop | Scoop.it
I wanted to do some experimenting with various tools for doing Hadoop and HBase activities and didn’t want to have to bother making it work with our Cluster in the Cloud. I just wanted a simple experimental environment on my Macbook Pro running Snow Leopard Mac OS X.
No comment yet.