provides software and services that enable enterprises
Live Chat 1-888-673-6564
The Enterprise Open Source Blog
  • Home
  • Search
  • Contact Us
  • Products and Support
  • Services
  • Enterprise OSS Blog
  • Wazi Technical Blog
  • Resources Library
  • Cloud Services
  • Partners
  • Customers
  • Community
  • Company
  • Careers
  • News and Events

Subscribe by Email

Your email:

Most Popular Posts

  • Enterprise Apache Tomcat 7 Clustering - Designing an Efficient, Reliable and Productive Application Server Cluster
  • Open Source Virtual Whiteboards and Dimdim Review
  • An Enterprise Apache Tomcat Clustering Guide
  • Supporting CentOS In The Cloud With Windows Azure
  • VLC License Change: A lesson in perseverance
  • An In-Depth Look at Tomcat’s Clustering Mechanisms
  • Apache HTTP Server: New Features for Version 2.4
  • Why Closed Source is Better Than Open Source
  • Access Serial Ports through Ruby
  • Building Bots With Kids

Current Articles | RSS Feed RSS Feed

Cloud Technology, Big Data & Hadoop

Posted by Aaron Mandelbaum on Thu, Mar 01, 2012
  
Email This Email Article  
Tweet  
  

Big Data seems to be the word of the year.  Everywhere you look there is Big Data staring you in the face: Blogs dedicated to discussing Big Data; LinkedIn groups with over 10,000 members focusing on Big Data topics.  Just under 15,000 times each month, the exact phrase Big Data, is entered into the search engines of people across the world.

Wikipedia begins to describe Big Data as datasets that grow so large that they become awkward to work with using on-hand database management tools.  Because there is so much content being created at speeds we have never before seen, there is this preconceived notion that in order to be as efficient and productive and intuitive as we all want to be, Big Data needs to be addressed to some degree or another.

Three notions that seem to be facilitating this Big Data movement include:

  1. Faster analysis of larger operational data will help you make better decisions.

  2. More in-depth analysis of customer data will guide you to better customer segmentation.

  3. Insight into larger data sets will help you come up with innovative product design.


Usually when Big Data is mentioned, the reference or inclusion of cloud technology is not far behind.  That discussion includes public and private clouds, scalability, flexibility, open PaaS and all of the other cloud characteristics.  Therein lies the solution that many have already found, and that is Hadoop. "The Apache Hadoop software library is a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model." Not every cloud needs Hadoop, and Hadoop doesn't need the cloud.  But when leveraged collectively you have found a pretty good solution for your Big Data concerns.

Some interesting users of Hadoop include:

  • Amazon, eBay, Rackspace, Twitter, among many others (http://wiki.apache.org/hadoop/PoweredBy).

  • LinkedIn uses it on 5,200 cores to manage 7.2 PB or 7,200,000 GB of data.

  • Facebook uses it on 11,200 cores to manage 12 PB.

  • Yahoo uses it on 36,000 cores to manage 18 PB.


At OpenLogic, we store all of the world's open source code in Hadoop as part of our scanning solution.



Hadoop is now a generally accepted viable open source solution for your Big Data dilemma. It is by far the leading solution in its field. It is the Linux of large-scale data processing and is understood to be the long term solution for organizations of all sizes.  Like all open source, Hadoop is a great win-win.  It saves the wasted energy of developing many different and not nearly as capable closed solutions. Also, because Hadoop has become so capable, it can provide us with the reliable data management foundation on which so many of the large-scale Internet innovations of the last few years have been built.

Harnessing cloud technology and the power that Haddoop provides us with, we can realize the agility that we are all looking for, as we slowly migrate different parts of our enterprise, and even our personal lives, into some dynamic of the public and private clouds. Cloud technology offers the end user many different ways to calculate ROI and justify the move to the cloud.

Big Data can be managed with the right resources and technology while still providing you and your customers with the opportunity to be agile.  Ramp up ramp down, be creative, be innovative,  and use data driven decision making to do it as efficiently as possible.


Source: cloudtweaks.com via CloudTweaks on Pinterest




 

Subscribe to The Enterprise Open Source Blog via email

Follow @openlogic
Follow @cloudswing
Follow @AaronMandelbaum

View Aaron  Mandelbaum's LinkedIn profileView Aaron Mandelbaum's profile

This work is licensed under a Creative Commons Attribution 3.0 Unported License

Creative Commons License.Follow @openlogic
Follow @OSCloudServices

This work is licensed under a Creative Commons Attribution 3.0 Unported License
Creative Commons License.
Tags: Open Source Trends, The Cloud

Comments

Currently, there are no comments. Be the first to post one!
Post Comment
Name
 *
Email
 *
Website (optional)
Comment
 *

Allowed tags: <a> link, <b> bold, <i> italics

Loading...
Error sending email
Email sent successfully

Email article
Email To : 
Your name : 
Message : (maximum 200 characters)

Enterprise OSS Blog Policy

If you read a post on The Enterprise OSS Blog, please leave a comment. Let us know what you think, even if it's just a few words. Comments do not require approval, but they are moderated.OpenLogic reserves the right to remove any comments it deems inappropriate.

 

click-to-chat-with-a-live-open-source-expert

get-a-quote-on-support

download-the-support-evaluation-kit

schedule-a-deep-discovery-demo

Most Popular Posts

  • Enterprise Apache Tomcat 7 Clustering - Designing an Efficient, Reliable and Productive Application Server Cluster
  • Open Source Virtual Whiteboards and Dimdim Review
  • An Enterprise Apache Tomcat Clustering Guide
  • Supporting CentOS In The Cloud With Windows Azure
  • VLC License Change: A lesson in perseverance
  • An In-Depth Look at Tomcat’s Clustering Mechanisms
  • Apache HTTP Server: New Features for Version 2.4
  • Why Closed Source is Better Than Open Source
  • Access Serial Ports through Ruby
  • Building Bots With Kids

Connect With Us!

Browse by Tag

  • 2013 (2)
  • Agile (1)
  • Apache (2)
  • apache tomcat (1)
  • AS 7 (1)
  • as7 (1)
  • Auditing (5)
  • Azure (2)
  • Budget (1)
  • BusyBox (1)
  • CentOS (3)
  • Closed Source Software (1)
  • cloud (4)
  • clustering (1)
  • CMS (1)
  • Code Scanning (1)
  • commercial distribution (1)
  • Community (4)
  • compliance (39)
  • C-Suite (1)
  • Database (1)
  • developers (2)
  • DevOps (15)
  • Drupal (1)
  • enterprise software (2)
  • foss (5)
  • Gitbhub (1)
  • Governance (36)
  • guide (1)
  • Hadoop (2)
  • HBase (2)
  • http 2.4 (1)
  • httpd 2.4 (1)
  • Java (1)
  • javascript (1)
  • jboss (3)
  • JBoss Cluster (1)
  • Joomla (1)
  • Legal (21)
  • Legal & Compliance (62)
  • Legal and Compliance (2)
  • license compliance (1)
  • Licenses (12)
  • Linux (4)
  • lisp code (1)
  • martin fowler (1)
  • Mobile (3)
  • mod_cluster (2)
  • MySQL (1)
  • Neal Ford (1)
  • open source (19)
  • open source compliance (1)
  • open source components (1)
  • open source events (1)
  • Open Source Governance (2)
  • open source legal issues (1)
  • Open Source Licensing (3)
  • Open Source Management (38)
  • Open Source Policy (3)
  • open source software (15)
  • Open Source Software Adoption (4)
  • open source software policy (1)
  • Open Source Training (1)
  • Open Source Trends (337)
  • Open Source vs. Commercial Software (3)
  • OSS (5)
  • OSS Packages (2)
  • PaaS (1)
  • paredit (1)
  • picketlink (1)
  • Policy (4)
  • PostgreSQL (1)
  • Presentations (1)
  • Programming (2)
  • red hat (1)
  • RHEL (1)
  • Ruby (1)
  • Scanning (27)
  • Scanning & Governance (12)
  • Scanning & Provisioning (30)
  • Security (13)
  • Shibboleth (1)
  • software compliance (1)
  • Software Development (2)
  • Software Development Lifecycle (7)
  • software infrastructure (1)
  • Solr (1)
  • Support (48)
  • Support & Services (2)
  • SUSE (1)
  • Technical Governance (1)
  • The Cloud (35)
  • The C-Suite (2)
  • tomcat (4)
  • Training (9)
  • Ubuntu (1)
  • Uncategorized (69)
  • Windows (1)
  • Windows Azure (1)
  • Wordpress (1)
  • Zookeeper (1)
Home | Search | Contact Us | Products and Support | Services | Enterprise OSS Blog | Wazi Technical Blog | Resources Library | Cloud Services | Partners | Customers | Community | Company | Careers | News and Events
Products
OpenLogic Exchange (OLEX)
License Compliance Module
OSS Discovery
OSS Deep Discovery
OpenUpdate
Services
Open Source Support
CentOS Support
Scanning & Compliance
Open Source Training
Professional Services
Solutions
Support & Indemnification
Open Source Governance
Open Source Scanning
Open Source Provisioning
Consulting & Training
Contact Us
1-888-673-6564


© 2013 OpenLogic, Inc. All rights reserved.
Site Map  |  Privacy Policy