Home > big data, E-Progress, hadoop, Innovation, open source, Social Media, Tech > Day 2: Big Data Innovation Summit 2014 #DataWest14

Day 2: Big Data Innovation Summit 2014 #DataWest14

.

Levi's StadiumHello again big data fans – from where I’ve learned the San Francisco 49’ers will be playing their 2014 NFL season at Levi’s Stadium… Santa Clara!

(BTW, the stadium – from what I could see – is beautiful!  I’m a big NFL fan, and there’s now another reason to come to the San Jose area, other than all the cloud / big data conferences.)

Got a lot of great feedback on yesterday’s “Day 1″ post of the summit, so here are some observations from the final day of the conference.

  • Yahoo’s Duru Ahanotu spoke through driving efficiency in how data teams are organized, going through the permutations of generalists vs specialists and centralized vs de-centralized, and how to best address teams in each model.
    .
  • PayPal’s Moises Nascimento (who is a very captivating speaker) drove the point home, that though we are now adopting many of the new data technologies like Hadoop and NoSQL, most of our existing data sources and toolsets still provide value – so there is value in leveraging ALL data sources.
    .
  • Moises also made a point of highlighting that data manipulation is best handled at the SYSTEM level, while data analysis is better managed at the ENTERPRISE level
    .
  • In HP’s discussion, they introduced the concept of the GEOBYTE – 10^30 bytes, a size of data that the human race is expected to hit in the next few years.

To provide context on the magnitude of a GEOBYTE (10^30 bytes), there is estimated to only be 10^19 GRAINS OF SAND ON THE EARTH.  Think about that for a second.

  • The team also highlighted their view on “Big BI” vs “Big Data”
    • Big BI – same types of analysis but on more data; more batch processing; results that were not easily actionable
    • Big Data – joining datasets that have not been previously joined, near real time analysis, action oriented results
      .
  • I thought Ancestry.com had one of the best sessions of the event, as they went deep into the GERMLINE algorithm that was the foundation of their business technology, and how they had to create jermline (now with a “j”) based on Hadoop / HDFS to create a SCALABLE matching engine.  As we all know, SCALE matters. The performance and speed benchmarks between the “G” project and the “j” project were mindblowing.
    .
  • Finally, sat in on the Netflix session – in addition to being a big fan of Netflix, as both a consumer and a tech observer, I’ve always been impressed with the way Netflix has evolved their business, and continues to do so.  In this session, they went into great detail on their use of the Amazon cloud services, and their open source projects as a layer above to enhance functionality and deploy features.  Topics touched on included red / black deployment to allow ease of features into production, and the importance of graceful degradation, so that a failure can be less of a catastrophic event for the end user.
    .

    • One very telling statement is really a commentary on the value of use and participation in the open source process – Netflix was clear that they see value in being an open source contributor / leader is that it preserves the future of their systems – rather than sitting back and letting the industry decide their direction with tools and tech, Netflix uses open source to help drive and lead the industry to where they see value.
      .
  • (I did resist the urge to ask the Netflix presenter when the next season of “House of Cards” would come out. :) )
    .

One of the frequent questions that came up at the Dell booth was “what is Dell doing in big data?”

The answer?  Actually… quite a bit, and for quite a while.

Between the Dell Apache Hadoop HW+SW+Services Solution, the Toad BI suite, the Kitenga analytics toolsets, and our growing HPC business, Dell has been a part of this movement since its early days.  I’d recommend you drop us a line at Hadoop@Dell.com or visit us at http://www.Dell.com/Hadoop to learn more.

If you were out at the show this week, be sure to leave a comment on your thoughts as well.

Hope everyone has safe trips home, and we’ll see you at the next big data get-together!

Until next time,

JBG
@jbgeorge
BDIS 2014

About these ads
  1. April 11, 2014 at 6:12 pm

    Reblogged this on AllThingsBigData and commented:
    Great News from the Big Data Innovation Summit.
    Welcome all to the land of the innovators and 49ers.

  2. April 11, 2014 at 6:12 pm

    Great news from the Big Data Innovation Conference.
    Welcome all to the Silicon Valley , land of the innovators and 49ers.

  1. No trackbacks yet.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Google+ photo

You are commenting using your Google+ account. Log Out / Change )

Connecting to %s

Follow

Get every new post delivered to your Inbox.

%d bloggers like this: