Archive

Archive for the ‘hadoop’ Category

Day 2: Big Data Innovation Summit 2014 #DataWest14

April 11, 2014 2 comments

.

Levi's StadiumHello again big data fans – from where I’ve learned the San Francisco 49’ers will be playing their 2014 NFL season at Levi’s Stadium… Santa Clara!

(BTW, the stadium – from what I could see – is beautiful!  I’m a big NFL fan, and there’s now another reason to come to the San Jose area, other than all the cloud / big data conferences.)

Got a lot of great feedback on yesterday’s “Day 1” post of the summit, so here are some observations from the final day of the conference.

  • Yahoo’s Duru Ahanotu spoke through driving efficiency in how data teams are organized, going through the permutations of generalists vs specialists and centralized vs de-centralized, and how to best address teams in each model.
    .
  • PayPal’s Moises Nascimento (who is a very captivating speaker) drove the point home, that though we are now adopting many of the new data technologies like Hadoop and NoSQL, most of our existing data sources and toolsets still provide value – so there is value in leveraging ALL data sources.
    .
  • Moises also made a point of highlighting that data manipulation is best handled at the SYSTEM level, while data analysis is better managed at the ENTERPRISE level
    .
  • In HP’s discussion, they introduced the concept of the GEOBYTE – 10^30 bytes, a size of data that the human race is expected to hit in the next few years.

To provide context on the magnitude of a GEOBYTE (10^30 bytes), there is estimated to only be 10^19 GRAINS OF SAND ON THE EARTH.  Think about that for a second.

  • The team also highlighted their view on “Big BI” vs “Big Data”
    • Big BI – same types of analysis but on more data; more batch processing; results that were not easily actionable
    • Big Data – joining datasets that have not been previously joined, near real time analysis, action oriented results
      .
  • I thought Ancestry.com had one of the best sessions of the event, as they went deep into the GERMLINE algorithm that was the foundation of their business technology, and how they had to create jermline (now with a “j”) based on Hadoop / HDFS to create a SCALABLE matching engine.  As we all know, SCALE matters. The performance and speed benchmarks between the “G” project and the “j” project were mindblowing.
    .
  • Finally, sat in on the Netflix session – in addition to being a big fan of Netflix, as both a consumer and a tech observer, I’ve always been impressed with the way Netflix has evolved their business, and continues to do so.  In this session, they went into great detail on their use of the Amazon cloud services, and their open source projects as a layer above to enhance functionality and deploy features.  Topics touched on included red / black deployment to allow ease of features into production, and the importance of graceful degradation, so that a failure can be less of a catastrophic event for the end user.
    .

    • One very telling statement is really a commentary on the value of use and participation in the open source process – Netflix was clear that they see value in being an open source contributor / leader is that it preserves the future of their systems – rather than sitting back and letting the industry decide their direction with tools and tech, Netflix uses open source to help drive and lead the industry to where they see value.
      .
  • (I did resist the urge to ask the Netflix presenter when the next season of “House of Cards” would come out. 🙂 )
    .

One of the frequent questions that came up at the Dell booth was “what is Dell doing in big data?”

The answer?  Actually… quite a bit, and for quite a while.

Between the Dell Apache Hadoop HW+SW+Services Solution, the Toad BI suite, the Kitenga analytics toolsets, and our growing HPC business, Dell has been a part of this movement since its early days.  I’d recommend you drop us a line at Hadoop@Dell.com or visit us at http://www.Dell.com/Hadoop to learn more.

If you were out at the show this week, be sure to leave a comment on your thoughts as well.

Hope everyone has safe trips home, and we’ll see you at the next big data get-together!

Until next time,

JBG
@jbgeorge
BDIS 2014

Advertisements

Day 1: Big Data Innovation Summit 2014

April 10, 2014 Leave a comment

.

Hello from sunny, Santa Clara!BDIS Keynote Day 1

My team and I are here at the BIG DATA INNOVATION SUMMIT representing Dell (the company I work for), and it’s been a great day one.

I just wanted to take a few minutes to jot down some interesting ideas I heard today:

  • In Daniel Austin’s keynote, he addressed that the “Internet of things” should really be the “individual network of things” – highlighting that the number of devices, their connectivity, their availability, and their partitioning is what will be key in the future.
    .
  • One data point that also came out of Daniel’s talk – every person is predicted to generate 20 PETABYTES of data over the course of a lifetime!
    .
  • Juan Lavista of Bing hit on a number of key myths around big data:
    • the most important part of big data is its size
    • to do big data, all you need is Hadoop
    • with big data, theory is no longer needed
    • data scientists are always right 🙂

QUOTE OF THE DAY:  “Correlation does not yield causation.” – Juan Lavista (Bing)

  • Anthony Scriffignano was quick to admonish the audience that “it’s not just about data, it’s not just about the math…  [data] relationships matter.”
    .
  • The state of Utah state government is taking a very progressive view to areas that analytics can help drive efficiency in at that level – census data use, welfare system fraud, etc.  And it appears Utah is taking a leadership position in doing so.

I also had the privilege of moderating a panel on the topic of the convergence between HPC and the big data spaces, with representatives on the panel from Dell (Armando Acosta), Intel (Brent Gorda), and the Texas Advanced Computing Center (Niall Gaffney).  Some great discussion about the connections between the two, plus tech talk on the Lustre plug-in and the SLURM resource management project.

Additionally, Dell product strategists Sanjeet Singh and Joey Jablonski presented on a number of real user implementations of big data and analytics technologies – from university student retention projects to building a true centralized, enterprise data hub.  Extremely informative.

All in all, a great day one!

If you’re out here, stop by and visit us at the Dell booth.  We’ll be showcasing our hadoop and big data solutions, as well as some of the analytics capabilities we offer.

(We’ll also be giving away a Dell tablet on Thursday at 1:30, so be sure to get entered into the drawing early.)

Stay tuned, and I’ll drop another update tomorrow.

Until next time,

JOSEPH
@jbgeorge

NOW HIRING: Cloud and Big Data Solution Marketing Rockstars

June 13, 2013 1 comment

.

As Dell (the company that I work for) continues to service customers in all facets of OpenStack and Hadoop implementations, we are beginning another season of growth on the Revolutionary Solutions team.

The Dell Revolutionary Solutions Team delivers the Dell OpenStack-Powered Cloud Solution and the Dell Apache Hadoop Solution, leads the Crowbar open source project, and manages the Emerging Solutions Ecosystem Partner Program that includes a number of key partners such as Suse, Inktank, Cloudera, Datameer, and Pentaho.

   

We are looking for a variety of engineers and presales teams, but I will focus on the product management and marketing roles in this post.

  • Now Hiring!Technical Product Managers – Product managers to be technical SMEs (roadmaps, requirements, etc) on partner products in the cloud and big data spaces, most notably OpenStack and Hadoop, but could also be focused on other emerging solutions spaces – Link to Job Posting
       
  • Product Marketing Managers – Marketing experts to own and lead go-to-market strategy and deliverables in the cloud and big data spaces (marketing strategy, sales enablement, etc).  Again, this would certainly cover our OpenStack and Hadoop solutions today, but could also focus on future emerging solutions spaces.  – Link to Job Posting
      
  • Open Source Community Manager / Evangelist – Community oriented professionals with strong networks, strong social media presence, and an ability to bring collaborators and customers together to work on common goals – Link to Job Posting
      
  • Marketing Directors – Experienced people managers to drive business objectives, product vision, and go-to-market strategy, specifically in the areas of Product Management and Product Marketing – Link to Job Posting

  

In our experience, the best candidates

  • have a track record of ownership
  • have a techincal background
  • are experienced in their discipline
  • are participants in cloud, big data, virtualization, or similar emerging technologies

  

Pass it on to a friend or apply yourself – I look forward to hearing from you!

Until next time,

JBGeorge
@jbgeorge

ONE MILLION: Dell Data Center Solutions (DCS) Ships Millionth Server

November 7, 2012 Leave a comment

.

DCS Ships Millionth ServersThe team at Dell (the company I work for) celebrated a huge milestone last week…

The Data Center Solutions group (aka DCS) officially shipped their millionth cloud server!

As the director of DCS marketing, Drew Schulke, pointed out, DCS’s influence is all around you – “If you used a search engine, updated your status via social media, or generated a map on the internet to get from ‘point A’ to ‘point B’ today, there is a good chance that a DCS server made that possible.”  (You can read Drew’s entire blog here.)

As many of you know, I lead a team of product managers and strategists on the Dell Revolutionary Cloud and Big Data team, who focus on areas such as OpenStack and Hadoop, who deliver the Dell Crowbar software framework, and lead the Crowbar open source community.

So what does DCS have to do with our OpenStack / Hadoop / Crowbar team? 

Actually quite a bit.

DCS set out to meet the needs of the hyperscale customer, and as it turns out, most of those customers are looking at cutting edge technology trends like OpenStack, Hadoop, and open source toolsets.

In fact, our solutions feature PowerEdge C servers, the density optimized server line that was born out of the work done by DCS, and are being run by our solutions customers on a regular basis as they stand up cloud and big data environments.

So by pulling together integrated HW+SW+Services solutions around OpenStack and Hadoop, leveraging innovative Dell IP like Crowbar and the unique advancements coming out of the DCS group, we’re able to make our customers even more successful – which is what its all about.

Win-win-win!

It’s a great milestone for the innovative DCS team – join me in celebrating this momentus milestone!

Until next time,

JOSEPH
@jbgeorge

More info:

NOW HIRING: Dell’s Revolutionary Cloud and Big Data Team Expands

November 5, 2012 Leave a comment

.

Now Hiring!We’re growing!

The Revolutionary Cloud and Big Data Team at Dell (the company I work for) is looking to expand our team of rockstars, so we’re putting the word out. Specifically we’re looking for architects, engineers, developers, and I’m looking to hire a few more senior product managers to join my team of subject matter experts.

Just for context, we’re the team that has taken to market the Dell OpenStack-Powered Cloud Solution, the Dell Apache Hadoop Solution, and the Dell Crowbar software framework and open source project.

And if you’re a rockstar in any of those spaces, we’d like to talk to you.

SPOILER ALERT – If you’re interested in talking to us about a technical spot on our team, you can email us your info and resume at OpenStack@Dell.com or Hadoop@Dell.com.


What is this team about?

www.Dell.com/OpenStackA few years ago, the Dell Data Center Solutions team came into being with a mission of servicing the biggest hyperscale environments in the world, which included many of the market’s top cloud providers. It has succeeded in its mission in dominating the density optimized space (check out more on that here), and in fact, just shipped it’s ONE MILLIONTH SERVER.

An extension of DCS’s mission soon became clear – as many customers were looking to accelerate into spaces like cloud and big data, providing them integrated solutions would ease their implementation of these technologies. And so our Revolutionary Cloud and Big Data Solutions team was born – to deliver integrated solutions based on cutting edge technologies like OpenStack and Hadoop (and more), as well as innovative Dell projects like Crowbar, in an effort to enable customers to grow and thrive in their businesses with our products, innovation, and expertise.


Who are we?

The team at Dell is made up of a number of people, like myself, that you’d recognize from OpenStack and Hadoop circles – folks like Rob Hirschfeld, Greg Althaus, Kamesh Pemmaraju, and others. We all come from a variety of backgrounds – some from big companies in the technology spaces and many from startups – we happen to have quite a few entreprenuers on our team! And we try to service our customers in the best way possible – agile development processes, open source friendly, community oriented, etc.


What are we trying to do?Austin Meetups

Our mission is to develop and deliver HW+SW+Services solutions to market that will enable our customers to be successful. Clear and simple.

Here’s a sampling of what our team has done over the course of our existence:

In addition, we’re big believers in the community – we regularly hold hackfests to help move these communities forward, lead community meetups in Austin and Boston working with other key vendors that co-sponsor with us (you may be surprised), are regularly active in IRC, skype discussions, conference breakout sessions, and more.

It’s a fast-paced, customer focused, ever evolving group and its a great place to deliver tanglible, difference making solutions to customers.

It’s not for the faint of heart, but it’s DEFINITELY for the mover and shaker.


OpenStack@Dell.comWho we want to hear from

We’re looking to expand in a number of areas, but specifically we’re looking for technical talent

  • Developers / QA
  • Technical Product Managers and Strategists
  • Architects and Technical Leads

If I’ve piqued your interest, drop me a note and your resume at OpenStack@Dell.com.

Look forward to hearing from cloud / big data / open source rockstars.

Until next time,

JOSEPH
@jbgeorge

HADOOP WEBINAR: “New Business Insights with Hadoop Analytics”

November 1, 2012 2 comments

.

 Hadoop World last week was a blast, so hopefully, you’re still on a Hadoop high and checking out all the new ideas coming from Dell (the company I work for), and others in the space.

DellAnd to keep the good times going, Dell is joining forces with our partner, Datameer, to host a webinar deep diving into Hadoop analytics.

Jeff Stacey, Dell’s Senior Product Manager of Big Data Solution (which includes our Dell Apache Hadoop Solution), will be co-hosting this webinar with Datameer as they dig into real-world examples and use cases of how companies are taking advantage of hardware and software advances to analyze data with Hadoop. They’ll take a look at numerous data sources that are being leveraged, and how this wealth of data is already providing critical new insights in industries ranging from financial services to new media.

  
Here’s all you need to know:

  • Date: Wed, Nov 7, 2012
  • Time: 10:00 AM Pacific / 1:00 PM Eastern
  • What: Dell + Datameer webainar – “New Business Insights with Hadoop Analytics”
  • Register here: LINK

  
DatameerCome check out how analytic use cases spanning marketing, internet security, asset risk management, product usage and IT infrastructure are already driving competitive advantages and operational efficiencies.

If you’d like to learn more about how Dell is making our customers successful with Hadoop via the Dell Apache Hadoop Solution, visit www.Dell.com/Hadoop or drop us a line at Hadoop@Dell.com.

See you at the webinar!

Until next time

 JOSEPH
@jbgeorge

Highlights from the 2012 Hadoop World

October 29, 2012 Leave a comment

Strata Hadoop World

.

Had a great time at last week’s Hadoop World, so wanted to write up a few of my thoughts from the event.

  • This year’s Hadoop World was the best attended to date – I believe I heard the attendee number to be at 2500 vs 1400 last year!  It’s great to see this kind of growth among the community considering there were only 500 attendees just four years ago.
      
  • In some similarities to what I’m seeing in the OpenStack community, this conference seemed to boast more from the “user” ranks as opposed to just developers as in the recent past.  It speaks volumes to the general adoption that Hadoop is seeing in the market.
      
  • Dell, the company I work for, and our Ecosystem Partner Datameer hosted a networking event for a number of folks at Hadoop World at the prestigious Circo NYC restaurant – great food and a great time with some innovative Hadoop implementers.  Got to really get indepth how real people are implementing Hadoop in their enviornments today.  Appreciate those that took the time out to attend, and for those who missed out, see you next time!
      
  • Cloudera announced their beta project called “Impala”, which allows users to perform real-time queries of their data, a feature that a number of Hadoop users have been anticipating.  According to Cloudera, Impala can process queries up to 30 times faster than Hive / MapReduce – very cool, and I look forward to checking it out.
      
  • Finally, Dell made an announcement about our donation of “Zinc”, an ARM-based server concept to the Apache Software Foundation, with support from our partner, Calxeda, where we see ARM infrastructures as an interesting technology for Hadoop environments.  The donation includes hosting and technical support for the Apache community. and we’re hosting the server concept at an Austin-based co-location.  The Apache Hadoop project has actually performed more than a dozen builds within the first 24 hours of the servers’ deployment.   (You can check out the full press release here to learn more.)
      

All in all, Hadoop World is another hit!  It was a great event overall and I look forward to next year’s conference.

To learn more about the Dell Apache Hadoop Solution and more about what Dell is doing in this space, visit us at www.Dell.com/Hadoop

And if you want to chat about how Dell can help you with your Hadoop initiative, drop me an email at Hadoop@Dell.com.

Until next time,

JOSEPH
@jbgeorge