Cloudera CEO Tom Reilly doesn’t often mince words when it comes to describing to his competition in the Hadoop space, or Cloudera’s position among those other companies. In October 2013, Reilly told me he didn’t consider Hortonworks or MapR to be Cloudera’s real competition, but rather larger […]
Cloudera CEO declares victory over big data competition originally published by Gigaom, © copyright 2015.
The success of a company is so often related to how well they understand their customers. With today’s growing fascination with big data, users’ opinions are sometimes ignored.
Titles can be misleading. For example, the O’Reilly Strata + Hadoop World conference took place in San Jose, California, this week but Hadoop wasn’t the star of the show. Based on the news I saw coming out of the event, it’s another Apache project — Spark […]
I have written quite a bit about GDELT (the Global Database of Events, Languages and Tone) over the past year, because I think it’s a great example of the type of ambitious project only made possible by the advent of cloud computing and big data systems. In a […]
A massive database now translates news in 65 languages in real time originally published by Gigaom, © copyright 2015.
Rather than continuing to rely on traditional extract, transform, and load (ETL) tools to get data into Big Data applications running on Hadoop, Paxata at the Strata + Hadoop 2015 conference today unfurled an update to its software that lets organizations use a REST API to load data.
Continuing its quest to make Microsoft Azure comfy for the non-Windows world, Microsoft just launched a preview of its Hadoop-based cloud tool (HDInsight) that runs on Linux. It’s also making its Azure ML machine learning service widely available now with new support for Python as well as the already-planned support for […]
Microsoft embraces Python, Linux in new big data tools originally published by Gigaom, © copyright 2015.
Machine learning company H2O has released a new Python API for its Sparkling Water application. Sparkling Water extends H2O’s core product, a predictive analytics platform, to Apache Spark, an open-source in-memory big data processing platform that is growing in popularity.
Pivotal, the cloud computing and big data company that spun out from EMC and VMware in 2013, is open sourcing its entire portfolio of big data technologies and is teaming up with Hortonworks, IBM, GE, and several other companies on a Hadoop effort called the Open […]
Pivotal open sources its Hadoop and Greenplum tech, and then some originally published by Gigaom, © copyright 2015.
First, we love this report on the Linux Foundation’s assessment of how open source projects such as OpenStack, Cloud Foundry and Docker are driving both innovation and enterprise readiness in cloud technology.Apache Hadoop® is another excellent example of how wide adoption of projects dramatically shifts the market, as we see in this article on Deutsche Bank’s latest Hadoop study (big data OSS). OSS has both become a wonderful way for companies to collaborate on technology and also create high growth business such as the record breaking business performance with Pivotal Cloud Foundry. Two other related and noteworthy items include independent analyst Steve Chambers’ highlights on Cloud Foundry’s impressive first year, retracting a previous “bearish” attitude, and Matt Asay’s analysis asking if Cloud Foundry will be the next Red Hat.In our own experience, we have seen customers shift their buying, preferring OSS-based solutions as much as possible. Tesora’s shift to OSS within the OpenStack ecosystem is a great example of this.Individual contributors are the lifeblood of OSS, but you don’t have to be a developer checking in code to contribute. Here are 8 ways you can contribute to open source projects without writing code.Sometimes, OSS projects may seem to run in their own silos or ecosystem niches. Part of the power of OSS comes when contributors help increase the “innovation surface area” by bridging and connecting technologies. Several excellent examples can be seen here in multiple Spring and CF projects bridging PaaS, cloud, Apache Hadoop®, and MySQL:
Finally, in an effort to better align investment with the primary challenges Pivotal is trying to solve, Pivotal is looking for new sponsors for Groovy and Grails.
First, Gigaom suggests all developers need to become familiar with big data technologies and use cases since soon every business application will likely incorporate some big data functionality.For example, big data is making its way into digital travel services—Expedia plans to “double the size” of their Apache Hadoop® cluster in 2015 to help solve its big data challenges in the UK, having previously only used DB2 and Microsoft SQL databases.Not convinced yet? Here is SaaS visionary Mark Benioff and two separate executive research surveys saying big data and predictive analytics are top priorities and that CEOs desire big data solutions: 1) PwC CEO Survey Recap: Mobile, Data Mining, and Analysis most important 2) IDG Enterprise Big Data Research. Expect funding for future projects and all the market requirements you are building towards to reflect such priorities.Cloud Foundry is useful for big data and analytical applications as this blog about Cloud Foundry for Data Scientists reveals, and in how Pivotal built a Super Bowl social sentiment analysis application in less than a day on Cloud Foundry using microservices.Editor’s Note: Apache, Apache Hadoop, Hadoop, and the yellow elephant logo are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries.