Big data startup Databricks is now certifying applications for Spark

Gigaom

Databricks, a new startup dedicated to commercializing the Apache Spark data-processing framework, has launched a "Certified on Spark" program for software vendors that want to tout their ability to run on the increasingly popular technology. Spark was created as a processing framework for Hadoop that’s both faster and easier to use than the traditional MapReduce framework, and it’s catching on fast among folks writing big data applications.

Spark’s popularity rests on a few factors, including that it supports numerous programming languages (all of which are easier to write in than MapReduce) and supports faster data analysis both in memory and on disk. It also allows for iterative queries on existing datasets, which, along with its speed, makes it better suited to machine learning workloads. There are a number of workload-specific implementations on top of Spark, too, including Shark for interactive SQL queries, SparkR for statistical…
