Skip to content

amplab/shark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

e745177 · Apr 23, 2014
Apr 23, 2014
Apr 9, 2014
May 20, 2013
Apr 3, 2014
Apr 23, 2014
Apr 9, 2014
Apr 23, 2014
Apr 9, 2014
Dec 6, 2012
Feb 16, 2014
Apr 9, 2014
Oct 21, 2013

Repository files navigation

Shark (Hive on Spark)

Shark is a large-scale data warehouse system for Spark designed to be compatible with Apache Hive. It can answer Hive QL queries up to 100 times faster than Hive without modification to either the existing data or queries. Shark supports Hive's query language, metastore, serialization formats, and user-defined functions.

Shark 0.9.0 requires:

  • Scala 2.10.3
  • AMPLab's Hive 0.11
  • Spark 0.9.x

For current documentation, see the Shark Project Wiki