Loading…
Apache: Big Data 2016 has ended
Register Now or Visit the Website for more Information 
Tuesday, May 10 • 2:00pm - 2:50pm
Hive on ACID - Alan Gates, Hortonworks

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Hive provides SQL access for data in Hadoop. Traditionally data in Hadoop is write once read many. But with traditional data
warehousing use cases moving to Hadoop there is a need to support transactional update and delete of records. Hive has recently implemented
ACID compliant row level insert, update, and delete as well as very low latency ingestion of streaming data from tools like Storm and Flume. This is done with snapshot isolation between queries. This talk will cover the intended use cases, architectural challenges of implementing updates and deletes in a write-once file system, and details of changes to the file storage formats and transaction management system.

Speakers
avatar for Alan Gates

Alan Gates

Co-founder and Architect, Hortonworks
Alan is a founder of Hortonworks and an original member of the engineering team that took Pig from a Yahoo! Labs research project to a successful Apache open source project. Alan has done extensive work in Hive, including adding ACID transactions. Alan has a BS in Mathematics from... Read More →


Tuesday May 10, 2016 2:00pm - 2:50pm PDT
Georgia A