Tag Archives: Apache Tajo

Integrating Tajo with Hive

Tajo algebra parsing process

This piece is based on a post by Jae-hwa Jung, Apache Tajo committer and Gruter senior developer, originally published on his Korean-language blog. Apache Tajo, a big data warehouse on Hadoop, processes queries with its own SQL engine, unlike Hive, which uses MapReduce. It supports both short-term queries, which can range from seconds ...

Apache Tajo at the Bay Area HUG


On Nov. 5, the Bay Area Hadoop User Group (HUG) held a special meetup at the LinkedIn headquarters in Mountain View, CA. The event showcased two new Apache Incubator projects: Tajo and Samza. In the session “Apache Tajo: A Big Data Warehouse on Hadoop”, Dr. Hyun-sik Choi, Research Director at Gruter and ...

SQL-on-Hadoop: What does “100 times faster than Hive” actually mean?

This piece was originally posted by Hyeong-jun Kim, CTO and Chief Architect at Gruter, on his Korean-language blog. SQL-on-Hadoop technologies have been drawing considerable attention in big data analytics of late. These solutions enable developers (that is, users and data analysts) to process big data stored on Hadoop via a familiar SQL interface rather than ...