Gruter’s paper accepted at IEEE BigData 2015

Gruter scientists are researching cutting-edge techniques for big data processing systems, and a paper covering a subset of our work, titled "An Evaluation of Alternative Shared-nothing Architecture for Analytical Processing Systems", has been accepted at the IEEE BigData 2015 conference. Dr. Hyunsik Choi, Gruter research director, will give a talk at the conference, held on Oct 29 ...

Broadcast Join in Tajo

Example of Repartition Join

(This post was originally published at https://jihoonson.wordpress.com/.) Join is one of the most expensive operations in the relational world, and many researchers have studied efficient join processing. In distributed systems, there are two well-known join execution algorithms: repartition join and broadcast join. (There are many other join algorithms, including the recently introduced Track join, hyper shuffle join, and Tributary join, but ...

Gruter at K-Global London 2015

After Hadoop Summit 2015, Gruter headed to K-Global@London, held June 16-18 at ExCeL, London. K-Global@London is a government-led event to promote Korean ICT businesses in Great Britain, the fourth largest ICT market in the world. At this event, more than 20 Korean tech companies showcased their technologies for mobile solutions, IoT, and cloud services. ...

Gruter at Hadoop Summit 2015

On June 9-11, the 8th Annual Hadoop Summit took place at the Convention Center in San Jose. Hadoop Summit is one of the biggest Hadoop conferences in the world, where cutting-edge Hadoop technologies are introduced and shared. Many tech giants, including Yahoo!, Hortonworks, Microsoft, EMC, HP, SAP, and Teradata, converged on this ...

Gruter at AWS Summit Seoul 2015

The Gruter team ran a booth showcasing Tajo on AWS products at AWS Summit Seoul 2015, one of the biggest cloud computing conferences, held in Seoul, South Korea, on Apr. 21. At the booth, attendees took an early look at the new version of Gruter TaaS (Tajo-as-a-Service), which enables AWS users to set up a Tajo cluster within ...

Setting up an Apache Tajo Cluster on Amazon EMR

Note: A bootstrap action script for EMR 4.x has been added. Check out the differences introduced in 4.x with the release of EMR 4.0 in Jul 2015. Apache Tajo™, or simply “Tajo”, is an open-source relational and distributed big data warehouse (“Big DW”) system that runs on Apache Hadoop and other stores. Tajo is designed for low-latency and scalable ...

Apache Tajo™ 0.10.0 now available!

The Apache Software Foundation announced the release of Apache Tajo v0.10 on Mar 9. The release heralds significant enhancements to the enterprise “SQL-on-Hadoop” big data warehouse solution, including performance improvements and wider ecosystem integration. "Tajo has evolved over the last couple of years into a mature 'SQL-on-Hadoop' engine," said Hyunsik Choi, Vice President of Apache Tajo and Gruter ...

Apache Tajo on HadoopSphere.com

In a two-part article series entitled "Technical Deep Dive Into Apache Tajo", HadoopSphere.com conducts a Q&A with Dr. Hyunsik Choi, PMC Chair of Apache Tajo. In the first article of the series, Choi explains Tajo's design logic, including its unique distributed processing framework, pluggable storage manager, and advanced query optimization capabilities. See: http://www.hadoopsphere.com/2015/02/technical-deep-dive-into-apache-tajo.html

Gruter Joins Hortonworks Technology Partner Program

PRESS RELEASE: Integration with Hortonworks Data Platform Takes the Apache Tajo Big Data Warehouse Solution to the Global Enterprise Market. Palo Alto, Jan. 20, 2015 - Gruter, a big data company which builds next-generation data warehouse systems, today announced that it has joined the Hortonworks® Technology Partner Program. Gruter’s new partnership with Hortonworks, the leading distributor of Apache™ Hadoop®, ...