Category Archives: Technology

Apache Tajo™ 0.11.3 Released

The Apache Tajo community is pleased to announce that version 0.11.3 has been released and is able to download. It is a minor release. According to Hyunsik Choi, Vice President of Apache Tajo and Gruter Research Director, Tajo 0.11.3 includes the resolution of 5 major issues and temporarily disabled the ‘NOT IN’ predicate.

Apache Tajo™ 0.11.2 released

The Apache Tajo community is pleased to announce that version 0.11.2 has been released and is able to download. It is a minor release. According to Hyunsik Choi, Vice President of Apache Tajo and Gruter Research Director, Tajo 0.11.2 includes the resolution of 30 issues including minor features, bug fixes and performance improvements.

Gruter Enterprise Tajo is now available on AWS marketplace

gruter_cloumonELK

We are happy to announce that Gruter Enterprise Tajo has been released on AWS marketplace. It is the new, simple and cost-efficient way to deploy Tajo on AWS. Gruter Enterprise Tajo (G.E.T) is a pre-configured Tajo AMI (Amazon Machine Image) packaged by Gruter. G.E.T helps AWS users to deploy their Tajo cluster on Amazon EC2 within ...

CloumonELK

gruter_cloumonELK

A big data platform monitoring tool based on ELK stack We are proud to announce that CloumonELK is now open source! CloumonELK is a handy tool for monitoring various big data technologies, including Apache Tajo, Hadoop, HBase, ZooKeeper, Flume and ElasticSearch. Based on the popular ELK (ElasticSearch, Logstash and Kibana) stack and with pre-defined configurations and built-in ...

Apache Tajo™ 0.11.1 released

The Apache Software Foundation announced the release of Apache Tajo v0.11.1 on Jan. 3 .This is a minor release. According to Hyunsik Choi, Vice President of Apache Tajo and Gruter Research Director, Tajo 0.11.1 sees the resolution of 30 issues including minor features, bug fixes and performance improvements. * http://tajo.apache.org/releases/0.11.1/announcement.html

Gruter at K-Global China 2015

Gruter_ApacheCon_BigData_2015_jihoonSon

K-Global@China took place in Shanghai on Dec. 15-16. 2015. It was the government-led exhibition to promote Korean ICT business such as Cloud, Big Data, IoT and Smart City Infrastructure Platforms. In the China-Korea Cloud Business Meeting with the leaders of the 15 companies, Gruter Data Analyst, Youngkyong Ko presented the Apache Tajo. The event was a good opportunity ...

Apache Tajo™ 0.11.0 released!

The Apache Software Foundation announced the release of Apache Tajo v0.11.0 on Oct 28. The new release heralds significant enhancements for easier data integration and interoperability with other hadoop ecosystems. "Tajo 0.11.0 represents a very important milestone. It introduced critical features and functions that let us build out a modern data warehouse system," said Hyunsik Choi, Vice ...

Tajo at ApacheCon – Big Data Europe 2015

Gruter_ApacheCon_BigData_2015_jihoonSon

Gruter senior developer and Apache Tajo co-founder, Dr. Jihoon Son, was a presenter at Apache:Big Data Europe, held in Budapest, Hungary on September 30. Son's challenging session looked at the new features in the coming release Tajo 0.11, including query federation, JDBC-based storage support and self-describing data format support among others. With the coming release scheduled ...

Gruter’s paper accepted at IEEE BigData 2015

  Gruter scientists are researching cutting-edge techniques on big data processing system, and a subset of our work has been accepted in the IEEE BigData 2015 Conference, titled "An Evaluation of Alternative Shared-nothing Architecture for Analytical Processing Systems". Dr. Hyunsik Choi, Gruter research director, is going to give his talk on the conference, held on Oct 29 ...

Broadcast Join in Tajo

Example of Repartition Join

(This post is originally published in https://jihoonson.wordpress.com/.) Join is one of the most expensive operations in relational world. Many researchers have been studied for efficient join processing. In distributed systems, there are two well-known join execution algorithms, i.e., repartition join and broadcast join. (There are many other join algorithms including recently introduced Track join, hyper shuffle join, and Tributary join, but ...