Tugdual Grall

A blog about the technologies I am working on and interested in: MongoDB, Web, Java, Node, Mac and more.

Convert CSV file to Apache Parquet... with Drill

Tue, 2015-08-18 08:44
Read this article on my new blog. A very common use case when working with Hadoop is to store and query simple files (CSV, TSV, ...), then convert them to a more efficient format, such as Apache Parquet, for better performance and more efficient storage. Apache Parquet is a columnar storage format available to any project in the Hadoop ecosystem. Apache Parquet has the following ...

Apache Drill: How to Create a New Function?

Tue, 2015-07-21 11:04
Read this article on my new blog. Apache Drill allows users to explore any type of data using ANSI SQL. This is great, but Drill goes even further and allows you to create custom functions to extend the query engine. These custom functions have all the performance of any of the Drill primitive operations, but allowing that performance makes writing these functions a little trickier ...