Inquidia Consulting Releases Parquet Output Plugin for Pentaho Data Integration

New Component Helps Hadoop Users Easily Deploy Compressed Columnar Storage Format
By: Inquidia Consulting

Parquet is a columnar file format used in Hadoop with built-in dictionary encoding, compression, and the ability to read only the columns of interest. The increasingly popular format was first released in 2013 and has gained traction with the Hadoop user base since its launch. This adoption has been driven by the massive compression and performance benefits of Parquet over non-columnar file formats such as text or Avro. However, despite this rise in adoption, ingesting data into Hadoop in the Parquet format has remained challenging, because broad, simple-to-use Parquet support is not available across all Hadoop frameworks.

Inquidia Labs took on this challenge and developed the Parquet Output Plugin for Pentaho Data Integration. Chris Deptula, senior architect for the Inquidia Labs project, led development of the plugin, which is currently available through the Pentaho PDI Marketplace and on GitHub.

“In almost all Hadoop deployments Inquidia is working on, Parquet is being used in some form. Our clients want to be able to reap the performance benefits of Parquet without the long list of challenges to actually implementing it,” said Deptula.

Inquidia’s Parquet Output Plugin for Pentaho Data Integration is currently available for free in the Pentaho Marketplace, or on GitHub at https://github.com/

About Inquidia Consulting

Inquidia is an innovative professional services firm delivering full-spectrum data engineering and analytics services that help our customers inquire, learn and take action with their data. We are passionate about data.