Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add chunked Parquet reader #28

Closed
morazow opened this issue Sep 15, 2021 · 0 comments · Fixed by #29
Closed

Add chunked Parquet reader #28

morazow opened this issue Sep 15, 2021 · 0 comments · Fixed by #29
Labels
feature Product feature

Comments

@morazow
Copy link
Contributor

morazow commented Sep 15, 2021

Situation

Parquet files are organized into row groups. These chunks of a single parquet file then can be read instead of reading the whole file. The PIOJ should support reading specified row groups.

Acceptance Criteria

  • Chunked Parquet reader is implemented
  • In addition, Parquet file virtual splitter is provided
  • Tests are added
@morazow morazow added feature Product feature source:exasol labels Sep 15, 2021
morazow added a commit that referenced this issue Sep 20, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature Product feature
Projects
None yet
Development

Successfully merging a pull request may close this issue.

1 participant