Fix segfilecount of AO/AOCO when bulk insertion: COPY #530

Merged: 1 commit merged into apache:main on Jul 23, 2024

avamingli
Contributor

Fix #529.

For COPY FROM on AO/AOCO tables, we need to try to switch physical seg files on the fly during bulk insertion. Otherwise, only one insertDesc is used and the GUC that controls segfilecount doesn't take effect.
That matters for parallel plans.
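
To illustrate the idea of switching seg files on the fly, here is a minimal, self-contained sketch. It is not the code from this PR: the `Sketch*` types, the round-robin policy, and the `switch_every` threshold are all assumptions made for illustration. It only shows why a bulk insert that never switches ends up writing through a single insert descriptor, while one that rotates spreads rows over every allowed seg file.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical stand-in for a per-seg-file insert descriptor. */
typedef struct SketchInsertDesc {
    int  segno;          /* physical seg file number */
    long tuples_written; /* rows appended through this descriptor */
} SketchInsertDesc;

/* Hypothetical bulk-insert state: one descriptor per allowed seg file. */
typedef struct SketchBulkState {
    SketchInsertDesc descs[8];
    int  ndescs;        /* number of seg files that may be used */
    int  current;       /* index of the descriptor currently in use */
    long switch_every;  /* assumed threshold: rows written before switching */
    long since_switch;  /* rows written since the last switch */
} SketchBulkState;

static void sketch_init(SketchBulkState *st, int ndescs, long switch_every)
{
    assert(st != NULL && ndescs > 0 && ndescs <= 8 && switch_every > 0);
    st->ndescs = ndescs;
    st->current = 0;
    st->switch_every = switch_every;
    st->since_switch = 0;
    for (int i = 0; i < ndescs; i++) {
        st->descs[i].segno = i + 1;
        st->descs[i].tuples_written = 0;
    }
}

/* Insert one row; rotate to the next seg file once the threshold is reached.
 * Returns the seg file number the row was written to. */
static int sketch_insert_row(SketchBulkState *st)
{
    if (st->since_switch >= st->switch_every) {
        st->current = (st->current + 1) % st->ndescs;
        st->since_switch = 0;
    }
    st->descs[st->current].tuples_written++;
    st->since_switch++;
    return st->descs[st->current].segno;
}

/* Count how many seg files received at least one row. */
static int sketch_files_used(const SketchBulkState *st)
{
    int used = 0;
    for (int i = 0; i < st->ndescs; i++)
        if (st->descs[i].tuples_written > 0)
            used++;
    return used;
}
```

With `ndescs` set to 1 the sketch degenerates to the pre-fix behavior described above: every row goes through the same descriptor, whatever limit is configured.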

For AO tables specifically, bulk insertion is optimized to reuse the var block when possible, which requires an additional check.

Also fix a memory leak for used_segment_files, which is allocated in enter_dml_state() and should be freed when the DML finishes.
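
The leak pattern described here can be sketched as a paired allocate/release around a DML operation. This is an assumption-laden illustration rather than the actual patch: the `Sketch*` names are hypothetical, and plain `calloc`/`free` stand in for whatever allocator the real code uses. Only `used_segment_files` and `enter_dml_state()` come from the description above.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical per-DML state holding the seg files touched so far. */
typedef struct SketchDmlState {
    int *used_segment_files; /* allocated when the DML starts */
    int  nused;
    int  capacity;
} SketchDmlState;

/* Counterpart of entering the DML state: allocate the tracking array.
 * Returns 1 on success, 0 if the allocation failed. */
static int sketch_enter_dml_state(SketchDmlState *st, int capacity)
{
    assert(st != NULL && capacity > 0);
    st->used_segment_files = calloc((size_t) capacity, sizeof(int));
    st->nused = 0;
    st->capacity = (st->used_segment_files != NULL) ? capacity : 0;
    return st->used_segment_files != NULL;
}

/* Record a seg file once. Returns 1 if added, 0 if already present,
 * -1 if the array is full. */
static int sketch_remember_segfile(SketchDmlState *st, int segno)
{
    for (int i = 0; i < st->nused; i++)
        if (st->used_segment_files[i] == segno)
            return 0;
    if (st->nused == st->capacity)
        return -1;
    st->used_segment_files[st->nused++] = segno;
    return 1;
}

/* Counterpart of finishing the DML: release what the enter step allocated.
 * Skipping this step is the kind of leak the description refers to. */
static void sketch_finish_dml_state(SketchDmlState *st)
{
    free(st->used_segment_files);
    st->used_segment_files = NULL;
    st->nused = 0;
    st->capacity = 0;
}
```

Resetting the pointer to NULL after freeing keeps a second call to the finish step harmless, since `free(NULL)` is a no-op.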

Authored-by: Zhang Mingli


@avamingli avamingli requested a review from my-ship-it July 22, 2024 10:57

avamingli commented Jul 22, 2024

The external_fts pipeline config set gp_appendonly_insert_files=0, which is not expected, so that setting is removed.

@avamingli avamingli merged commit 60790f0 into apache:main Jul 23, 2024
11 checks passed
@avamingli avamingli deleted the fix_ao_copy branch July 23, 2024 06:33
Successfully merging this pull request may close these issues.

[Bug] AO tables didn't switch seg file when bulk insertion: COPY