[SUREFIRE-2212] surefire-report-parser: parse element content only if needed to prevent OOME #687

Closed
bisswanger wants to merge 2 commits

@bisswanger bisswanger commented Nov 15, 2023

While parsing large reports, we encountered an OutOfMemoryError raised in surefire-report-parser when the report file contains large CDATA sections under system-err / system-out.

Although the parser is SAX-based, it accumulates the content of every element into a string, which causes problems for large test reports.
Proposed fix: keep element content in memory only if the value is actually used in the result.
That is, the content of system-err / system-out is simply discarded.
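
A minimal sketch of the idea, not the actual patch: a SAX handler that buffers character data only for elements whose text is needed and drops everything else as it streams past. The class name `SelectiveContentHandler`, the `parse` helper, and the choice of `failure` and `error` as the captured elements are assumptions made for illustration; the real parser may keep a different set.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Set;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.helpers.DefaultHandler;

public class SelectiveContentHandler extends DefaultHandler {
    // Hypothetical set of elements whose text content is used in the result.
    private static final Set<String> CAPTURED =
            new HashSet<>(Arrays.asList("failure", "error"));

    private final List<String> capturedText = new ArrayList<>();

    // null means "discard incoming character data".
    private StringBuilder buffer;

    @Override
    public void startElement(String uri, String localName, String qName, Attributes attributes) {
        // Allocate a buffer only when the element's content is actually needed.
        buffer = CAPTURED.contains(qName) ? new StringBuilder() : null;
    }

    @Override
    public void characters(char[] ch, int start, int length) {
        // Content of system-out / system-err (and any other element) is never copied.
        if (buffer != null) {
            buffer.append(ch, start, length);
        }
    }

    @Override
    public void endElement(String uri, String localName, String qName) {
        if (buffer != null && CAPTURED.contains(qName)) {
            capturedText.add(buffer.toString());
        }
        buffer = null;
    }

    public List<String> getCapturedText() {
        return capturedText;
    }

    // Convenience helper for the sketch: parse an in-memory XML string.
    public static List<String> parse(String xml) throws Exception {
        SelectiveContentHandler handler = new SelectiveContentHandler();
        SAXParserFactory.newInstance().newSAXParser()
                .parse(new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), handler);
        return handler.getCapturedText();
    }
}
```

With this shape, memory use is bounded by the size of the captured elements rather than by the largest CDATA section in the report.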

Sample project that demonstrates the issue by creating a huge dummy test report and then running the parser on it:
https://github.com/bisswanger/surefire-parser-sample (see README for details)

Follow this checklist to help us incorporate your
contribution quickly and easily:

  • Make sure there is a JIRA issue filed
    for the change (usually before you start working on it). Trivial changes like typos do not
    require a JIRA issue. Your pull request should address just this issue, without
    pulling in other changes.
  • Each commit in the pull request should have a meaningful subject line and body.
  • Format the pull request title like [SUREFIRE-XXX] - Fixes bug in ApproximateQuantiles,
    where you replace SUREFIRE-XXX with the appropriate JIRA issue. Best practice
    is to use the JIRA issue title in the pull request title and in the first line of the
    commit message.
  • Write a pull request description that is detailed enough to understand what the pull request does, how, and why.
  • Run mvn clean install to make sure basic checks pass. A more thorough check will
    be performed on your pull request automatically.
  • You have run the integration tests successfully (mvn -Prun-its clean install).

If your pull request is about ~20 lines of code, you don't need to sign an
Individual Contributor License Agreement. If you are unsure,
please ask on the developers list.

To make clear that you license your contribution under
the Apache License Version 2.0, January 2004
you have to acknowledge this by using the following check-box.

@bisswanger bisswanger changed the title parse element content only if needed to prevent OOME surefire-report-parser: parse element content only if needed to prevent OOME Nov 15, 2023
@bisswanger bisswanger changed the title surefire-report-parser: parse element content only if needed to prevent OOME [SUREFIRE-2212] surefire-report-parser: parse element content only if needed to prevent OOME Nov 17, 2023
Member

@michael-o michael-o left a comment

This looks reasonable. Please turn your sample into a unit test and I will test and merge it.

@bisswanger
Author

@michael-o: thanks for the feedback.
I added a corresponding unit test.

@michael-o michael-o self-requested a review December 4, 2023 12:32
@michael-o
Member

> @michael-o : thanks for the feedback. Added a corresponding unit test

The test looks reasonable. Waiting for the tests to complete.

@michael-o michael-o closed this in 05322d9 Dec 4, 2023