GraphRAG performance enhancements #924
Conversation
Signed-off-by: Rita Brugarolas <[email protected]>
for more information, see https://pre-commit.ci
…o directly run build communities Signed-off-by: Rita Brugarolas <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Rita Brugarolas <[email protected]>
Signed-off-by: Rita Brugarolas <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: Rita Brugarolas <[email protected]>
Signed-off-by: rbrygaro <[email protected]>
for more information, see https://pre-commit.ci
There are a lot of dataprep backends, and neo4j/llama_index is not the default one used in the docker compose files and Helm charts. Do the ones used by default also have a similar bottleneck?
@eero-t thanks for commenting. Although there are other dataprep backends, their functionality is very different from this microservice. This dataprep models the Microsoft GraphRAG (https://github.com/microsoft/graphrag) dataprep backend, which performs entity/relationship extraction (using an LLM), builds a graph, clusters the graph nodes to generate communities, and generates community summaries; those are later retrieved by the retriever, which answers a query by producing a final answer from partial answers. For "toy datasets" these bottlenecks are negligible, but for large datasets the slowdown is significant.
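For orientation only, a minimal sketch of that dataprep flow using llama-index's property-graph API (this is not the microservice code; LLM/embedding configuration is omitted, credentials and URLs are placeholders, and parameter names may vary by llama-index version):

```python
# Illustrative GraphRAG-style dataprep flow with llama-index (not the PR's actual code).
from llama_index.core import Document, PropertyGraphIndex
from llama_index.graph_stores.neo4j import Neo4jPropertyGraphStore

# Neo4j-backed property graph store (placeholder credentials/URL).
graph_store = Neo4jPropertyGraphStore(
    username="neo4j", password="password", url="bolt://localhost:7687"
)

# 1) LLM-based entity/relationship extraction and graph construction.
#    An LLM and embedding model must be configured (omitted here).
index = PropertyGraphIndex.from_documents(
    [Document(text="... raw corpus text ...")],
    property_graph_store=graph_store,
)

# 2) Community detection and summarization follow: the graph nodes are clustered
#    into communities and an LLM summarizes each one; at query time the retriever
#    turns the relevant summaries into partial answers and reduces them into a
#    final answer.
```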
Signed-off-by: rbrygaro <[email protected]>
Signed-off-by: rbrygaro <[email protected]>
Signed-off-by: rbrygaro <[email protected]>
Signed-off-by: rbrygaro <[email protected]>
for more information, see https://pre-commit.ci
Thanks @rbrugaro
My comments and change requests are mostly generic; feel free to challenge me 😄
Hi @rbrugaro, could you please resolve the comments? Thanks.
Do you mean the model initialization of the TGI/TEI service? If yes, I suppose the LLM model initialization happens inside the TGI/TEI service, so it should be performed only once. Both dataprep and retrieval only refer to that service instead of creating a local one. Is GraphRAG enabled in ChatQnA? Does the dataprep (GraphRAG) share the same TGI/TEI service (docker container) with the ChatQnA LLM?
@xiguiw sorry for the confusion and thanks for the comments. I updated the PR description, maybe it is clearer now. I was referring to the initialization of the GraphStore. The inference microservices are deployed and initialized only once; the dataprep and retriever code just sends endpoint requests to them through the llama-index API.
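A minimal sketch of the pattern being described, assuming llama-index's Neo4jPropertyGraphStore (the refresh_schema flag and other parameter names are assumptions and may differ by version; handler names are illustrative):

```python
# Illustrative only: create the graph store once at startup and reuse it per request.
from llama_index.graph_stores.neo4j import Neo4jPropertyGraphStore

# Done once at module import / service startup, not inside the request handlers.
graph_store = Neo4jPropertyGraphStore(
    username="neo4j",
    password="password",
    url="bolt://localhost:7687",  # placeholder endpoint
    refresh_schema=False,         # assumed flag: skip costly schema refreshes on large graphs
)

def handle_dataprep_request(payload: dict) -> None:
    # Hypothetical handler: reuses the shared store instead of constructing
    # a new one on every call, avoiding repeated connection/schema setup.
    ...

def handle_retrieval_request(query: str) -> None:
    # Hypothetical handler: likewise reuses the shared store.
    ...
```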
LGTM
Signed-off-by: rbrygaro <[email protected]>
Verified test_retrievers_neo4j.sh OK. Dataprep still pre-refactor (not yet merged in main). Signed-off-by: rbrygaro <[email protected]>
for more information, see https://pre-commit.ci
Signed-off-by: rbrygaro <[email protected]>
lgtm
Issue: When the property graph store gets filled (~12K nodes, 15K relationships), insertion time in dataprep gets slow. Extraction + insertion starts at ~30 sec and, once the store is filled (~12K nodes, 15K relationships), grows to ~800 sec. The perf bottleneck is this Cypher call in llama-index used to do the node upsert: https://github.com/run-llama/llama_index/blob/795bebc2bad31db51b854a5c062bedca42397630/llama-index-integrations/graph_stores/llama-index-graph-stores-neo4j/llama_index/graph_stores/neo4j/neo4j_property_graph.py#L334

Performance optimizations in this PR:
1. Move the neo4j GraphStore initialization out of the dataprep and retrieve functions so it is only performed once, at the beginning.
2. Disable schema_refresh of the neo4j graph when not necessary, because for a large graph this is very slow.
3. Switch to the OpenAILike class from llama-index to work with vllm or tgi endpoints without code changes (only docker compose.yaml changes).
4. Add concurrency and batching for generating community summaries and generating answers from summaries.

---------

Signed-off-by: Rita Brugarolas <[email protected]>
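As a rough illustration of optimizations 3 and 4 above (not the PR's actual code), an OpenAI-compatible client pointed at a vLLM or TGI endpoint plus bounded-concurrency community summarization could look like the following; the endpoint URL, model name, and helper names are placeholders:

```python
# Illustrative sketch: OpenAI-compatible client + concurrent community summarization.
import asyncio
from llama_index.llms.openai_like import OpenAILike

# One client works against either a vLLM or a TGI OpenAI-compatible endpoint;
# switching backends is a compose.yaml change, not a code change.
llm = OpenAILike(
    model="meta-llama/Meta-Llama-3-8B-Instruct",  # placeholder: whatever the endpoint serves
    api_base="http://vllm-service:8008/v1",       # placeholder endpoint URL
    api_key="fake",
    is_chat_model=True,
)

async def summarize_community(text: str, sem: asyncio.Semaphore) -> str:
    # Bound the number of in-flight LLM requests instead of summarizing serially.
    async with sem:
        resp = await llm.acomplete(f"Summarize the following community:\n{text}")
        return resp.text

async def summarize_all(communities: list[str], max_concurrency: int = 8) -> list[str]:
    sem = asyncio.Semaphore(max_concurrency)
    return await asyncio.gather(*(summarize_community(c, sem) for c in communities))
```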
Performance optimizations in this PR: