[Feature Request]: Re-use database connection pool in production #4751

Closed
LambertW opened this issue Jan 2, 2025 · 2 comments

LambertW commented Jan 2, 2025

What would you like to happen?

I'm currently using Apache Hop for ETL integration, but I have some doubts about how to deploy and run it on a production server. How can I manage the database connection pool to avoid the overhead of repeatedly creating connections?

I understand that hop-server should be the appropriate solution for long-term running and could re-use a connection pool when running scheduled jobs, but I could not find any documentation about how to deploy Hop together with its projects to a server and use hop-server to run a project with specific workflow/pipeline files.

Issue Priority

Priority: 3

Issue Component

Component: Documentation, Component: Hop Server

bamaer (Contributor) commented Jan 7, 2025

Connection pooling was removed in Apache Hop after the fork from PDI.
Connections in a pipeline/workflow are created at the start and released (at the latest) at the end of the execution. Connection pooling adds little to no value in these scenarios (as was already the case in PDI/Kettle).
We could consider documenting the absence of connection pooling, though.
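
As a purely illustrative sketch, not Hop's own code, the contrast described above can be shown with plain JDBC versus a pooled DataSource. HikariCP, the JDBC URL, and the credentials here are stand-ins chosen for the example, not anything Hop ships with or uses internally:

```java
// Illustration only: contrasts a per-execution connection with a pooled one.
// HikariCP, the URL and credentials are placeholders, not part of Apache Hop.
import com.zaxxer.hikari.HikariConfig;
import com.zaxxer.hikari.HikariDataSource;

import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.ResultSet;
import java.sql.SQLException;
import java.sql.Statement;

public class PoolingSketch {

    // What Hop effectively does today: one physical connection per execution,
    // opened at the start and closed (at the latest) at the end.
    static void perExecutionConnection(String url, String user, String pass) throws SQLException {
        try (Connection conn = DriverManager.getConnection(url, user, pass);
             Statement stmt = conn.createStatement();
             ResultSet rs = stmt.executeQuery("SELECT 1")) {
            while (rs.next()) {
                System.out.println(rs.getInt(1));
            }
        } // physical connection is torn down here
    }

    // What a pool would add: close() returns the connection to the pool instead of
    // closing it, which only pays off when many short executions run back-to-back.
    static void pooledConnections(String url, String user, String pass) throws SQLException {
        HikariConfig config = new HikariConfig();
        config.setJdbcUrl(url);
        config.setUsername(user);
        config.setPassword(pass);
        config.setMaximumPoolSize(5);

        try (HikariDataSource pool = new HikariDataSource(config)) {
            for (int i = 0; i < 3; i++) {
                try (Connection conn = pool.getConnection();
                     Statement stmt = conn.createStatement();
                     ResultSet rs = stmt.executeQuery("SELECT 1")) {
                    while (rs.next()) {
                        System.out.println(rs.getInt(1));
                    }
                } // connection goes back to the pool, not to the database
            }
        }
    }
}
```

For a single long-running pipeline the two variants behave the same, which is the point of the comment above: the pool only changes things when executions are numerous and short-lived.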

bamaer closed this as completed Jan 7, 2025
dave-csc (Contributor) commented

Joining this discussion, since I had a similar issue.

In my case the absence of connection pooling caused a sort of temporary ban from a remote database, because I was opening too many connections in a relatively short time (i.e. the time needed to get the results of a SELECT query...): after 2-3 queries my workflow deadlocked with no information in the logs.

To work around this, I grouped the query-related pipelines into a new parent pipeline and set that pipeline's run configuration to transactional. It worked for my needs since I only had to run SELECTs.

The case above can indeed be considered added value for connection pooling. Also, it seems it's currently not possible to close a database transaction while keeping the connection alive for further queries...
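
For context, and strictly outside Hop's own APIs, the effect of that workaround can be sketched in plain JDBC: several SELECTs share one physical connection and one transaction, so the remote server sees a single login instead of one connect/disconnect per query. The JDBC URL, credentials, and table names below are placeholders:

```java
// Plain-JDBC sketch of what the workaround achieves, not Hop code.
// URL, credentials and table names are placeholders.
import java.sql.Connection;
import java.sql.DriverManager;
import java.sql.PreparedStatement;
import java.sql.ResultSet;
import java.sql.SQLException;

public class SharedConnectionSketch {

    public static void main(String[] args) throws SQLException {
        String url = "jdbc:postgresql://remote-host/db"; // placeholder
        String[] queries = {
                "SELECT count(*) FROM table_a",
                "SELECT count(*) FROM table_b",
                "SELECT count(*) FROM table_c"
        };

        // One physical connection for the whole batch, mirroring a transactional
        // run configuration: the remote server sees a single login instead of
        // one connect/disconnect per query.
        try (Connection conn = DriverManager.getConnection(url, "user", "password")) {
            conn.setAutoCommit(false); // everything below runs in one transaction
            for (String sql : queries) {
                try (PreparedStatement ps = conn.prepareStatement(sql);
                     ResultSet rs = ps.executeQuery()) {
                    while (rs.next()) {
                        System.out.println(sql + " -> " + rs.getLong(1));
                    }
                }
            }
            conn.commit(); // read-only work, but the transaction still has to end
        }
    }
}
```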
