Zingg on Databricks Example Error #992

Open
mechevarria opened this issue Jan 2, 2025 · 7 comments

Comments

@mechevarria

Describe the bug
When running the Databricks Zingg example, the code fails at the line

zingg.writeLabelledOutputFromPandas(candidate_pairs_pd,args)

To Reproduce
Steps to reproduce the behavior:
An example repo with the notebook is here
This example uses Unity Catalog volumes instead of the deprecated DBFS paths and was run on a 15.4 LTS runtime.

Expected behavior
The example should run to completion.

Screenshots
Screenshot 2025-01-02 at 1 34 08 PM


@sonalgoyal
Member

sonalgoyal commented Jan 3, 2025

Before you execute the cell, make sure that the zingg variable has been initialized with the label phase (the cell with the heading "Prepare for user labeling") and not the trainMatch phase. It seems that Zingg is not set to label. Can you please check and report back if you still see an issue?

@mechevarria
Author

@sonalgoyal it looks like the notebook got a little further today after rerunning all the cells. I did hit another issue a few cells down when generating docs.
Screenshot 2025-01-03 at 12 21 09 PM

@sonalgoyal
Member

Seems like the documenter is unable to write to the volume at the docs location. Could it be a permission issue?
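One way to rule out a plain filesystem permission problem is a quick write probe run from the notebook. The following is only a minimal sketch: `can_write_to` is a hypothetical helper, not part of Zingg or Databricks, and it assumes the volume is reachable as an ordinary local path from the driver. The path in the usage comment is a placeholder.

```python
import os
import uuid


def can_write_to(directory):
    """Try to create and delete a small file in `directory`.

    Returns (True, None) on success, or (False, "<ErrorType>: <message>")
    if the file could not be created or removed.
    """
    # Random file name so the probe does not clobber an existing file.
    probe = os.path.join(directory, f".write_probe_{uuid.uuid4().hex}")
    try:
        with open(probe, "w") as fh:
            fh.write("ok")
        os.remove(probe)
        return True, None
    except OSError as exc:
        return False, f"{type(exc).__name__}: {exc}"


# Usage (placeholder path, substitute your own docs location):
# ok, err = can_write_to("/Volumes/<catalog>/<schema>/<volume>/docs")
# print(ok, err)
```

If the probe succeeds but the documenter still fails, the cause is probably not a simple permission problem on the folder.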

@mechevarria
Author

I'm the owner of both the volume and the cluster. The example project was able to write to the other folders in the volume, so I'm not sure where the permission issue is coming from. I also manually created the docs folder and got the same problem.
Screenshot 2025-01-06 at 10 32 48 AM

@mechevarria
Author

I'm also running into a separate issue in another environment where trainMatch is failing
Screenshot 2025-01-06 120812

@sonalgoyal
Member

> I'm the owner of the volume and the cluster. The example project was able to write to the other folders in the volume so I'm not sure where the permission issue is popping up from. I also manually created the docs folder and got the same problem
> Screenshot 2025-01-06 at 10 32 48 AM

In that case, this is likely happening since UC volumes are not supported on the Zingg Community Version.

@sonalgoyal
Member

> I'm also running into a separate issue in another environment where trainMatch is failing
> Screenshot 2025-01-06 120812

How many records have you labeled? This error indicates that either the matches or the non-matches are 0 in the labeled data. If you have labeled enough records, we have seen this issue go away when the cluster is restarted.
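The label counts mentioned above can be checked before running trainMatch. This is a rough sketch, not Zingg's own API: `summarize_labels` and `ready_for_training` are hypothetical helpers, and the code assumes that the labels live in a `z_isMatch` column encoded as 0 (non-match), 1 (match) and 2 (unsure). Verify that encoding against your Zingg version. Counts are per row, so a labeled pair contributes two rows.

```python
from collections import Counter

# Assumed encoding of Zingg's z_isMatch column; confirm for your version.
NON_MATCH, MATCH, UNSURE = 0, 1, 2


def summarize_labels(labels):
    """Count matches, non-matches and unsure rows in an iterable of labels."""
    counts = Counter(int(value) for value in labels)
    return {
        "match": counts.get(MATCH, 0),
        "non_match": counts.get(NON_MATCH, 0),
        "unsure": counts.get(UNSURE, 0),
    }


def ready_for_training(labels):
    """True only if there is at least one match and one non-match."""
    summary = summarize_labels(labels)
    return summary["match"] > 0 and summary["non_match"] > 0


# Usage with a pandas DataFrame of labeled rows (marked_pd is a placeholder name):
# print(summarize_labels(marked_pd["z_isMatch"].tolist()))
```

If either count is zero, more labeling is needed. If both are non-zero and the error persists, restarting the cluster is the next thing to try, as suggested above.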
