Using YDF in databricks with spark #130

Open
marcoreuter opened this issue Sep 7, 2024 · 2 comments

Comments

@marcoreuter

Hi!

I really love using your package, but I work in Databricks with Spark. Is there currently any way to make the package's distributed training work in this environment? I've been trying, but with no luck. Are there plans for a future integration?

Thank you!

@rstz
Collaborator

rstz commented Sep 16, 2024

Hi,

We haven't tried this yet, but we're actively working on improving distributed training support. I'll keep this issue open and report any news or experiments we can share.

@DawnweiSong

In case anyone else runs into the same problem, here is what worked for me on Databricks DCS 13.3:

!mkdir -p /tmp/xsong
!wget -O /tmp/xsong/ydf-0.9.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl https://files.pythonhosted.org/packages/ea/00/f36d107ce13552008b5f30d1b2348528cb97944c2f1247dfd913cc3f91b1/ydf-0.9.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
%pip install /tmp/xsong/ydf-0.9.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl
%pip install --upgrade protobuf
dbutils.library.restartPython()
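
The wheel installed above is tagged `cp310` and `manylinux ... x86_64`, so it will only install on a matching interpreter. A minimal sketch for checking this before downloading, assuming a CPython interpreter on the cluster; the helper names `parse_wheel_tags` and `interpreter_tag` are made up for illustration and are not part of YDF or Databricks:

```python
import sys


def parse_wheel_tags(filename: str) -> dict:
    """Split a wheel filename into its components.

    Wheel filenames follow name-version[-build]-python-abi-platform.whl.
    """
    if not filename.endswith(".whl"):
        raise ValueError(f"not a wheel file: {filename}")
    parts = filename[: -len(".whl")].split("-")
    if len(parts) not in (5, 6):
        raise ValueError(f"unexpected wheel filename layout: {filename}")
    return {
        "name": parts[0],
        "version": parts[1],
        "python": parts[-3],
        "abi": parts[-2],
        "platform": parts[-1],
    }


def interpreter_tag() -> str:
    """Return a CPython-style tag such as 'cp310' (assumes CPython)."""
    return f"cp{sys.version_info.major}{sys.version_info.minor}"


# Filename taken from the install commands above.
WHEEL = "ydf-0.9.0-cp310-cp310-manylinux_2_17_x86_64.manylinux2014_x86_64.whl"

tags = parse_wheel_tags(WHEEL)
if tags["python"] == interpreter_tag():
    print("wheel matches this interpreter")
else:
    print(f"wheel targets {tags['python']}, interpreter is {interpreter_tag()}")
```

If the tags differ, a wheel built for the cluster's Python version would be needed instead. This only compares the Python tag; it does not check the platform or glibc version.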
