Cannot run/install finetuning colab notebook

Question

dotXem opened this issue 5 months ago · comments

Describe the bug

The demo Colab notebook for fine-tuning Llama-2-7b crashes at the third runnable cell when it tries to import torch.

---------------------------------------------------------------------------
TypeError                                 Traceback (most recent call last)
<ipython-input-3-dac5961b998e> in <cell line: 5>()
      3 import logging
      4 import os
----> 5 import torch
      6 import yaml
      7 

12 frames
/usr/lib/python3.10/_pyio.py in __init__(self, buffer, encoding, errors, newline, line_buffering, write_through)
   2043                 encoding = "utf-8"
   2044             else:
-> 2045                 encoding = locale.getpreferredencoding(False)
   2046 
   2047         if not isinstance(encoding, str):

TypeError: <lambda>() takes 0 positional arguments but 1 was given

To Reproduce

1. Go to https://colab.research.google.com/drive/1r4oSEwRJpYKBPM0M0RSh0pBEYK_gBKbe
2. Connect to a T4 GPU
3. Run the first three cells
4. The last cell should fail with the error message above

Expected behavior
It should work!

Environment (please complete the following information):

(not sure if relevant)

Arnav Garg · Answer 1 · Tue Jan 16 2024 01:48:21 GMT+0800 (China Standard Time)

Hi @dotXem! Thanks for reporting the issue - I can confirm that I'm able to repro it with the steps you've provided. Let me get back to you with a root cause and fix soon! Apologies that this didn't work as expected out of the box.

Arnav Garg · Answer 2 · Tue Jan 16 2024 02:07:40 GMT+0800 (China Standard Time)

@dotXem I've found the issue and I've updated the notebook(s) on the Ludwig README including the one you're trying - are you able to give it a quick run through to see if the issue is fixed?

For context, it seems the way we were setting UTF-8 as the default encoding doesn't work well with torch 2.1, and we weren't doing it the recommended way. I've updated the notebook to use the preferred method, and it seems to work well.

This is what I changed:

Before:

import locale; locale.getpreferredencoding = lambda: "UTF-8"

New:

import locale; locale.setlocale(locale.LC_ALL, 'en_US.UTF-8')

Let me know how it goes!
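
For anyone hitting the same traceback, here is a minimal sketch of why the old line seems to fail. The traceback shows `locale.getpreferredencoding(False)` being called with one positional argument, while the patched lambda accepts none. The variable names below are illustrative, and the `try`/`except` guard around `setlocale` is an assumption added for machines where the `en_US.UTF-8` locale may not be installed; it is not part of the notebook.

```python
import locale

# Save the real function so the experiment can be undone afterwards.
real_getpreferredencoding = locale.getpreferredencoding

# The old notebook line: a replacement that accepts no arguments at all.
locale.getpreferredencoding = lambda: "UTF-8"

try:
    # Same call as in the traceback: one positional argument is passed.
    locale.getpreferredencoding(False)
    error_message = None
except TypeError as exc:
    error_message = str(exc)
finally:
    # Restore the original so later code is unaffected.
    locale.getpreferredencoding = real_getpreferredencoding

print(error_message)

# The new notebook line. It can raise locale.Error where the en_US.UTF-8
# locale is not installed, so this sketch guards the call.
try:
    locale.setlocale(locale.LC_ALL, "en_US.UTF-8")
    locale_set = True
except locale.Error:
    locale_set = False

print(locale_set, locale.getpreferredencoding(False))
```

The new line leaves `locale.getpreferredencoding` itself untouched, so callers that pass an argument keep working.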

Maxime DE BOIS · Answer 3 · Tue Jan 16 2024 04:37:55 GMT+0800 (China Standard Time)

It's working! Thanks for the quick fix!