When will the D34 or D32 models be added?

#11
by harshmalik98 - opened

image
I want to SFT the D34 or D32 models, the example used here tells us to use the D32 model, but this is not available from what i can tell. Hence i get this error!
ValueError: The checkpoint you are trying to load has model type nanochat but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

nanochat students org

@harshmalik98 Installing Transformers from source with the command pip install git+https://github.com/huggingface/transformers.git, then restarting the session will resolve this issue.

Sign up or log in to comment