The model is trained on his own Orca-style dataset, plus some Airoboros data, apparently to increase creativity.
Quants:
https://huggingface.co/TheBloke/dolphin-2.0-mistral-7B-GPTQ
It depends on the learning rate. Typically it's ideal, and gives higher quality, to learn really slowly over a lot of epochs, but it's cheaper and obviously faster to learn fast over fewer epochs.
The dataset size is also important to consider, since it determines how many updates the model gets per epoch.
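To make the dataset-size point concrete, here's a rough sketch of how the number of weight updates scales with dataset size, batch size, and epochs. The function name and all the numbers are made up for illustration; they are not the actual settings used for this model.

```python
import math

def total_optimizer_steps(num_examples: int, batch_size: int, epochs: int) -> int:
    """Count weight updates, assuming one update per batch
    (no gradient accumulation) and a partial final batch is still used."""
    steps_per_epoch = math.ceil(num_examples / batch_size)
    return steps_per_epoch * epochs

# Hypothetical numbers: same batch size and epoch count, two dataset sizes.
small = total_optimizer_steps(num_examples=10_000, batch_size=32, epochs=3)
large = total_optimizer_steps(num_examples=100_000, batch_size=32, epochs=3)
print(small, large)
```

With the same epoch count, the larger dataset gets roughly ten times as many updates, so "how many epochs" only means something once you know the dataset size and the learning rate alongside it.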