Discussion about this post

Shiva Kakkar

This is such an awesome post. I have been looking for a guide to finetuning. Unfortunately, most guides focus on the technicalities of finetuning, but no one talks about the structure of the data. Your writeup is really valuable!

SomeMako

Hey, thank you for explaining the details! While reading the article, a question arose: can the training method start training the model as if some of its context were already loaded? Come to think of it, there's probably no need for an AI model to learn how to start a conversation from a context filled with zeroes. Moreover, an RP or chat model may not need to learn how to write a prompt for itself at all: it should work well inside a prompt, not learn to be the one who writes prompts.
