First, we need a dataset for which we’ll be able to tell if the model has trained. Let's create one that will make our model talk like Yoda. We can get a bunch of questions from TriviaQA, and generate responses by prompting an LLM to answer the question while pretending it’s Yoda. Running the script, I get a few thousand prompts and responses that look something like this:
20+ curated newsletters
。关于这个话题,新收录的资料提供了深入分析
Максим Габриелян (ведущий редактор отдела «Мир»)
Microsoft Office Professional Plus 2019 remains one of the most widely used productivity suites for work and home. It includes the familiar tools people rely on to write documents, build spreadsheets, create presentations, manage emails, and organize data.
-H 'Authorization: Bearer <apiToken' \