minot (part 2)

Replies:

for today:

  • one investment meeting ⚙️ ( as always, further information withheld)
  • tutoring 💡 ( details withheld, but some information may make it to a post) 🌎 ( due to staff illness, tutoring was canceled)
  • maybe try again to get Claude to see the light about the lexer/parser system 💡 ( currently, the lexer is looking ahead to capture an entire text block for the emphasis tag, rather than letting the parser do this)
  • add HTML metadata linking to the RSS feed here.
  • maybe get the LLM dashboard available publicly 💡 ( but first, I would like 3 or 4 more of the "proficiency" tests on the dashboard.)

apparently https://github.com/ollama/ollama/issues/7978 is fixed. This was the "Ollama always returns structured-JSON responses in alphabetical order" bug. Which, when the two fields were thought and answer, was a major problem.

I need Claude to remove the code that was added to work-around this problem. Which is at least a 1-hour time commitment.

At that point, Claude should be able to construct new "proficiency" benchmarks fairly easily. 💡 ( with a perfect system, it would take only 5 minutes per test. In the current world, I am hoping for 20-30 minutes.)