minot (part 4)
Posted by Alexander Power
Received: 2025-03-28 13:41:49
Channel: Cities - Project Journal
In reply to: minot (part 3) (View Chain)
yesterday:
- three new "greenland" benchmarks: letter count, unit conversion, part-of-speech detection. 💡 ( it should not be a surprise to the contemporary reader that the models struggle the most with "letter count" - how many "r"s are in strawberry.)
still to do:
- code cleanup 💡 ( the "run" method is written in slightly different form seven times)
- more benchmarks
- dashboard UI improvements