Gallery
click anywhere to replay
ux-writing-1
27.8B parameters, frozen. One small adapter, warmed by the fire.
What fine-tuning actually touched.
cool
hot
transformer layer →  (dashed: module not present — 48 of 64 blocks use linear attention)