SubQ
First frontier model with Subquadratic Sparse Attention (SSA). 12M token context, 52x faster than FlashAttention at 1M tokens, costs under 5% of Claude Opus.
Visit Website arrow_outwardEnjoying this?
Subscribe to get new solo-founder resources when they are actually published.