Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
Manuel Barca-Grama's picture
1 4 382

Manuel Barca-Grama

jackangel
Gargaz's profile picture Sakalti's profile picture 21world's profile picture
ยท

AI & ML interests

None yet

Recent Activity

liked a model about 14 hours ago
inclusionAI/ZwZ-8B
liked a model 1 day ago
mradermacher/Rio-3.0-Open-Mini-i1-GGUF
reacted to marksverdhei's post with ๐Ÿ‘ 4 days ago
Poll: Will 2026 be the year of subquadratic attention? The transformer architecture is cursed by its computational complexity. It is why you run out of tokens and have to compact. But some would argue that this is a feature not a bug and that this is also why these models are so good. We've been doing a lot of research on trying to make equally good models that are computationally cheaper, But so far, none of the approaches have stood the test of time. Or so it seems. Please vote, don't be shy. Remember that the Dunning-Kruger effect is very real, so the person who knows less about transformers than you is going to vote. We want everyone's opinion, no matter confidence. ๐Ÿ‘ if you think at least one frontier model* will have no O(n^2) attention by the end of 2026 ๐Ÿ”ฅ If you disagree * Frontier models - models that match / outperform the flagship claude, gemini or chatgpt at the time on multiple popular benchmarks
View all activity

Organizations

None yet

jackangel 's Spaces 1

pinned
Runtime error

codellama-13b-instruct-gguf

๐Ÿฆ€

May 14, 2024
Company
TOS Privacy About Careers
Website
Models Datasets Spaces Pricing Docs