cyrano@lemmy.dbzer0.com to Lemmy Shitpost@lemmy.world · 17 days ago
AGI achieved 🤖 (lemmy.dbzer0.com) · 252 comments
Zacryon@feddit.org · 16 days ago
I know that words are tokenized in the vanilla transformer. But do GPT and similar LLMs still do that as well? I assumed they also tokenize at the character/symbol level, possibly mixed with additional abstraction further down the chain.