NHacker Next
  • new
  • past
  • show
  • ask
  • show
  • jobs
  • submit
Hierarchical Autoregressive Modeling for Memory-Efficient Language Generation (arxiv.org)
pama 26 days ago [-]
At least the authors acknowledge it for what it is: a tiny model on a tiny corpus and worse than the comparable transformers in terms of accuracy. I like the experimentation with new designs and one doesnt always need to show near SOTA results. From a brief inspection, however, I think it will be hard for the work to become a high profile conference acceptance without significan additional work.
jeffjeffbear 26 days ago [-]
I would really like to see more testing with a deeper hierarchy and alpha and beta nonzero.
mxkopy 25 days ago [-]
Skimming it I get this incredible sci-fi feeling of AI being the thing that solves P vs. NP (the diagrams are reminiscent of boolean/arithmetic circuits which have produced some results in the compcomp space)
Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact
Rendered at 13:00:48 GMT+0000 (Coordinated Universal Time) with Vercel.