jne2356 on Jan 18, 2021 | on: GPT-Neo – Building a GPT-3-sized model, open sourc...
OpenAI trained the full-sized GPT-3 only once. The hyperparameter sweep was conducted on significantly smaller models (see: https://arxiv.org/abs/2001.08361).