Carlos García's picture
Open to Collab

Carlos García

cgarciams
·

AI & ML interests

Building a GPT-2 medium size (approx. 400 M parameters) model from scratch, using PyTorch, the OpenWebText dataset, tiktoken, AdamW optimizer and FlashAttention. Just for fun.

Recent Activity

updated a model 1 day ago
cgarciams/gpt_124m
published a model 3 days ago
cgarciams/gpt_124m
published a dataset 13 days ago
cgarciams/hle-text
View all activity

Organizations

Universidad de Zaragoza's profile picture