Alan Tseng's picture

Alan Tseng

agentlans

·

agentlans

AI & ML interests

Small data, boring AI

Recent Activity

updated a collection 2 days ago

updated a model 2 days ago

agentlans/ai-and-human-judgement

published a model 2 days ago

agentlans/ai-and-human-judgement

View all activity

Organizations

None yet

updated a collection 2 days ago

Documents

4 items • Updated 2 days ago • 1

updated a model 2 days ago

agentlans/ai-and-human-judgement

Any-to-Any • Updated 2 days ago

published a model 2 days ago

agentlans/ai-and-human-judgement

Any-to-Any • Updated 2 days ago

updated a dataset 2 days ago

agentlans/traditional-chinese

Viewer • Updated 2 days ago • 5.37M • 721

updated a dataset 3 days ago

agentlans/FineWeb2-Edu-JA-EN

Viewer • Updated 3 days ago • 2.17k • 25 • 1

published a dataset 3 days ago

agentlans/FineWeb2-Edu-JA-EN

Viewer • Updated 3 days ago • 2.17k • 25 • 1

commented on Can Predicted Dynamics Exist in the Physical World? 3 days ago

I'm no expert, but this reminds me of traditional control theory like PID controllers and Kalman filters. Seems like the real benefit of modern AI is learning and adapting the complex, high-level policies you mentioned in the blog post. For autonomous robots and drones, combining classical control with reinforcement learning seems to be the key.

reacted to sergiopaniego's post with ❤️ 3 days ago

Post

6204

new banger blog alert 🚨

@ariG23498 is starting a blog series about profiling in pytorch and part 1 just dropped

takes you from the simplest scenario to actually knowing what your gpu is doing. if you have never opened a profiler trace this is where you start

covers torch.profiler from scratch. reading tables and traces, overhead bound vs compute bound, the full dispatch chain from python to gpu kernels, and what torch.compile is actually fusing under the hood

find it here: https://huggingface.co/blog/torch-profiler

1 reply

·

replied to sergiopaniego's post 3 days ago

This is a really detailed guide to PyTorch profiling—definitely a tool that many AI engineers overlook. That said, the data can still be quite overwhelming. A quick cheatsheet decoding what the core metrics mean (especially for custom kernels, Tensor Cores, or multi-GPU setups) would be a great addition.*

For anyone wanting to go even lower-level on NVIDIA hardware, Nsight Compute is also worth a look for some serious profiling: https://developer.nvidia.com/nsight-compute

Edit: OK I saw the summary at the end of the post. But a concise, self-contained cheatsheet with little graphics would help a lot.

updated a dataset 3 days ago

agentlans/easy-mode

Viewer • Updated 3 days ago • 79.4k • 44

updated a model 4 days ago

agentlans/Skywork-Reward-V2-Llama-3.1-8B-4bit

Text Classification • 8B • Updated 4 days ago • 22

published a model 4 days ago

agentlans/Skywork-Reward-V2-Llama-3.1-8B-4bit

Text Classification • 8B • Updated 4 days ago • 22

updated a model 4 days ago

agentlans/Skywork-Reward-V2-Llama-3.1-8B-8bit

Text Classification • 8B • Updated 4 days ago • 38

published a model 4 days ago

agentlans/Skywork-Reward-V2-Llama-3.1-8B-8bit

Text Classification • 8B • Updated 4 days ago • 38

published a dataset 5 days ago

agentlans/easy-mode

Viewer • Updated 3 days ago • 79.4k • 44

updated a dataset 10 days ago

agentlans/regional-english

Viewer • Updated 10 days ago • 3M • 47

published a dataset 10 days ago

agentlans/regional-english

Viewer • Updated 10 days ago • 3M • 47

updated a model 10 days ago

agentlans/bge-small-en-text-quality

Text Classification • 33.4M • Updated 10 days ago • 56

published a model 10 days ago

agentlans/bge-small-en-text-quality

Text Classification • 33.4M • Updated 10 days ago • 56

updated a dataset 10 days ago

agentlans/text-quality-v3

Viewer • Updated 10 days ago • 100k • 97 • 1