Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
3
2
Balamurugan Balakreshnan
Balab2021
Follow
0 followers
·
8 following
https://balakreshnan.github.io/
balakreshnan
balamurugan-balakreshnan
AI & ML interests
AI & ML, Deep Learning, Large Language models, Large Vision Models, Large Action Models, Small Language models,
Recent Activity
updated
a Space
about 2 months ago
Balab2021/trackio
published
a Space
about 2 months ago
Balab2021/trackio
published
a model
about 2 months ago
Balab2021/smol-course-SmolVLM2-2.2B-Instruct-trl-sft-ChartQA
View all activity
Organizations
Balab2021
's models
32
Sort: Recently updated
Balab2021/smol-course-SmolVLM2-2.2B-Instruct-trl-sft-ChartQA
Updated
Oct 11
Balab2021/gpt-oss-20b-multilingual-reasoner
Updated
Aug 5
Balab2021/sftqwen_finetuned_model_1-5BHS
Text Generation
•
0.9B
•
Updated
Jul 28
•
2
Balab2021/1B_finetuned_llama3.2_HS
Text Generation
•
0.8B
•
Updated
Jul 25
•
3
Balab2021/Qwen2-0.5B-GRPO-test
Updated
Jul 11
Balab2021/ppo-Huggy
Reinforcement Learning
•
Updated
Feb 17
•
32
Balab2021/Taxi-V3
Reinforcement Learning
•
Updated
Feb 17
Balab2021/q-FrozenLake-v1-4x4-noSlippery
Reinforcement Learning
•
Updated
Feb 17
Balab2021/poca-SoccerTwos
Reinforcement Learning
•
Updated
Feb 12
•
17
Balab2021/dqn-SpaceInvadersNoFrameskip-v4
Reinforcement Learning
•
Updated
Feb 6
•
6
Balab2021/Reinforce-model-4-2
Reinforcement Learning
•
Updated
Feb 6
Balab2021/Reinforce-model-4
Reinforcement Learning
•
Updated
Feb 6
Balab2021/rl_course_vizdoom_health_gathering_supreme
Reinforcement Learning
•
Updated
Feb 5
Balab2021/ppo-CartPole-v1
Reinforcement Learning
•
Updated
Feb 5
Balab2021/LunarLander-v2
Reinforcement Learning
•
Updated
Feb 5
Balab2021/a2c-PandaReachDense-v3
Reinforcement Learning
•
Updated
Feb 5
•
2
Balab2021/ppo-PyramidsTraining
Reinforcement Learning
•
Updated
Feb 4
•
7
Balab2021/ppo-SnowballTarget
Reinforcement Learning
•
Updated
Feb 4
•
14
Balab2021/Reinforce-unit4ex2
Reinforcement Learning
•
Updated
Feb 4
Balab2021/Reinforce-unit4
Reinforcement Learning
•
Updated
Feb 4
Balab2021/q-taxi-v3
Reinforcement Learning
•
Updated
Feb 3
Balab2021/ppo-LunarLander-v2
Reinforcement Learning
•
Updated
Feb 3
•
1
Balab2021/DeepSeek-R1-Distill-Llama-8B-Fine-tunedBespoke
Updated
Jan 29
Balab2021/Florence-2-FT-DocVQA
Any-to-Any
•
0.3B
•
Updated
Sep 21, 2024
•
5
Balab2021/bbphi35ftv1
Text Generation
•
4B
•
Updated
Aug 22, 2024
•
3
Balab2021/phi-3-5-mini-LoRA
Updated
Aug 22, 2024
•
4
Balab2021/bbphi3ftv1
Text Generation
•
4B
•
Updated
Aug 12, 2024
•
3
Balab2021/phi-3-mini-LoRA
Updated
Aug 12, 2024
•
1
Balab2021/llama3
Text Generation
•
8B
•
Updated
Apr 20, 2024
•
4
Balab2021/phi2cricketipl
Text Generation
•
3B
•
Updated
Mar 30, 2024
•
3
Previous
1
2
Next