Hugging Face
Tong Zhu
Spico
26 followers · 74 following
https://Spico197.github.io
TongZhu197
Spico197
AI & ML interests
Information Extraction, Mixture-of-Experts, LLM
Recent Activity
upvoted a paper 8 days ago
GEMS: Agent-Native Multimodal Generation with Memory and Skills
commented on a paper 12 days ago
Intern-S1-Pro: Scientific Multimodal Foundation Model at Trillion Scale
upvoted an article about 1 month ago
Your MoE Model Does Not Have to Select Fixed Number of Experts
Organizations
Spico's models (7)
Spico/LLaMA-MoE-v1-2_8-UniformSFT • Text Generation • 7B • Updated Feb 28, 2024 • 4
Spico/LLaMA-MoE-v1-2_8-DynamicSFT • Text Generation • 7B • Updated Feb 28, 2024 • 5
Spico/sheared-llama-2.7b-deita-6k-sft • Text Generation • 3B • Updated Feb 25, 2024 • 6 • 1
Spico/internlm2-7b-hf-llama • Text Generation • Updated Feb 23, 2024 • 6
Spico/mirror-chinese-mrcqa-alpha • Updated Dec 4, 2023
Spico/Humback-Myx • Text Generation • Updated Aug 19, 2023 • 7 • 3
Spico/Humback-M0 • Text Generation • Updated Aug 18, 2023 • 11 • 3