OctoThinker/OctoThinker-3B-Hybrid-Zero
Text Generation
β’ 4B β’ Updated β’ 8
β’ 1
OctoThinker/OctoThinker-3B-Hybrid-Base
Text Generation
β’ 3B β’ Updated β’ 4.27k
β’ 1
OctoThinker/OctoThinker-3B-Short-Zero
Text Generation
β’ 4B β’ Updated β’ 5
β’ 1
OctoThinker/OctoThinker-3B-Short-Base
Text Generation
β’ 3B β’ Updated β’ 18
OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_100B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_91_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_general_ins_89_10_1_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_91_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_megamath_web_pro_max_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_megamath_web_pro_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/Llama_32_3B_finemath_4p_bs4M_seq8k_20B
Text Generation
β’ Updated OctoThinker/OctoThinker-3B-Long-Zero
Text Generation
β’ 4B β’ Updated β’ 7
OctoThinker/OctoThinker-1B-Short-Zero
Text Generation
β’ 1B β’ Updated β’ 10
OctoThinker/OctoThinker-1B-Hybrid-Zero
Text Generation
β’ 1B β’ Updated β’ 2
OctoThinker/OctoThinker-1B-Long-Zero
Text Generation
β’ 1B β’ Updated β’ 4
OctoThinker/OctoThinker-3B-Long-Base
Text Generation
β’ 3B β’ Updated β’ 11
β’ 1
OctoThinker/OctoThinker-1B-Short-Base
Text Generation
β’ 1B β’ Updated β’ 30
OctoThinker/OctoThinker-1B-Hybrid-Base
Text Generation
β’ 1B β’ Updated β’ 34
β’ 1
OctoThinker/OctoThinker-1B-Long-Base
Text Generation
β’ 1B β’ Updated β’ 5
OctoThinker/OctoThinker-8B-Short-Base
Text Generation
β’ 8B β’ Updated β’ 9
β’ 1
OctoThinker/OctoThinker-8B-Hybrid-Base
Text Generation
β’ 8B β’ Updated β’ 12.9k
β’ 2
OctoThinker/OctoThinker-8B-Long-Base
Text Generation
β’ 8B β’ Updated β’ 5
OctoThinker/Llama_32_3B_megamath_web_pro_open_r1_longcot_general_ins_89_10_1_bs4M_seq16k_20B
Updated
OctoThinker/Llama_32_3B_megamath_web_pro_megamath_synth_qa_31_bs4M_seq8k_20B
Updated
OctoThinker/Llama3.2-3B-Zero
4B β’ Updated β’ 2