Luke Alonso PRO
lukealonso
AI & ML interests
None yet
Recent Activity
new activity 4 days ago
canada-quant/DeepSeek-V4-Pro-NVFP4-FP8-MTP: Lossless?
new activity 17 days ago
lukealonso/MiMo-V2.5-NVFP4: Looping in OpenCode
updated a model 19 days ago
lukealonso/MiMo-V2.5-NVFP4
Organizations
None yet
Lossless?
👀 1
1
#1 opened 4 days ago
by
lukealonso
Looping in OpenCode
👀 1
5
#4 opened 25 days ago
by
Jon-Nielsen
The original repository has updated some files. Does this repository need to be updated?
1
#7 opened 19 days ago
by
fanhed
Serving on two devices
3
#3 opened 25 days ago
by
shadowlilac
Will it work on 2X6000 Pros
6
#1 opened about 1 month ago
by
mtcl
Why not GGUF?
#6 opened 20 days ago
by
Nerdsking
Quantization of the Model
1
#9 opened about 1 month ago
by
shiva2022
Link to model and docker image
👍 1
1
#2 opened 26 days ago
by
Jon-Nielsen
Fix tool calling: support array-formatted tool content (vLLM/SGLang)
#8 opened about 1 month ago
by
cudaoom
w1 not matching w3 weight scales
12
#1 opened about 2 months ago
by
dareposte
tokenizer component mismatch and w1_weight_scale_2 must match w3_weight_scale_2. Accuracy may be affected issue
1
#5 opened about 2 months ago
by
mtcl
RuntimeError: The size of tensor a (3072) must match the size of tensor b (6144) at non-singleton dimension 1
3
#5 opened about 2 months ago
by
lianyouzao
From "Doesn't Work" to 641 tok/s: GLM-5.1 NVFP4 on 6× RTX PRO 6000 Blackwell
🔥 1
#4 opened about 2 months ago
by
sakamakismile
Hopper GPU?
1
#2 opened about 2 months ago
by
AndrewMatienko
Request: NVFP4 version of MiniMax-M2.5-REAP-139B (to fit on a single RTX 6000 Pro)
14
#7 opened 3 months ago
by
mondovero
Crash on first request on RTX Pro 6000 x8
👍 1
6
#3 opened 3 months ago
by
koushd
nvfp4
➕👍 2
1
#1 opened 3 months ago
by
ktsaou
VLLM error for kv weight scaling - workaround
7
#6 opened 3 months ago
by
ShaunEvansMD
fp8 kv cache
15
#4 opened 3 months ago
by
festr2
Thanks for your effort
5
#5 opened 3 months ago
by
darkstar3537