Waseem AlShikh's picture

Waseem AlShikh

wassemgtk

·

https://writer.com/

AI & ML interests

Multi-modal, Palmyra LLMs, Knowledge Graph

Recent Activity

updated a model about 14 hours ago

wassemgtk/glm-5.2-visual-runtime

posted an update about 16 hours ago

Built GLM-5.2-visual-runtime: a training-free multimodal runtime gateway that makes GLM-5.2 work like a vision-capable model. It keeps images as persistent visual variables, runs local visual/OCR/chart/palette tools only when needed, and sends compact structured evidence to the reasoning model instead of retraining or modifying weights. The one-click stack includes GLM-5.2 via vLLM, Qwen3-Omni for vision/omni input, local OCR, Postgres, MinIO, and an OpenAI-compatible API. Model repo: https://huggingface.co/wassemgtk/glm-5.2-visual-runtime

updated a Space about 16 hours ago

wassemgtk/glm-5-2-visual-runtime-space

View all activity

Organizations

wassemgtk 's papers 7

arxiv:2602.03338

arxiv:2505.24726

arxiv:2502.06329

arxiv:2408.14906

arxiv:2405.02048

arxiv:2402.17553

arxiv:2307.03692