8 1

Party Golem

partygolem

AI & ML interests

None yet

Organizations

None yet

upvoted 2 papers 7 days ago

Qwen3-VL-Embedding and Qwen3-VL-Reranker: A Unified Framework for State-of-the-Art Multimodal Retrieval and Ranking

Paper • 2601.04720 • Published 20 days ago • 51

MegaFlow: Large-Scale Distributed Orchestration System for the Agentic Era

Paper • 2601.07526 • Published 16 days ago • 23

upvoted 6 papers 5 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1, 2025 • 79

TiKMiX: Take Data Influence into Dynamic Mixture for Language Model Pre-training

Paper • 2508.17677 • Published Aug 25, 2025 • 14

R-4B: Incentivizing General-Purpose Auto-Thinking Capability in MLLMs via Bi-Mode Annealing and Reinforce Learning

Paper • 2508.21113 • Published Aug 28, 2025 • 110

Morae: Proactively Pausing UI Agents for User Choices

Paper • 2508.21456 • Published Aug 29, 2025 • 5

PVPO: Pre-Estimated Value-Based Policy Optimization for Agentic Reasoning

Paper • 2508.21104 • Published Aug 28, 2025 • 37

Meta-Reasoning Improves Tool Use in Large Language Models

Paper • 2411.04535 • Published Nov 7, 2024 • 1

liked a Space almost 2 years ago

WebGPU Embedding Benchmark

Measure BERT model performance using WebGPU and WASM