45 14 7

Dhaval Patel

DhavalPatel

dhaval-patel-2b287033

AI & ML interests

None yet

Recent Activity

new activity about 21 hours ago

ibm-research/AssetOpsBench:Update data/scenarios/all_utterance.jsonl

updated a dataset about 23 hours ago

ibm-research/AssetOpsBench

new activity about 24 hours ago

ibm-research/AssetOpsBench:Update data/scenarios/all_utterance.jsonl

View all activity

Organizations

upvoted an article 5 days ago

Article

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

9 days ago

•

upvoted a paper 6 days ago

SPIRAL: Symbolic LLM Planning via Grounded and Reflective Search

Paper • 2512.23167 • Published Dec 29, 2025 • 1

upvoted an article 13 days ago

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

17 days ago

•

upvoted a collection 18 days ago

Enterprise Agents and Benchmarks

Collection

Enterprise agent ecosystem featuring AssetOpsBench (industrial) and ITBench (SRE, FinOps, CISO), CUGA to accelerate AI Automation • 10 items • Updated 6 days ago • 14

upvoted an article 19 days ago

Article

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality

about 1 month ago

•

upvoted a collection 4 months ago

AI-Agent-4-Industry-4.0

Collection

This category highlights the collective efforts of the AI Automation team in advancing Industry 4.0 applications and exploring innovations beyond it. • 6 items • Updated Oct 8, 2025 • 7

upvoted 3 collections 5 months ago

upvoted 5 papers 8 months ago

Multi-Agent Design: Optimizing Agents with Better Prompts and Topologies

Paper • 2502.02533 • Published Feb 4, 2025 • 4

AssetOpsBench: Benchmarking AI Agents for Task Automation in Industrial Asset Operations and Maintenance

Paper • 2506.03828 • Published Jun 4, 2025 • 17

Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions

Paper • 2506.08234 • Published Jun 9, 2025 • 9

SmartPilot: A Multiagent CoPilot for Adaptive and Intelligent Manufacturing

Paper • 2505.06492 • Published May 10, 2025 • 2

FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure Modes

Paper • 2506.03278 • Published Jun 3, 2025 • 6

Dhaval Patel

AI & ML interests

Recent Activity

Organizations

DhavalPatel's activity

OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments

Community Evals: Because we're done trusting black-box leaderboards over the community

AssetOpsBench: Bridging the Gap Between AI Agent Benchmarks and Industrial Reality