arxiv:2510.15232
Tongyan Hu
entropyhu
AI & ML interests
None yet
Recent Activity
authored
a paper
7 days ago
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in
Finance Domain
upvoted
a
paper
7 days ago
FinTrust: A Comprehensive Benchmark of Trustworthiness Evaluation in
Finance Domain
upvoted
a
paper
26 days ago
MCPMark: A Benchmark for Stress-Testing Realistic and Comprehensive MCP
Use