关注
Jen-tse Huang
Jen-tse Huang
在 cse.cuhk.edu.hk 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine
W Jiao, W Wang, J Huang, X Wang, S Shi, Z Tu
arXiv preprint: 2301.08745, 2023
502*2023
Improving Adversarial Transferability via Neuron Attribution-Based Attacks
J Zhang, W Wu, J Huang, Y Huang, W Wang, Y Su, MR Lyu
CVPR'22, 14993-15002, 2022
1032022
GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher
Y Yuan, W Jiao, W Wang, J Huang, P He, S Shi, Z Tu
ICLR'24, 2024
392024
ParroT: Translating during Chat using Large Language Models tuned with Human Translation and Feedback
W Jiao, J Huang, W Wang, Z He, T Liang, X Wang, S Shi, Z Tu
EMNLP'23 Findings, 15009-15020, 2023
36*2023
Improving the Transferability of Adversarial Samples by Path-Augmented Method
J Zhang, J Huang, W Wang, Y Li, W Wu, X Wang, Y Su, MR Lyu
CVPR'23, 8173-8182, 2023
252023
Revisiting the Reliability of Psychological Scales on Large Language Models
J Huang, W Wang, MH Lam, EJ Li, W Jiao, MR Lyu
arXiv preprint: 2305.19926, 2023
20*2023
InCharacter: Evaluating Personality Fidelity in Role-Playing Agents through Psychological Interviews
X Wang, Y Xiao, J Huang, S Yuan, R Xu, H Guo, Q Tu, Y Fei, Z Leng, ...
arXiv preprint: 2310.17976, 2023
16*2023
Emotionally Numb or Empathetic? Evaluating How LLMs Feel Using EmotionBench
J Huang, MH Lam, EJ Li, S Ren, W Wang, W Jiao, Z Tu, MR Lyu
arXiv preprint: 2308.03656, 2023
162023
AEON: A Method for Automatic Evaluation of NLP Test Cases
J Huang, J Zhang, W Wang, P He, Y Su, MR Lyu
ISSTA'22, 202-214, 2022
162022
MTTM: Metamorphic Testing for Textual Content Moderation Software
W Wang, J Huang, W Wu, J Zhang, Y Huang, S Li, P He, MR Lyu
ICSE'23, 2387-2399, 2023
132023
All Languages Matter: On the Multilingual Safety of Large Language Models
W Wang, Z Tu, C Chen, Y Yuan, J Huang, W Jiao, MR Lyu
arXiv preprint: 2310.00905, 2023
132023
Tencent's Multilingual Machine Translation System for WMT22 Large-Scale African Languages
W Jiao, Z Tu, J Li, W Wang, J Huang, S Shi
WMT'22, 1049-1056, 2022
132022
On the Humanity of Conversational AI: Evaluating the Psychological Portrayal of LLMs
J Huang, W Wang, EJ Li, MH Lam, S Ren, Y Yuan, W Jiao, Z Tu, MR Lyu
ICLR'24, 2024
11*2024
Not All Countries Celebrate Thanksgiving: On the Cultural Dominance in Large Language Models
W Wang, W Jiao, J Huang, R Dai, J Huang, Z Tu, MR Lyu
arXiv preprint: 2310.12481, 2023
62023
An Image is Worth a Thousand Toxic Words: A Metamorphic Testing Framework for Content Moderation Software
W Wang, J Huang, J Huang, C Chen, J Gu, P He, MR Lyu
ASE'23, 1339-1351, 2023
52023
Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models
T Liang, Z He, J Huang, W Wang, W Jiao, R Wang, Y Yang, Z Tu, S Shi, ...
arXiv preprint: 2310.20499, 2023
32023
How Far Are We on the Decision-Making of LLMs? Evaluating LLMs' Gaming Ability in Multi-Agent Environments
J Huang, EJ Li, MH Lam, T Liang, W Wang, Y Yuan, W Jiao, X Wang, Z Tu, ...
arXiv preprint: 2403.11807, 2024
12024
The Earth is Flat? Unveiling Factual Errors in Large Language Models
W Wang, J Shi, Z Tu, Y Yuan, J Huang, W Jiao, MR Lyu
arXiv preprint: 2401.00761, 2024
12024
A & B == B & A: Triggering Logical Reasoning Failures in Large Language Models
Y Wan, W Wang, Y Yang, Y Yuan, J Huang, P He, W Jiao, MR Lyu
arXiv preprint: 2401.00757, 2024
12024
New Job, New Gender? Measuring the Social Bias in Image Generation Models
W Wang, H Bai, J Huang, Y Wan, Y Yuan, H Qiu, N Peng, MR Lyu
arXiv preprint: 2401.00763, 2024
2024
系统目前无法执行此操作,请稍后再试。
文章 1–20