News
The zero-shot performance concerning medical evidence summarization was evaluated using two models, GPT-3.5 and ChatGPT. Two experimental setups were designed to assess the models' capabilities.
Results that may be inaccessible to you are currently showing.
Hide inaccessible results