https://arxiv.org/abs/2304.01933 (LLM-Adapters) shows that the best-performing adapter-based parameter-efficient fine-tuning (PEFT) method depends on the language model being fine-tuned:
E.g., LoRA is the best adapter for LLaMA-7B, while the Series Adapter (S-adapter) is the best adapter for BLOOM-7.1B.
Why does the best-performing adapter-based PEFT method depend on the language model being fine-tuned?
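For context, here is a minimal NumPy sketch of the two adapter families being compared, applied to a single frozen linear layer. The dimensions, initializations, and function names are illustrative, not taken from the paper; the point is only the structural difference: LoRA adds a trainable low-rank update to the frozen weight, while a series adapter inserts a small bottleneck MLP (with a residual connection) after the layer's output.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 16, 2  # hidden size and adapter bottleneck / LoRA rank (illustrative)

W = rng.standard_normal((d, d))   # frozen pretrained weight
x = rng.standard_normal(d)        # a single activation vector

# LoRA: learn a low-rank update B @ A added to the frozen weight.
A = np.zeros((r, d))              # zero-init so the update starts at zero
B = rng.standard_normal((d, r))

def lora_forward(x):
    return W @ x + B @ (A @ x)

# Series adapter: a small bottleneck MLP inserted in series after the
# frozen layer, with a residual connection around it.
W_down = rng.standard_normal((r, d)) * 0.01
W_up = np.zeros((d, r))           # zero-init so the adapter starts as identity

def series_adapter_forward(x):
    h = W @ x                     # frozen layer output
    return h + W_up @ np.maximum(W_down @ h, 0.0)

# With these initializations, both start out equal to the frozen output.
assert np.allclose(lora_forward(x), W @ x)
assert np.allclose(series_adapter_forward(x), W @ x)
```

Since the two insert trainable capacity at different points (in parallel with the weight vs. in series after it), it is plausible that which one wins interacts with the host model's architecture and activation statistics, which is part of what the question is asking about.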
