How do mixture-of-experts layers affect transformer models?
This new LLM technique has started improving the results of models without additional training.
I’m currently the Director of AI at Rebuy, a personalized search and recommendations platform for D2C e-commerce brands. Prior to Rebuy, I was a Research Scientist at Alegion. Additionally, I worked for Salesforce Commerce Cloud for two years.
This new LLM technique has started improving the results of models without additional training.