Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

关于BGE的蒸馏问题 #137

Open
hgwu4869 opened this issue Nov 6, 2023 · 2 comments
Open

关于BGE的蒸馏问题 #137

hgwu4869 opened this issue Nov 6, 2023 · 2 comments
Labels
question Further information is requested

Comments

@hgwu4869
Copy link

hgwu4869 commented Nov 6, 2023

请问text2vec-bge-large-chinese这个模型,是基于BGE做知识蒸馏得到的吗?
如果是的话,请问能提供蒸馏这部分的代码吗?
虽然已经给出参考了的sentence transformer的哪部分代码,但如果有直接可run的代码会更方便些。

@hgwu4869 hgwu4869 added the question Further information is requested label Nov 6, 2023
@shibing624
Copy link
Owner

  1. 不是蒸馏,是二次训练,发现bge对短文本相似度给分普遍较高,故针对短文本,用cosent方法在sts-b-zh数据集上训练后得到的;
  2. sentence transformer里面有示例,可以直接跑

@hgwu4869
Copy link
Author

hgwu4869 commented Nov 6, 2023

好的,在README里BGE和模型蒸馏连在一起,所以误解了。
那么请问README里,如下图所示的模型蒸馏这部分是想说明什么呢?在该项目里有知识蒸馏相关的代码示例能直接跑吗?

text2vec-bge-模型蒸馏

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

2 participants