Semantic similarity between essays and a theme #166

Open
RaphaelSilv opened this issue Aug 27, 2023 · 1 comment

RaphaelSilv commented Aug 27, 2023

Hi guys, thanks for this fantastic project.

I intend to use it to measure the similarity between essays written by students and a given theme. The theme is a one-line sentence, and each essay has a couple of paragraphs. In my dataset, essays that conform to the theme have a positive score ranging from 20 to 200, while essays that ignore the theme receive 0.
From skimming the original paper and experimenting with a pre-trained BERT model for the relevant language instead of the default one, this seems doable, although not perfect. I still have some doubts about how to use the weighting, which I hope will improve the measurements.

Anyhoo, any advice on how to approach this task? Any dos and don'ts are welcome 😃
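
For reference, the setup described above could look roughly like the sketch below. It is a sketch under assumptions, not the exact code used: `build_pairs` and `score_essays` are hypothetical helper names, `lang="en"` is only a placeholder (the essays' language is not stated here), and it assumes the `bert_score` package is installed. Each essay is passed as a candidate and the theme sentence is repeated as the reference.

```python
from typing import List, Tuple


def build_pairs(essays: List[str], theme: str) -> Tuple[List[str], List[str]]:
    """Pair every essay (candidate) with the same theme sentence (reference)."""
    cands = [essay.strip() for essay in essays]
    refs = [theme.strip()] * len(cands)
    return cands, refs


def score_essays(essays: List[str], theme: str, lang: str = "en", use_idf: bool = True):
    """Return per-essay precision, recall and F1 lists computed by bert_score.

    `lang` is a placeholder; set it to the language of the essays.
    """
    # Imported lazily so build_pairs works without the package installed.
    from bert_score import score

    cands, refs = build_pairs(essays, theme)
    # idf=True switches on idf weighting of tokens.
    P, R, F1 = score(cands, refs, lang=lang, idf=use_idf)
    return P.tolist(), R.tolist(), F1.tolist()
```

Two things may be worth checking with this layout. With the theme as the reference, recall measures how well the theme's tokens are matched somewhere in the essay, so it is probably the most relevant of the three values. Also, as far as I understand, `idf=True` derives the weights from the reference sentences, so with a single repeated theme the weighting may carry little information.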

RaphaelSilv commented Aug 30, 2023

Hey guys, some updates:

After computing the BERTScore values, I averaged them and plotted the results as scatter plots.

As the images show, I didn't find any correlation between any of the BERTScore values and the essay scores. Does anyone have any input that would help improve these results? Thank you!

[Images: three scatter plots of BERTScore values against essay scores]
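
To put a number on what the scatter plots suggest, a rank correlation between the BERTScore values and the essay scores can be computed. The sketch below is a plain-Python Spearman correlation with average ranks for ties, which matters here because many essays share the score 0. The function names are hypothetical and nothing in it comes from the `bert_score` package.

```python
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1 to n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        # Extend j to cover the whole run of values tied with position i.
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equally long sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / (var_x * var_y) ** 0.5


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation of the average ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

Calling `spearman(f1_values, essay_scores)` once per BERTScore component gives a single value per plot. Since the scores are 0 for off-theme essays and 20 to 200 otherwise, it may also be worth comparing the two groups separately rather than looking only at one overall correlation.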
