[MODULE] - Noun chunker/ splitter #384

LeonardPuettmannKern · 2023-10-18T12:25:01Z

Please describe the module you would like to add to bricks
A brick that returns an embedding list containing only the nouns of a text, so that they can be used as pointers.

Do you already have an implementation?

ATTRIBUTE = "text" 

def noun_splitter(record):
    nouns_sents = []
    for sent in record[ATTRIBUTE].sents:
        nouns = [token.text for token in sent if token.pos_ == "NOUN" and len(token.text) > 1]
        if nouns:
            nouns_sents.extend([" ".join(nouns[i:i+1]) for i in range(0, len(nouns), 1)])
    return list(set(nouns_sents))

Additional context
Can be implemented with SpaCy.

LeonardPuettmannKern added enhancement New feature or request cognition labels Oct 18, 2023

LeonardPuettmannKern mentioned this issue Oct 18, 2023

Noun splitter #385

Merged

13 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[MODULE] - Noun chunker/ splitter #384

[MODULE] - Noun chunker/ splitter #384

LeonardPuettmannKern commented Oct 18, 2023

[MODULE] - Noun chunker/ splitter #384

[MODULE] - Noun chunker/ splitter #384

Comments

LeonardPuettmannKern commented Oct 18, 2023