Hi all, I built a wordlist-generator that uses a semantic knowledge graph instead of pure string manipulation.
You feed it a list of keywords and it builds a hypernym DAG using WordNet, expands it with LLM-generated hyponyms, scores leaf pairs by semantic similarity (Wu-Palmer), and permutes synonyms to produce the final wordlist. For terms WordNet doesn't know (brand names, games, slang) an LLM iteratively finds a valid hypernym using a Wikipedia summary as context.
The wordlist use case is the obvious one, but honestly the core engine is just a semantic expander: given a few seed words, it grows a contextually coherent vocabulary around them. I can see it being useful for:
- NLP / ML — data augmentation, building domain-sp ecific vocabularies, corpus enrichment
- Ontology / knowledge graphs* quick concept mapping from a small seed set.
Supports OpenAI or local models via llama.cpp.
Code: https://github.com/ivegotanheadache/WonaBee
Curious if anyone sees other uses for this kind of approach and likes it
[link] [comments]
from hacking: security in practice https://ift.tt/9L21BhR
Comments
Post a Comment