wiktionary-parser

Code for the paper: Wikinflection: Massive semi-supervised generation of multilingual inflectional corpus from Wiktionary (Metheniti and Neumann, 2018)

morphology linguistics inflection computational-linguistics wiktionary wiktionary-parser

Updated Jun 23, 2020
Python

snizio / italian-wiktionary-parser

Star

This repository contains a python script for parsing an xml dump of the Italian Wiktionary (Wikizionario); it also contains the parsed dictionary in a JSON file and a ONLI (italian database of neologisms) scraper with the scraped data in a CSV file

nlp italian corpus-linguistics italiano onli wiktionary-parser wiktionary-data neologisms

Updated May 26, 2025
Python

Surkal / WiktionnaireParser

Star

A library for parsing the french wiktionary

python python3 francais french wiktionary wiktionary-parser

Updated Feb 20, 2022
Python

javalc6 / wikiparser-java

Star

Light Wiki parser and renderer developed in Java and Lua, from wiktionary xml dump to html

java lua mediawiki wikipedia wiktionary multi-language-support wiktionary-parser wiktionary-renderer

Updated Apr 7, 2025
Java

serasset / dbnary

Star

DBnary extractor mirror - See https://gitlab.com/gilles.serasset/dbnary

java lua rdf wiktionary wiktionary-parser ontolex-lexicography

Updated Jul 25, 2025
Java

slowwavesleep / RuWiktionaryParser

Star

Extraction of the Russian word forms and their segmentation from the Russian Wiktionary

morphology segmentation wiktionary russian-language wiktionary-parser word-forms

Updated May 1, 2021
Python

lennon-c / python-wikitext-parser-guide

Star

A Hands-On Guide to Parsing Wikitext with Python

python tutorial guide wiktionary hands-on wiktionary-parser wikitext-parser dewiktionary

Updated Mar 31, 2025
Python

Vuizur / ruwiktionary-htmldump-parser

Star

Parses the Russian Wiktionary HTML dumps into JSON and generates ereader dictionaries

parser language-learning russian wiktionary wiktionary-parser

Updated Aug 10, 2023
Python

beviah / ezglot

Star

Selected data processing scripts including language agnostic multilingual wiktionary parser

multilingual dictionary extractor templates pronunciation levenshtein-distance wikitext ipa similarity-measures language-resources wiktionary thesaurus-data wiktionary-parser wiktionary-data wiktionary-tool wiktionary-dataset word-distance

Updated Mar 31, 2024
Python

Vuizur / dewiktionary-htmldump-parser

Star

A scraper which extracts data from the German Wiktionary HTML dump.

german czech wiktionary wiktionary-parser wiktionary-dump

Updated Jul 19, 2022
Python

dicc-io / wiktioparse

Star

Wiktionary Parser written in Ruby

ruby parser json wiktionary wiktionary-parser wiktionary-data

Updated Dec 17, 2020
Lua

lennon-c / de_wiktio

Star

A Python package to parse and extract data from the German Wiktionary. It allows users to access wikitext content, either by fetching it directly online or by loading a dump file locally.

wikitext wiktionary wiktionary-parser wiktionary-dump dewiktionary

Updated Jan 9, 2025
Python

yuzhoumo / latinator-3000

Star

Web interface for parsing Wiktionary for results in specific languages

dictionary wiktionary-parser

Updated Jan 13, 2022
JavaScript

kennethsible / german-anki

Star

English-German (Sorted by Frequency)

language-learning anki-deck wiktionary-parser

Updated Oct 22, 2024
Python

Improve this page

Add a description, image, and links to the wiktionary-parser topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the wiktionary-parser topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

wiktionary-parser

Here are 30 public repositories matching this topic...

tatuylonen / wiktextract

reader-dict / monolingual

suyashb95 / WiktionaryParser

gambolputty / wiktionary-de-parser

clefourrier / EtymDB

wswu / yawipa

lenakmeth / Wikinflection

snizio / italian-wiktionary-parser

Surkal / WiktionnaireParser

javalc6 / wikiparser-java

serasset / dbnary

slowwavesleep / RuWiktionaryParser

lennon-c / python-wikitext-parser-guide

Vuizur / ruwiktionary-htmldump-parser

beviah / ezglot

Vuizur / dewiktionary-htmldump-parser

dicc-io / wiktioparse

lennon-c / de_wiktio

yuzhoumo / latinator-3000

kennethsible / german-anki

Improve this page

Add this topic to your repo