Linguistics Software for BSD

Browse free open source Linguistics software and projects for BSD below. Use the toggles on the left to filter open source Linguistics software by OS, license, language, programming language, and project status.

  • Our Free Plans just got better! | Auth0 Icon
    Our Free Plans just got better! | Auth0

    With up to 25k MAUs and unlimited Okta connections, our Free Plan lets you focus on what you do best—building great apps.

    You asked, we delivered! Auth0 is excited to expand our Free and Paid plans to include more options so you can focus on building, deploying, and scaling applications without having to worry about your security. Auth0 now, thank yourself later.
    Try free now
  • Gen AI apps are built with MongoDB Atlas Icon
    Gen AI apps are built with MongoDB Atlas

    Build gen AI apps with an all-in-one modern database: MongoDB Atlas

    MongoDB Atlas provides built-in vector search and a flexible document model so developers can build, scale, and run gen AI apps without stitching together multiple databases. From LLM integration to semantic search, Atlas simplifies your AI architecture—and it’s free to get started.
    Start Free
  • 1
    Virastyar

    Virastyar

    Virastyar is an spell checker for low-resource languages

    Virastyar is a free and open-source (FOSS) spell checker. It stands upon the shoulders of many free/libre/open-source (FLOSS) libraries developed for processing low-resource languages, especially Persian and RTL languages Publications: Kashefi, O., Nasri, M., & Kanani, K. (2010). Towards Automatic Persian Spell Checking. SCICT. Kashefi, O., Sharifi, M., & Minaie, B. (2013). A novel string distance metric for ranking Persian respelling suggestions. Natural Language Engineering, 19(2), 259-284. Rasooli, M. S., Kahefi, O., & Minaei-Bidgoli, B. (2011). Effect of adaptive spell checking in Persian. In NLP-KE Contributors: Omid Kashefi Azadeh Zamanifar Masoumeh Mashaiekhi Meisam Pourafzal Reza Refaei Mohammad Hedayati Kamiar Kanani Mehrdad Senobari Sina Iravanin Mohammad Sadegh Rasooli Mohsen Hoseinalizadeh Mitra Nasri Alireza Dehlaghi Fatemeh Ahmadi Neda PourMorteza
    Leader badge
    Downloads: 634 This Week
    Last Update:
    See Project
  • 2
    iramuteq
    IRAMUTEQ : Interface de R pour les Analyses Multidimensionnelles de Textes et de Questionnaires. Logiciel de traitement de données pour des corpus texte ou de type individus/caractères. Permet notamment de réaliser des analyses de type "ALCESTE"
    Leader badge
    Downloads: 1,491 This Week
    Last Update:
    See Project
  • 3

    Presage

    the intelligent predictive text entry platform

    Presage (formerly Soothsayer) is an intelligent predictive text entry system. Presage generates predictions by modelling natural language as a combination of redundant information sources. Presage computes probabilities for words which are most likely to be entered next by merging predictions generated by the different predictive algorithms. Presage's modular and extensible architecture allows its language model to be extended and customized to utilize statistical, syntactic, and semantic predictive algorithms. Presage's predictive capabilities are implemented by predictive plugins. Predictive plugins use services provided by the platform to implement multiple prediction techniques.
    Leader badge
    Downloads: 267 This Week
    Last Update:
    See Project
  • 4
    Mishkal: Arabic Text Vocalization

    Mishkal: Arabic Text Vocalization

    Arabic Text Vocalization system

    Automatic system of vocalization of arabic text.
    Downloads: 94 This Week
    Last Update:
    See Project
  • Build Securely on AWS with Proven Frameworks Icon
    Build Securely on AWS with Proven Frameworks

    Lay a foundation for success with Tested Reference Architectures developed by Fortinet’s experts. Learn more in this white paper.

    Moving to the cloud brings new challenges. How can you manage a larger attack surface while ensuring great network performance? Turn to Fortinet’s Tested Reference Architectures, blueprints for designing and securing cloud environments built by cybersecurity experts. Learn more and explore use cases in this white paper.
    Download Now
  • 5
    Artha ~ The Open Thesaurus
    Artha is a handy thesaurus based on WordNet with distinct features like global hotkey look-up, passive desktop notifications, regular expression based search, etc.. Artha may be used as a free open-source replacement to the proprietary WordWeb Pro.
    Leader badge
    Downloads: 70 This Week
    Last Update:
    See Project
  • 6
    Google Translate PHP

    Google Translate PHP

    Free Google Translate API PHP Package

    A simple and effective PHP library for translating text using Google Translate without needing an API key. It allows developers to integrate real-time translation features into their applications with minimal setup and supports multiple languages, leveraging Google Translate’s unofficial endpoint.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 7
    Varamozhi is a free English-Malayalam transliteration library. It can transliterate Malayalam text between Malayalam and English scripts. Varamozhi takes as the input, the mapping between a Malayalam font and a transliteration scheme; outputs functions i
    Leader badge
    Downloads: 96 This Week
    Last Update:
    See Project
  • 8
    Apertium: Machine Translation Toolbox

    Apertium: Machine Translation Toolbox

    The free and open-source rule-based machine translation platform

    Apertium is a toolbox to build open-source shallow-transfer machine translation systems, especially suitable for related language pairs: it includes the engine, maintenance tools, and open linguistic data for several language pairs.
    Downloads: 16 This Week
    Last Update:
    See Project
  • 9
    Free Dictionaries
    Free translating dictionaries. Source format: TEI-P5 XML. Delivery formats: DICT, Stardict, etc. The dictionaries may include information on the pronunciation, etymology and such, in a platform-independent format. Access: web/plugins/standalone.
    Leader badge
    Downloads: 59 This Week
    Last Update:
    See Project
  • Resolve Support Tickets 2x Faster​ Icon
    Resolve Support Tickets 2x Faster​

    Full access to Enterprise features. No credit card required.

    What if You Could Automate 90% of Your Repetitive Tasks in Under 30 Days? At ServoDesk, we help businesses like yours automate operations with AI, allowing you to cut service times in half and increase productivity by 25% - without hiring more staff.
    Try ServoDesk for free
  • 10

    Wordcorr

    Data management for comparative linguistics

    Wordcorr automates the tedious and risky process of tabulating and managing the sound correspondences used in working out the historical development of natural languages. Initial support was from NSF.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 11
    UnsupervisedMT

    UnsupervisedMT

    Phrase-Based & Neural Unsupervised Machine Translation

    Unsupervised Machine Translation is a research repository that implements both phrase-based SMT and neural MT approaches for translation without parallel corpora. The neural component supports multiple architectures—seq2seq, biLSTM with attention, and Transformer—and allows extensive parameter sharing across languages to improve data efficiency. Training relies on denoising auto-encoding and back-translation, with on-the-fly, multithreaded generation of synthetic parallel data to continually refresh supervision signals. The project also provides scripts to fetch and preprocess monolingual data, learn BPE codes, and train cross-lingual embeddings that bootstrap unsupervised alignment between languages. Beyond the core EMNLP 2018 setup, the codebase exposes additional, optional capabilities such as multi-language training, language model pretraining with shared parameters, and adversarial training.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 12
    Fresh Memory

    Fresh Memory

    Flashcards application with Spaced Repetition method

    Fresh Memory is an application that helps to learn large amounts of any material with Spaced Repetition method. The most important subject is learning foreign words, but Fresh Memory can be also used to learn anything else. The learning data is stored as flash cards and dictionaries. The flash cards may have several fields, and the user controls what combination of fields to learn. The flashcards can have formatted text and images.
    Downloads: 5 This Week
    Last Update:
    See Project
  • 13
    Al-Mintiq: Arabic eSpeak

    Al-Mintiq: Arabic eSpeak

    Arabic voice files for eSpeak system

    Arabic files and voices for eSpeak Text to speech system, المنطيق : ملفات اللغة العربية لبرنامج توليد الكلام من النص إسبيك
    Downloads: 13 This Week
    Last Update:
    See Project
  • 14
    LaBB-CAT

    LaBB-CAT

    A linguistic annotation store

    LABB-CAT is a browser-based linguistics research tool that stores recordings and regular-expression searchable text transcripts of interviews. The search results, entire transcripts, and media, can be viewed or exported in a variety of format
    Downloads: 11 This Week
    Last Update:
    See Project
  • 15
    oopinyinguide
    OO Pinyin Guide is a Java extension for OpenOffice 3 or higher. It enables the user to add pinyin transliteration over Chinese characters inside a text document. This tool can be useful for people learning or teaching Chinese.
    Leader badge
    Downloads: 5 This Week
    Last Update:
    See Project
  • 16
    srt-translator

    srt-translator

    Subtitle translator from one natural language to other.

    Translating subtitles in format SubRip from one natural language to other. It is based on Google Translate without API and therefore without payment. Translator have automatic and manual spell checkers.
    Downloads: 8 This Week
    Last Update:
    See Project
  • 17
    The Dictionary System
    The application Dictionary System (DS) is a web application designed for creation of one-way bilingual dictionaries or encyclopaedias offering a working environment for creation of a dictionary and a web page which enables the general public to search in the dictionary. It is so-called DWS application (Dictionary Writing System) or DPS (Dictionary Production / Publishing System). Aplikace Dictionary System (dále DS) je webová aplikace. Je to tzv. DWS aplikace (Dictionary Writing System) nebo také tvz. DPS (Dictionary Production/Publishing System). Aplikace Dictionary System nabízí pracovní prostředí pro tvorbu jednosměrných dvojjazyčných slovníků nebo encyklopedií a webové stránky, které umožňují vyhledávat ve slovníku široké veřejnosti.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 18

    NetBeans Dictionaries

    Additional dictionary files for the NetBeans spellchecker.

    Additional dictionary files for the NetBeans spellchecker.
    Downloads: 4 This Week
    Last Update:
    See Project
  • 19
    TikZ-dependency

    TikZ-dependency

    A LaTeX library to draw all sorts of dependency trees and graphs

    TikZ-dependency allows you to draw dependency graphs in LaTeX documents with little or no effort. The package has a very easy to learn, high level interface that can be used to draw simple dependency trees, complex non projective graphs, bubble parses, and in general any kind of graph which is based on a sequence of nodes and edges among these. It is based on PGF/TikZ and it can be used either with latex or pdflatex. It comes with a very comprehensive documentation that will get you started in 10 minutes, even without any prior knowledge of TikZ. It also provides a lot of styling facilities, to let you personalize the look and feel of the graphs at your liking.
    Downloads: 1 This Week
    Last Update:
    See Project
  • 20
    Mani
    Coptic - English and Coptic - Czech dictionary related to Crum's coptic dictionary, written in C++, based on MySql, with Qt GUI. Is developed as part of project Marcion, containing only coptic data without study environment.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 21
    ATTENTION! Morfologik is now at GitHub: https://github.com/morfologik/
    Downloads: 2 This Week
    Last Update:
    See Project
  • 22
    The Open Translation Engine (OTE) is a web-based translation dictionary manager. The OTE allows a community of users to create and manage one or many translation dictionaries. The OTE is written in PHP and uses a MySQL database.
    Leader badge
    Downloads: 2 This Week
    Last Update:
    See Project
  • 23
    AzConvert is an open source program to convert different scripts of Azerbaijani language (Latin, Arabic and Cyrillic) to each other. It's written in Qt.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 24
    Core program and associated utilities for building a machine translation system using the Example-Based paradigm, where previously-translated text is used to infer new translations of previously-unseen text.
    Downloads: 2 This Week
    Last Update:
    See Project
  • 25
    Thinknowlogy

    Thinknowlogy

    The world's only naturally intelligent knowledge technology

    Natural intelligence is the utilization of naturally occurring logic. This naturally occurring logic provides concrete clues for organizing natural objects, like: - Grouping objects that belong together, - Separating objects that don't belong together, - Archiving objects that have become less important. Natural language and spatial information are sources of natural intelligence: - Natural language is providing concrete logic for organizing knowledge objects, - Spatial information provides concrete logic for organizing spatial objects (utilized in, e.g., self-driving cars). In this way, our brains know how to organize their knowledge and spatial information. I focus on natural language because this source of natural intelligence is hardly understood by scientists. Hence, the inability of Large Language Models to organize changes in their knowledge independently.
    Downloads: 2 This Week
    Last Update:
    See Project
  • Previous
  • You're on page 1
  • 2
  • 3
  • 4
  • 5
  • Next