A system for quickly generating training data with weak supervision
-
Updated
May 2, 2024 - Python
A system for quickly generating training data with weak supervision
The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.
skweak: A software toolkit for weak supervision applied to NLP tasks
A machine learning tool for automated prediction engineering. It allows you to easily structure prediction problems and generate labels for supervised learning.
Pure Python, lightweight, Pillow-based solver for Amazon's text captcha.
Augmentation pipeline for rendering synthetic paper printing, faxing, scanning and copy machine processes
Augmenty is an augmentation library based on spaCy for augmenting texts.
Natural Language Data Augmentation Tool for Conversational Systems
Generating training data from the Carla driving simulator in the KITTI dataset format
Collection of casual conversations that can be used with the Rasa Stack
COVID-19 Coughs files for training AI models
Convert all files in git repository to .txt files. Useful for training LLMs on your codebase.
Full resources supporting the publication "A Pragmatic Guide to Geoparsing Evaluation."
🔎 Classification helper for sex classification feature of InstaPy
Interactive Weak Supervision: Learning Useful Heuristics for Data Labeling
PyTorch reimplementation of computing Shapley values via Truncated Monte Carlo sampling from "What is your data worth? Equitable Valuation of Data" by Amirata Ghorbani and James Zou [ICML 2019]
Machine Learning project aimed at converting images into .obj 3D models by representing them as Blender hair-type particle systems.
A command line interface to combine text information from subtitles with voice data in the video. Provides a convenient way to generate training data for speech-recognition purposes.
Benchmarking tools for applying AI/ML to data assimilation
A simple implement of TransE, the ML algorithm published in 2013
Add a description, image, and links to the training-data topic page so that developers can more easily learn about it.
To associate your repository with the training-data topic, visit your repo's landing page and select "manage topics."