Skip to content

Scripts for processing and generating animal advocacy datasets, including synthetic feedback generation. Originally used to create datasets now merged into the Animal Advocacy Facts collection on HuggingFace.

License

Notifications You must be signed in to change notification settings

Open-Paws/data-processing-scripts

Repository files navigation

Data Processing Scripts

Overview

This repository contains scripts for processing and generating datasets relevant to animal advocacy AI development. These tools help create, clean, and validate data used in training AI systems aligned with animal advocacy values. A key component is the synthetic feedback generator, which was used to create part of the Animal Alignment Feedback Dataset.

Note: The datasets processed by these scripts have now been incorporated into a much larger and more comprehensive dataset: Continued Pre-Training.

About

Scripts for processing and generating animal advocacy datasets, including synthetic feedback generation. Originally used to create datasets now merged into the Animal Advocacy Facts collection on HuggingFace.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published