The 40,000 ingredient dataset offered by Innosol is a proprietary database of fragrance and flavor ingredients designed primarily for use in Artificial Intelligence (AI) and Machine Learning (ML) applications in the flavor and fragrance (F&F) industries.
Key Features of the Innosol Dataset
- Size and Scope: The database contains over 40,000 unique flavor and fragrance ingredients (chemicals).
- Data Points: Each ingredient in the dataset reportedly includes hundreds of detailed data points, such as:
- Aroma and taste profiles (organoleptics).
- Chemical Abstracts Service (CAS) numbers.
- SMILES strings (Simplified Molecular Input Line Entry System).
- Solubility and evaporation parameters.
- Sustainability information.
- Purpose: The dataset is intended to help companies train AI algorithms for tasks like:
- Generating new, unique fragrance and flavor formulas.
- Identifying sustainable ingredient options.
- Optimizing product development and R&D.
- Analyzing market trends and national brand formulas.
- Format: The data is provided as a flat file, making it “Python ready” and compatible with various data analysis and machine learning tools.
- Licensing: Innosol licenses this trade-secret dataset to other companies, positioning it as a foundational tool for R&D in the F&F industry, rather than a publicly available resource.
For more information, contact us.
