Create Your Own Dataset with Interzoid
Generate custom datasets using AI and specialized algorithms for any use case
Starting with a text file, you can now create any dataset you can imagine. Using
our AI-powered data platform, you simply describe the output parameters you want to be appended to the values
in your text file based on a topic that you describe, and like magic your new dataset is created and ready for use.
For example, if your input file is a list of country names, you could add parameters of "population", "capital", "leader", and
"largest export" along with a topic of "information about countries" (this helps the AI) into the form. That's it! Click "Run"
and watch data generated and appended to your initial input file at high-speeds using our parallel-processing AI-driven platform.
These created values are added as new columns to each value in the input file.
You can then download the results and use as-is or import into your favorite tools such as Excel, Google Sheets,
a database, or a CRM package.
Geographic data, financial data, healthcare data, scientific data, sports data, anything
you can think of, you can now quickly generate at will.
Follow the steps below to begin generating your custom data.
1Obtain an API Key
To begin, you must obtain an Interzoid API key. Log in to your account or register for one at www.interzoid.com. Once registered, your API key will be used to authenticate each API call, one API call per input value record, and for keeping track of usage (one api call = one record = one credit).
2Configure Output Parameters
On the left sidebar of the "Create Your Own" tab, you can define the structure of your output data:
- Each parameter you add represents a column that will be generated in your output dataset
- You can add up to 10 output parameters by clicking the "Add Parameter" button
- Name your parameters descriptively (e.g., "email", "phone", "address")
- Remove parameters with the delete icon (must maintain at least one parameter for output)
3Describe Data to Create
In the main panel's first section, enter a detailed description of the data you want to generate in the "Topic" field. This is important for the AI, and can be experimented with to get the desired results. Be specific about what kind of data you need with respect to each parameter you defined in the left panel, as this will be as the basis for generating your custom dataset.
This powerful feature leverages AI, specialized algorithms, and extensive knowledge bases to create and generate virtually any dataset you can imagine.
4Configure Input Parameters
The tool requires a CSV or TSV file that includes the lookup values that will be used to create the output dataset.
Configure the following settings in the main panel's second section:
- Column Number: Specify which column in your input file contains the lookup values that will be used as the basis for generating your output data. If there is only one column, just use a value of "1".
- Algorithm: Set to "latest" by default, ensuring you're using our most current AI algorithms
- Input File Format: Choose between CSV (comma-separated values) or TSV (tab-separated values) format
- File Source: Select "Web URL" to use a remotely hosted file or "Local File" to upload one from your computer
- Input File URL/Path: Enter the URL for web-hosted files (ensure it is a raw text file with no HTML) or use "Choose Input File..." to upload a local file
- File Output Type: Select the format for your output file (CSV or TSV)
- Show All Columns in Output: Check this option to include all original input columns (if there is more than one) in the output file alongside the generated data
5Run the Job
Once you have entered your API key and configured all settings, click the Run button to execute the data generation process. The system is built on a parallel processing platform enabling up to a hundred records to be processed simultaneously, delivering lightning speeds. Note that because of multi-node parallel processing architecture, output records will be in a different order than the input file.
A spinner will indicate that the job is processing, and the generated data for each record will be displayed in the results window upon completion.
6Review and Download Results
The output is shown in the dedicated results window that supports horizontal scrolling for long output. You can review the results and click the Download Results button to save the generated dataset for additional use.
Below the results, you'll also see the "Batch API Call Utilized" section which shows the API call that was used (with sensitive information masked). This enables the processing to occur as part of a script, from a command line, as part of a data pipeline, ETL/ELT processes, etc.
Note: Be sure to have your input files in the proper standard format required for CSV and TSV files.
CSV Text Qualifiers: If any of your data field values contain the delimiter character of a comma, you should enclose that particular field in double quotes (e.g., "John, Doe, Inc.") per industry standards. This ensures that the delimiter inside the quotes is treated as part of the data rather than a separator, which could otherwise cause issues reading data. Export functions of various applications will typically enforce this format. If you are running a file with only a single column of input data, the TSV format is recommended since the entire line will be read as a column, avoiding the issue of unquoted data that contains commas.
Need help? Contact support@interzoid.com for assistance.
Congratulations! You are now ready to use Interzoid's "Create Your Own" data generation tool to create custom datasets for any application or use case.