Data Preparation and Import
Loading a File
Click the Open File button on the launcher screen and select a file. To use sample data, choose from the "Sample Data" section on the launcher screen. See Getting Started for detailed steps.
Supported File Formats
MIDAS supports four file formats: CSV, TSV, MDS, and ZIP.
CSV (Comma-Separated Values)
The most common data format. Columns are separated by commas (,). File extension is typically .csv.
TSV (Tab-Separated Values)
A file format where columns are separated by tab characters. File extension is typically .tsv or .txt.
MDS (MIDAS Project File) MIDAS's native project file format. Contains datasets, analysis settings, and reports.
ZIP Archive files containing CSV or TSV files. Multiple files can be loaded at once.
Excel files (.xlsx) cannot be loaded directly. Save your spreadsheet as CSV from Excel's "Save As" menu.
Character Encoding UTF-8 encoding is supported. When saving CSV from Excel, select "CSV UTF-8 (Comma delimited)" format.
File Structure
MIDAS reads the first row as column names and subsequent rows as data.
- Row 1: Column names (header row)
- Row 2 onwards: Data rows
Example:
Name,Age,Country
Alice,25,USA
Bob,30,Japan
Charlie,28,UK
Data Types
MIDAS automatically determines data types when loading.
boolean
Boolean values represented by true/false, 1/0, yes/no, y/n, etc.
int64 (integer)
Numbers without decimal points (e.g., 1, 42, -10).
float64 (floating point)
Numbers with decimal points (e.g., 3.14, 0.5, -2.71).
date
Date data (e.g., 2025-11-17, 2025/11/17).
datetime
Data including both date and time (e.g., 2025-11-17 14:30:00).
string Text data that does not match any of the above types.
enum Categorical data with a fixed set of values. Cannot be automatically inferred; you create them manually. First, create an Enum definition from Data > Enum Definitions, setting the values and their order. Then use Column Type Conversion to convert a string column to the Enum. Since value order can be defined, enum is suitable for ordinal scale data.
Data types are displayed in column headers as Age (int64). If a data type is not correctly determined, use Column Type Conversion to fix it. The original dataset is not modified; conversion results are created as a new dataset.
Measurement Scales
MIDAS automatically assigns a measurement scale (Nominal, Ordinal, Interval, Ratio) to each column. Measurement scales affect the available graph types and statistical methods. Right-click a column in the Data Table to change its scale.
See Data Types and Measurement Scales for what each scale means and how it affects analysis.
Next steps
- Data Table - View, filter, and sort your loaded data
- Creating Graphs - Visualize your data
See also
- Data Types and Measurement Scales - Statistical meaning of measurement scales
- Advanced Graph Creation - Flexible visualization with Grammar of Graphics
- Sample Datasets - Practice data