# Data

Each row of the dataset describes an investment vehicle at a certain date.

Here follows a concise description of the columns of the files:

`X_train`:

* `date`: A sequentially increasing integer representing a date. Time between subsequent dates is a constant, denoting an unknown but fixed frequency at which the data is sampled. The initial training dataset is composed of 268 dates.&#x20;
* `id`: A unique identifier representing the investment vehicle at a given date. Note that the same asset has a different `id` at each date.
* `feat_1, ..., feat_n`: Anonymized features describing an investment vehicle at a given date.

`X_test`:

* Same structure as `X_train` but comprises only a few dates. This file is used to simulate the submission process locally via `crunch.test()`, or `crunch test` (if using the CLI). A successful local test usually means no errors during execution on the submission platform.

The dataset is obfuscated.

### Files

* **X\_train.parquet**

<figure><img src="/files/0VZzJsBB4rkD9vsjrVVi" alt=""><figcaption></figcaption></figure>

**y\_train.parquet**

<figure><img src="/files/Vu9THlaymE285Tm9fbAa" alt="" width="375"><figcaption></figcaption></figure>


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://crunchdao-1.gitbook.io/quant-venture-capital-documentation/the-tournament/data.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
