In this lesson, we will take a look at creating an empty pipeline. First, let's import the Pipeline class:
from zipline.pipeline import Pipeline
In a new cell, let's define a function to create our pipeline. Wrapping our pipeline creation in a function sets up a structure for more complex pipelines that we will see later on. For now, this function simply returns an empty pipeline:
def make_pipeline():
return Pipeline()
In a new cell, let's instantiate our pipeline by running make_pipeline()
:
my_pipe = make_pipeline()
Now that we have a reference to an empty Pipeline, my_pipe
, let's run it to see what it looks like. Before running our pipeline, we first need to import run_pipeline
, a research-only function that allows us to run a pipeline over a specified time period.
from zipline.research import run_pipeline
Since we will be using the same data bundle repeatedly in this tutorial, we can set it as the default bundle to avoid always having to type the name of the bundle in each call to run_pipeline
:
from quantrocket.zipline import set_default_bundle
set_default_bundle("usstock-learn-1d")
{'status': 'successfully set default bundle'}
Let's run our pipeline for one day (2010-01-05) with run_pipeline
and display it.
result = run_pipeline(my_pipe, start_date='2010-01-05', end_date='2010-01-05')
A call to run_pipeline
returns a pandas DataFrame indexed by date and security. Let's see what the empty pipeline looks like:
result
date | asset |
---|---|
2010-01-05 | Equity(FIBBG000C2V3D6 [A]) |
Equity(QI000000004076 [AABA]) | |
Equity(FIBBG000BZWHH8 [AACC]) | |
Equity(FIBBG000V2S3P6 [AACG]) | |
Equity(FIBBG000M7KQ09 [AAI]) | |
... | |
Equity(FIBBG011MC2100 [AATC]) | |
Equity(FIBBG000GDBDH4 [BDG]) | |
Equity(FIBBG000008NR0 [ISM]) | |
Equity(FIBBG000GZ24W8 [PEM]) | |
Equity(FIBBG000BB5S87 [HCH]) |
7841 rows × 0 columns
The output of an empty pipeline is a DataFrame with no columns. In this example, our pipeline has an index made up of all ~8000 securities (truncated in the display) for Jan 5th, 2010, but doesn't have any columns.
In the following lessons, we'll take a look at how to add columns to our pipeline output, and how to filter down to a subset of securities.
Next Lesson: Factors