catalyst

mirror of https://github.com/wassname/catalyst.git synced 2026-07-01 01:58:43 +08:00

Author	SHA1	Message	Date
Scott Sanderson	ef4f642e62	ENH: Compute engine architecture for FFC API. This patch lays the groundwork for a compute engine designed to facilitate construction of factor-based universe screening and portfolio allocation. It contains: A new module, `zipline.modelling`, containing entities that can be used to express computations as dependency graphs. Each node in such a graph is an instance of the base `Term` class, defined in `zipline.modelling.term`. Dependency graphs are executed by instances of `FFCEngine`, defined in `zipline.modelling.engine`. A new module, `zipline.data.ffc`, containing loaders and dataset definitions for inputs to the modelling API. New `TradingAlgorithm` api methods: `add_factor`, and `add_filter`. These methods can only be called from `initialize`, and are used to inform the algorithm that each day it should compute the given terms. Computed factor results are made available through a new attribute of the `data` object in `before_trading_start` and `handle_data`. Computed filter results control which assets are available in the factor matrix on each day.	2015-07-29 12:30:46 -04:00
jfkirk	b84ac01cbf	ENH: Adds futures trading and asset management logic to TradingAlgorithm and performance classes	2015-06-11 11:35:49 -04:00
Eddie Hebert	0fa44471be	MAINT: Change expected type of treasury curves from load to DataFrame. Instead of converting the curves back and forth from dictionaries to DataFrame and back, use the DataFrame format when passing to environment.	2015-04-20 10:26:09 -04:00
Benjamin Berman	ef598c7130	BUG: Handle a ValueError on from_csv calls The cached market data could be corrupted. Pandas raises a ValueError in that case, and this error handles it.	2015-04-14 12:40:37 -04:00
Scott Sanderson	885db87dea	MAINT: Use logger instead of printing in loader.py Makes it easier to filter logs when they're not desired.	2015-04-14 12:40:37 -04:00
warren-oneill	b62fadc76f	adding NYSE trading_day and trading_days as default in load_market_data()	2015-04-08 16:57:23 -04:00
warren-oneill	aa872afdf4	adding updates from master	2015-04-08 16:57:12 -04:00
warren-oneill	49c168b3d0	adding trading_day and trading_days as variables to load_market_data	2015-04-08 16:56:13 -04:00
Jonathan Kamens	e942275108	STY: Flake8 Upgrade the version of the flake8, pep8, and mccabe PyPI packages, and make the code changes necessary for compatibility with the updated packages.	2015-03-19 17:21:25 -04:00
Jonathan Kamens	c46a3afa3c	BUG: Don't download benchmarks / treasury curves unnecessary Fix an off-by-one error which was causing us to download the benchmark and treasury curves over and over again even when they weren't needed.	2015-03-08 09:31:50 -04:00
Luke Schiefelbein	1542b41fbd	BUG: Fix price caching for tickers with '/' char On Ubuntu (assume this is true for all posix) tickers containing a slash char ("CRD/A", "BRK/A", both valid tickers with yahoo api accessible timeseries) lead to a path error in loader.py line 286.	2014-11-19 11:26:27 +01:00
Thomas Wiecki	820115f7be	MAINT: Replace iterkv with iteritems. iterkv is being deprecated as of pandas 0.14.	2014-10-22 17:25:37 +02:00
twiecki	4bdecd6402	STY: PEP8 fixes.	2014-03-26 20:46:20 +09:00
Eddie Hebert	71cda461c5	BUG: Fix check for cached public data for Python 2.7 Python 2 and 3 throw different exception types when a file does not exist. Catch both exception types to trigger the download, so that the loader works under both Python versions.	2014-01-07 17:19:16 -05:00
Eddie Hebert	46ab748dd2	MAINT: Use pandas for data cache file I/O The compatibility between the two versions was made easier by letting pandas handle the heavy lifting, so pass filenames to the pandas serialization methods, instead of dealing doing the file handling and reading/writing within the data module.	2014-01-07 12:01:08 -05:00
Eddie Hebert	b4959e46cf	MAINT: Use six for Python 3 compatible names and behavior. Use the six module to import functions and types that are consistent between Python 2 and 3, so that one code base can support both versions. - Use integer types instead of int and long. - Use string_types instead of basestring. - Account for iteritems, itervalues, iterkeys. - Use six.moves for filter and zip, reduce - Use compatible bytes for md5 hasher. - xrange and range	2014-01-07 11:33:50 -05:00
Eddie Hebert	54ddd1c109	MAINT: print function clean up in preparation for Python 3 - Use `print()` function for all print calls - Fix strip and format calls that were on the outside of the print function for some reason. (Which were breaking in Python 3 because of print returning None.) - Remove commented out print calls.	2014-01-04 20:55:43 -05:00
David Stephens	e45528458f	ENH: Added functionality to download Canadian treasury curves. Added automatic switching of treasury curves based on index sent to environment.	2013-12-27 13:27:43 -05:00
Eddie Hebert	50800a9863	BUG: Fix data cache filepath on Windows. Prevent the ':' char, generated by converting a datetime to a string, from creating on incompatible filepath for Windows.	2013-11-18 20:37:45 -05:00
Eddie Hebert	43b85cffb0	MAINT: Calculate tradingcalendar with days beyond the current day. To make 'next open' calculations more straight ahead, calculate more than enough days in the trading calendar.	2013-11-11 15:48:44 -05:00
Eddie Hebert	797cb8ece3	BUG: Fix bad reference to benchmark timezone in loader.	2013-11-11 14:39:11 -05:00
Eddie Hebert	89793e371c	MAINT: Protect loader against Series saved with no tz. Checking for tz.UTC is not sufficient, since it is possible for the index.tz value to be None.	2013-11-11 14:17:14 -05:00
Eddie Hebert	c45c1a22e1	BUG: Only localize benchmark index if it is naive. Check for whether or not the index's timezone is UTC or not before attempting to localize, since an already localized index throws an error when tz_localize is called.	2013-10-29 13:17:58 -04:00
Eddie Hebert	2d64ab8bfe	BUG: Fix naive timestamps in benchmarks. Always convert the benchmarks to UTC, not just on reload.	2013-10-29 08:36:53 -04:00
Eddie Hebert	37c56b9aa4	MAINT: Use Series throughout for daily returns. Remove the lists of DailyReturn objects in favor of using pd.Series to store the return values. Should make it easier to inspect the values when stepping through, make the windowing of data to a certain range more facile by using, and have some performance increases due to removing object creation and member access.	2013-10-19 23:06:18 -04:00
Eddie Hebert	71f03e9537	BUG: Ensure loading benchmarks include latest dates. The Series `.append` does not update in-place, assign the value to `saved_benchmarks` so that we update the newest benchmarks.	2013-10-07 12:17:26 -04:00
Eddie Hebert	6ac5d49573	MAINT: Remove duplicated treasury loading code. The dump and update of curves were both using the entire history. So instead of having the update use a different code path, always use dump and overwrite.	2013-10-02 11:10:15 -04:00
Eddie Hebert	5ddc134379	ENH: Cache daily data to eliminate repeat network calls. Both unit tests and repeated runs while developing an algorithm can benefit from having a local copy of the Yahoo data, instead of doing a network call each time. Store the web request results as a csv file in a cache directory, named by symbol and date range.	2013-10-01 15:04:02 -04:00
Eddie Hebert	b44fc20e4e	MAINT: Remove msgpack as a dependency. Now that the data serialization uses pandas, msgpack is no longer needed.	2013-10-01 14:28:11 -04:00
Thomas Wiecki	a66f45b598	MAINT: Moving yahoo loader from factory to utils.	2013-10-01 14:09:26 -04:00
Eddie Hebert	b65f7f42c0	BUG: Fix updating treasury curves. A transpose back to the serialization shape was left out. Also, fixes empty return from update.	2013-10-01 11:57:04 -04:00
Eddie Hebert	956107a846	MAINT: Use pandas instead of msgpack for benchmarks and treasuries. Instead of writing our own serialization using msgpack, leverage the csv serialization provided by pandas. Also, lessens the need for msgpack and functions in date_utils.	2013-09-30 11:27:35 -04:00
Eddie Hebert	a968b5827c	MAINT: Use print function instead of print statement. The loader module printed some warning messages, these could be changed to use a logger, but for now convert to use the print function for compatibility with Python 3.	2013-07-02 21:33:40 -04:00
Eddie Hebert	4f5b2d6298	MAINT: Change relative library imports to use dot syntax. Testing with a Python 3 virtualenv uncovered more relative imports that did not explicitly use the dot syntax.	2013-07-02 21:31:59 -04:00
Eddie Hebert	158988d184	MAINT: Use explicit syntax for relative imports. Python 3 requires using dot syntax for relative imports, otherwise the import is treated as an absolute import, i.e. an import of a module from outside of the project. By using dot syntax now, imports should be compatible with both Python 2.7 and Python 3.	2013-07-02 15:54:12 -04:00
Richard Frank	2dbafd5162	BUG: Zero out the microsecond attribute of datetimes wherever we zero out the second attribute. Otherwise, we can be off by some microseconds from midnight, etc.	2013-04-15 10:44:44 -04:00
Eddie Hebert	05a03bcf21	BUG: Fix error during benchmark update over empty period. On ranges with missing data from Yahoo, e.g.: On 2013-04-2 the date range of April 2013-03-29 failed because of the first day in the range being Good Friday, and the API not yet updating for the Monday after. Handle the 404 that is found by raising and warning that no benchmark data was found, but continuing on.	2013-04-02 11:13:26 -04:00
Eddie Hebert	718ce0270a	ENH: Improves handling of edge case in data update. If the trading_days end date is not greater than the date being tested, (this can happen if the algorithm's end date is set to a date that is before the latest date available saved in the msgpacks), then trying to get the location will fail, instead searchsorted will get the lastest date available in the trading day map to use as a test date.	2013-03-04 21:54:11 -05:00
Ryan Day	4170c05fb6	ENH: Update msgpack files if new data is available.	2013-03-04 21:54:05 -05:00
Richard Frank	ebdb5429aa	MAINT: Moved DailyReturn to protocol module to break circular references and removed code that solved that same problem with conditional imports.	2013-03-01 16:05:39 -05:00
fawce	2db01b3935	created a new generator-backed trading environment for testing. other details: - also fixed grammatical errors in loader's status messages. - converting the treasury curves to an ordered dict. - moved to using a lambda for clarity as per @ehebert - initializing calendar end dates to be midnight of current date in - US/Eastern. Yahoo data isn't available until midnight eastern.	2013-02-18 23:35:38 -05:00
fawce	a4a4d38a73	TradingEnvironment allows the specification of a benchmark index and a local timezone for the exchange. This commit adds tests to verify the TradingEnvironment properly handles London Stock Exchange index, FTSE. - added LSE reference rrules calendar (thanks to Edward Johns) - added tests to verify LSE environment matches rrule calendar - added a test to verify global environment behavior can be set. - moved DailyReturn class to trading to eliminate circularity from risk <-> trading. - updated TradingEnvironment to be a context manager. This allows users to run algorithms in individually isolated environments in one python process. This is useful for managing multiple algorithms in a single ipython notebook. - added comments to explain behavior and useage of the global environment	2013-02-18 10:24:32 -05:00
fawce	2c7355a0dc	Refactoring of TradingEnvironment to isolate the global state: index symbol and exchange timezone. Parameters that define the simulation (start, end, and capital base) were put in a new class, SimulationParameters. Global state for the financial simulation environment is accessed through the zipline.finance.trading module, which now contains a module variable: environment. Parameters are passed into an algorithm as a keyword argument, sim_params. SimulationParameters creates a trading day index for the test period that can be used to find trading days, calculate distance between trading days, and other common operations. The sim params index is just selected from the global state. ================ Details: - adding delorean to the requirements. - made index symbol a parameter for loading the benchmark data. changed messagepack storage to be symbol specific. - ported risk, performance, algorithm, transforms, batch transforms and associated tests to use simulation parameters and global environment - factory and sim factory use global state and sim params - factory method parameter names now reflect the class expected	2013-02-18 10:24:32 -05:00
Eddie Hebert	8a0d6e72cd	Changes the location of downloaded external data files. So that the zipline library can be used when installed to a write-protected location, e.g. the global site-packages, moving the download files to a directory in the user's path, which should be writeable. For now, choosing a ~/.zipline/data location.	2012-11-08 22:51:03 -05:00
Eddie Hebert	7904773d00	Updates flake8 to latest. The latest flake8 release in now 1.5, which pulls in pep8: 1.3.4a0 The upgrade pep8 has changes to what it picks up as lint. Making code base compatible, so that new devs can install pep8 from PyPI and not have friction over the version difference. Currently using these ignores in the config file: ``` [pep8] ignore = E124,E125,E126 ``` Ignoring these since they are difficult to squash while maintaining an 80 char line length, and appear spurious. Should address later. Updates Travis config, README, and pip requirements to reflect change.	2012-10-22 11:57:16 -04:00
Eddie Hebert	0ab136f1c8	Adds a loader for market data when it doesn't exist locally. Hopefully, this helps ease ramp up time for developing against market data, without us distributing the data. We do a check for the data when attempting to read the msgpack files, if they don't exist the loader makes a web request and retrieves and serializes the data for the user. Provides a loader for: - curves from data.treasury.gov - benchmarks from Yahoo! Finance Adds dependency of requests library in dev requirements.	2012-10-19 11:19:14 -04:00

46 Commits