catalyst

mirror of https://github.com/wassname/catalyst.git synced 2026-07-04 10:27:45 +08:00

Author	SHA1	Message	Date
Eddie Hebert	314b22656f	MAINT: Remove left over simple transform code. Remove pieces that are no longer used now that the simple transforms are wrappers around history via the SIDData object. Move window length related pieces into batch_transform, since the rest of the utils module is no longer used.	2015-03-30 14:18:39 -04:00
Jonathan Kamens	e942275108	STY: Flake8 Upgrade the version of the flake8, pep8, and mccabe PyPI packages, and make the code changes necessary for compatibility with the updated packages.	2015-03-19 17:21:25 -04:00
Joe Jevnik	e3d8b1034e	ENH: Replaces the simple transforms with history calls. Switches transforms to quantopian syntax. Adds the sid attribute to the siddata so it is aware of which security it represents.	2014-11-17 13:16:30 -05:00
Scott Sanderson	235954d480	DEV: Overhaul core history logic. Overhaul the core HistoryContainer logic to be more robust to changing universes. Major Changes ------------- * Remove `return_frame` cache. The original purpose of using return_frames was to avoid having to create new DataFrames on each iteration of handle_data, but we ended up having to copy the return frames anyway because user code could mutate the frames in place. Removing the return_frames reduces unnecessary copying, and reduces the logic of `get_history` to just forward-filling and concatenating two DataFrames. * Use a `MultiIndex`ed DataFrame to represent `last_known_prior_values`. This makes lookups faster and greatly simplifies the logic of adding and dropping sids. * HistoryContainer no longer attempts to determine its universe based on the contents of its internal buffers. The TradingAlgorithm controlling the container is now responsible for explicitly calling `add_sids` or `drop_sids` when securities enter or leave the algorithm's universe. These methods, along with the internal `_realign` method, provide a clean interface for changing the universe of securities managed by the container. * Refactor index mutation logic in `RollingPanel` into a `MutableIndexRollingPanel` subclass. Maintenance of the old behavior is regrettably necessary to support `BatchTransform`. * Refactor shared logic from `roll` and `get_history` into a single `aggregate_ohlcv_panel` method that's responsible for collapsing an OHLCV buffer into a frame.	2014-09-29 14:42:57 -04:00
Eddie Hebert	7eb1d719ed	MAINT: Check attributes instead of contains for event fields. In support of source that emits a subclass of Event which defines some fields as properties instead of doubling the value in the `Event.__dict__` Use hasattr instead of the overridden __contains__ method of the Event class, so that when non-algorithm facing code checks for field existence, properties count. Intentionally not touching the `__contains__` in Event, to avoid changing, at the moment, any algo behavior that relies on the `__contains__` behavior's use of `__dict__`	2014-09-08 11:09:41 -04:00
Scott Sanderson	d02dd972d5	BUG: Fix crash in transforms on malformed CUSTOM events. Fixes a crash in various transforms when providing CUSTOM events whose fields don't match the fields required for the transform. This is fixed by requiring all `EventWindow` subclasses to supply a `fields` property, which returns a list of strings that are required keys for any event that can be processed by the window. Any CUSTOM events the don't supply the required fields for a transform window are ignored by that window.	2014-08-13 12:54:29 -04:00
twiecki	4bdecd6402	STY: PEP8 fixes.	2014-03-26 20:46:20 +09:00
Eddie Hebert	4860a966b3	REL: Update copyright year on all files changed since the new year.	2014-03-07 22:31:41 -05:00
Eddie Hebert	a203f69635	PERF: Remove alias_dt transform in favor of property on SIDData. Adding a copy of the Event's dt field as datetime via the `alias_dt` generator, so that the API was forgiving and allowed both datetime and dt on a SIDData object, was creating noticeable overhead, even on an noop algorithms. Instead of incurring the cost of copying the datetime value and assigning it to the Event object on every event that is passed through the system, add a property to SIDData which acts as an alias `datetime` to `dt`. Eventually support for `data['foo'].datetime` may be removed, and could be considered deprecated.	2014-03-07 10:55:59 -05:00
Eddie Hebert	5dab18613b	BUG: Prevent crash in vwap transform due to non-existent member. The WrongDataForTransform was referencing a `self.fields` member, which did not exist. Add a self.fields member set to `price` and `volume` and use it to iterate over during the check.	2014-02-20 17:33:30 -05:00
Eddie Hebert	dd61ed4e25	BUG: Check required fields for stdev transform. Move the assert method to the EventWindow subclass, to be consistent with other transforms; also, add the check to `handle_add`.	2014-02-20 17:29:25 -05:00
Eddie Hebert	c139afb6a9	STY: Indentation fix in stddev.	2014-02-20 17:23:37 -05:00
Eddie Hebert	9feed61748	MAINT: Use more exact boolean checks for pandas structures. To prepare for being compatible with pandas==0.13.0, which is stricter on boolean checks.	2014-01-29 17:13:13 -05:00
Eddie Hebert	8481e2df49	MAINT: Use Python 3 compatible metaclass. Use six's with_metaclass to have objects that use metaclasses, in both Python 2 and 3. Otherwise, in Python 3 the objects were being treated as if they did not have a metaclass, when the Python 2 syntax is used, leading to errors because of missing attributes, etc.	2014-01-07 11:58:01 -05:00
Eddie Hebert	045e2975b7	MAINT: Remove type checking which trip up Python 3 compatibility. Instead of porting these cases of type checking, remove them instead. Slightly more Python-ic to be more generous in what is allowed, and the conversion to make these compatible with Python 3 are more trouble than they are worth.	2014-01-07 11:44:25 -05:00
Eddie Hebert	b4959e46cf	MAINT: Use six for Python 3 compatible names and behavior. Use the six module to import functions and types that are consistent between Python 2 and 3, so that one code base can support both versions. - Use integer types instead of int and long. - Use string_types instead of basestring. - Account for iteritems, itervalues, iterkeys. - Use six.moves for filter and zip, reduce - Use compatible bytes for md5 hasher. - xrange and range	2014-01-07 11:33:50 -05:00
Eddie Hebert	f2dc979fbc	BUG: Make all iteration related methods fo BarData match __iter__ `for s in data` and methods like `for s in data.keys` were not producing the same list of active sids Make the other iteration methods match __iter__ by using the contains method to check whether or not the sid is active. For use of data outside of the algoscript context, which needs access to all data fields use data._data	2013-12-20 00:05:53 -05:00
Jonathan Kamens	73faf9133e	MAINT: Clean up imports of zipline.finance.trading Use "from zipline.finance import trading" instead of "import zipline.finance.trading as trading".	2013-10-29 13:50:14 -04:00
fawce	279605dfb5	BUG: Fix excessive recalcuations in batch. Downsampled batch transforms should only recalcuate on close	2013-10-20 08:06:39 -04:00
fawce	76887e2855	BUG: Fix non-trading advancement of trading day count and downsample. Add a guard so that we do not advance trading day count or downsample on non-trading days.	2013-10-18 14:07:49 -04:00
fawce	0ceabb590b	BUG: Group and aggregate downsampling by trading day. Otherwise, out of market events would result in a mishaped panel.	2013-10-17 22:41:20 -04:00
fawce	3a1ca1ddb2	BUG: Mask of batch_transform columns to match keys in data parameter. The underlying RollingPanel in batch_transform was always accumulating all values to ever appear in data. However, at any given algo time the desired return value is what the current active sids are. Instead, mask down to the sids that are passed in as the data parameter.	2013-10-16 15:16:03 -04:00
Eddie Hebert	0224aeb552	MAINT: Use enviroment normalize_date instead of dt.replace Continue path of removing scattered calls to dt.replace.	2013-10-16 15:14:24 -04:00
fawce	f8ce7d944b	ENH: Add downsampling to BatchTransform. So that with minute data, 2.5 orders of magnitude of data can be cut, allowing for longer window_lenghts, when the daily values are what are desired for a signal.	2013-10-11 16:48:08 -04:00
Eddie Hebert	90d8570f70	MAINT: Remove debug logging about stateful transforms. The noise outweighed the signal with logging before and after each time the transform generator is called.	2013-09-27 15:55:35 -04:00
Eddie Hebert	ddb541b9b3	MAINT: Change shape of supplemental data to match batch data. Expect the same shape of data for the supplemental data, to make working and preparing with the supplemental data consistent with what is passed to the algorithm.	2013-09-16 11:51:22 -04:00
Eddie Hebert	7d5194ec2c	ENH: Include TALib output names in ta transform results. For TALib functions like MACD that have output names, return a DataFrame that for which the columns are the output names of the function. So that when using a TALib function, the algorithm doesn't need to know the index position of the desired result, in favor of using the name of the result. e.g. ``` macd_result['AAPL'][0] ``` becomes, ``` macd_result['AAPL']['macd'] ``` and ``` macd_result['AAPL'][1] ``` becomes, ``` macd_result['AAPL']['macdsignal'] ``` Also, change return type of functions that return floats from a dictionary to a Series, so that the function is always returning a pandas type.	2013-08-19 16:46:11 -04:00
Thomas Wiecki	b89886297f	STY: autopep8 codebase.	2013-08-08 16:46:44 -04:00
Thomas Wiecki	f75d781b02	DOC: Fix talib sphinx warnings. Closes #171 .	2013-08-08 10:50:00 -04:00
Eddie Hebert	e85306524b	BUG: Prevent crashes for TALib functions when stocks have nans. If a stock stops gettign updated values, e.g. if a stock rolls out of a universe strategy, currently the underlying batch transform for TALib may have nans (which is another issue that could be addressed), the nans cause crashes when passed to some TALib function, e.g. Bollinger Bands are incompatible with all nan values. So, drop sids that only have nan values for the current data panel.	2013-08-05 18:17:20 -04:00
Eddie Hebert	376ae33264	PERF: Remove deepcopy from EventWindow ticks. The deepcopy of events into the EventWindow's ticks was causing a significant increase in memory consumption, e.g. an algorithm with almost 200 sids and 14 vwaps removing the deepcopy reduces the amount of memory consumed by about 40%. The downside is that if an event's properties are changed, which is not advised, later on, then the signal derived from vwap etc. may be changed.	2013-07-31 18:55:07 -04:00
Eddie Hebert	0dbdf5b1d3	BUG: Fix duplicated values for multi-stock TALib transform. A multi-stock TALib transform was returning the same value for all stocks, specifically the value for the first stock in the panel. Index into the datapanel using `sid` instead of using the `[0:]` index which was used when only supporting one sid.	2013-07-11 15:44:38 -04:00
Eddie Hebert	fa845cbf03	MAINT: Move batch transform into a dedicated module. Break the BatchTransform class and decorator function into a separate module, with hope that it makes it easier to zero in on batch transform logic.	2013-07-10 17:56:32 -04:00
Eddie Hebert	eac882b773	ENH: Enable TALib transforms to perform on multiple stocks. The TALib transform only supported operating on the first value of a given batch transform panel row. Instead of returning the one value, even if an panel with multiple sids was provided, return a dictionary that maps stock to TALib result.	2013-07-10 14:40:58 -04:00
Eddie Hebert	37352210c0	MAINT: Make TALib zipline_wrapper a module level function. Prepare for making the zipline_wrapper operate on multiple sids, as the needed nested logic will get cramped within the nested function. Also, should help clearly define the inputs of the zipline_wrapper function that are needed before it is passed to the BatchTransform constructor.	2013-07-10 14:38:57 -04:00
Eddie Hebert	15323a8179	BUG: Enable return values on first day using minute TALib functions. Set the `compute_only_full` to False so that the 'is window full' logic is delegated to the TALib's lookback function. If the window is not full to the `timeperiod` or other lookback setting, then TALib returns a `np.nan`. Also, fix the bars/data_frequency not being passed to the BatchTransform init. This further shows need to create a minute test for TALib transforms.	2013-07-09 14:54:38 -04:00
Eddie Hebert	3d8bdeb429	ENH: Enable minute bar data with TALib transform. Add a `bars` keyword arg, as is used with BatchTransform. Also, instead of overwriting the window_length kwarg with timeperiod, always use the lookback value from the created TALib function, as timeperiod will be an input into that value if it exists. Calculate `window_length` in minute mode so that there are enough days to cover the minutes in the timeperiod.	2013-07-09 11:26:26 -04:00
Eddie Hebert	ab0d07d8d0	API: Conform timeperiad to TALib interface instead of BatchTransform's. For the creation of a TALib transform use timeperiod intsead of window_length, to be more in the style of TALib usage, since all TALib functions may not ending up using BatchTransform, so start the practice of adhering to TALib conventions to make porting and explanation easier.	2013-07-03 13:46:32 -04:00
Eddie Hebert	158988d184	MAINT: Use explicit syntax for relative imports. Python 3 requires using dot syntax for relative imports, otherwise the import is treated as an absolute import, i.e. an import of a module from outside of the project. By using dot syntax now, imports should be compatible with both Python 2.7 and Python 3.	2013-07-02 15:54:12 -04:00
Eddie Hebert	4510c56142	MAINT: Use TALib abstract function names value when creating transforms. Instead of using reading the __all__ value and filtering out Function, use the abstract internal variable that lists all functions.	2013-06-28 16:22:22 -04:00
Eddie Hebert	5d76c0b8a4	ENH: Remove need to pass sid to TALib transform constructor. Now that BatchTransforms use RollingPanels under the hood, passing an sid is no longer needed, passing the data will to handle_data will suffice.	2013-06-27 19:12:01 -04:00
Thomas Wiecki	33c23af503	BUG: Do not try to set non-existent items.	2013-06-20 18:32:05 -04:00
Thomas Wiecki	3ea8ac8da2	BUG: Fix updating of trading_days_total in minute. In the batch_transform we were incrementing the trading_days counter if there is a new day event. Thus with a window_length of 1 and daily bars you will update the batch_transform on the first day which is correct. But with minutes you update with the first minute bar of the day which is not correct. This is fixed by calculating the market_close explicity and seeing whether the event.dt is on or past it. I also added a unittest to test the correct behavior of this.	2013-05-16 14:51:19 -04:00
Eddie Hebert	54541eb24c	BUG: Fix short circuit in StateTransform loop. A missing `continue` statement was causing the short circuit on event type to not have the intended effect of skipping the update portion of the loop.	2013-05-15 16:57:09 -04:00
Thomas Wiecki	b87d454938	BUG: Add bar kwarg to batch_transform. Before the change to the RollingPanel, window_length specified the number of days that should be in a window. The previous commit broke this if data was minute resolution. By passing bar='minute' to the batch_transform we internally multiply the window_length by 60*6.5 to have a full day. Also adds a (still rudamentary) test for batch_transform with minute data.	2013-05-13 16:42:58 -04:00
Thomas Wiecki	aa7d5de073	BUG: Fix off-by-one error in TALib wrapper. When setting timeperiod in the talib function it subtracts by 1. We then used this subtracted value to set the window_length in the batch_transform which was then not passing a big enough panel. Ultimately this caused the talib transforms to always return nans. This also makes the unittest more stringent by explicitly comparing the output of the wrapped TALib moving average to pandas rolling_mean(). Finally, this also allows passing of window_length instead of timeperiod to allow usage of the same interface as before.	2013-05-11 15:42:27 -04:00
Thomas Wiecki	c22b86194f	DOC: Add __doc__ string for all talib functions.	2013-05-11 14:41:51 -04:00
Eddie Hebert	5c883659b5	BUG: Fix missing values from supplemental data in batch transform. Adapt the supplemental data fill so that it works with the new shape of the batch transform that was switched to with the rolling panel optimizations.	2013-05-07 18:17:08 -04:00
Eddie Hebert	d3ade5050c	MAINT: Tune logging of StateTransform start/end to debug. These log lines are useful for debugging, but a bit spurious on every algorithm run.	2013-05-07 17:27:06 -04:00
fawce	59dcf0f6cd	exposes stateful transform's window length	2013-05-01 22:05:06 -04:00

1 2 3 4

152 Commits