catalyst

mirror of https://github.com/wassname/catalyst.git synced 2026-06-29 17:51:22 +08:00

Author	SHA1	Message	Date
Maya Tydykov	757e1c6367	ENH: allow estimates to be split-adjusted. This modificaiton to the estimates loader allows the caller to pass in an equity pricing loader which can then be used to get split data for sids. That split data is then used to do point-in-time adjustments of estimates data. TST: add test for multiple estimates columns TST: add test for multiple datasets requesting different columns TST: add blaze versions for all next/previous tests	2016-11-18 09:39:54 -05:00
Eddie Hebert	4fcf31730c	BUG: Fix minute bar last traded after half day. When the following conditions occur, - a `nan` occurred after a half day (e.g. on the Monday after Thanksgiving, where the Friday would be a half day.) -data was written to the span between the early close and where the market close would have been if it were not an early close session - a `nan` also occured on the last minute of the early market session. the exisitng implementation would incorrectly return a `nan` when requesting a forward filled price. The steps that caused this error were. 1. Request for `'price'` on the market open of the day after the early close. 2. `nan` is found for that minute 3. `get_last_traded_dt` is called, and finds a volume that occurs after the early close. e.g. `18:47` when the market close was `18:00`. 4. The minute position for `18:47` is used, when calling `find_positon_of_minute`, since that value is after the `market_close` the minute is set to the position of `18:00`` due to the delta logic in 5. Since there is also no data in at `18:00`, a `nan` is returned, even though there were valid minutes earlier in the session. e.g. a non-zero volume at `16:47` should have been used, but was not. Fix by checking the current minute against the minute close when searching for the last traded minute. If the minute is greater than the market close for the corresponding day, continue the search until the minute position is within the trading session. This could also be fixed by enforcing that only zeros can be written between an early close and the minute where the close would have been, but this fix allows the reader to work with existing data.	2016-11-15 15:09:19 -05:00
Eddie Hebert	e415c0f350	BUG: Fix continuous future end dates. The end date of the last contract with a sufficient start date was being used for the continuous future overall end date; however the end date of that contract (which is the last day for which there is data for the contract) is not necessarily the greatest end date out of all contracts. It is possible for the furthest out contract to have some, but very few, trades before it is more actively traded. Which would give it a start date within in the range of the simulation, but an end date is earlier than the other contracts which are active during the simulation. This bug would result in `nan`s when getting the current price because of the `end_date` check in `get_spot_value`. When the current simulation time was greater than the `end_date` of the last contract the condition which guards against attempting to get data for an instrument past its end date would return a `nan`, even when the current underlying contract did have data for that date. Use max end date of all contracts instead of the last one, to ensure that the continuous future last date is always great enough to allow access to all contracts with in the chain. Also, use min start date to accurately mirror the end date behavior.	2016-11-09 16:19:19 -05:00
Eddie Hebert	512e62b13e	BUG: Fix bad attribute lookup on session continuous future reader. Use `roll_style` not `roll`. Also, add test case to cover using the session bar reader `get_value`, by adding a test which uses `close`, since only `contract` was being exercised, which does not exercise the session daily bar reader.	2016-11-08 15:48:28 -05:00
Eddie Hebert	25eb13ccff	Merge pull request #1583 from quantopian/allow-sliding-window-to-reset ENH: Allow arbitrary history queries.	2016-11-07 22:31:13 -05:00
Eddie Hebert	e419e20acf	ENH: Allow arbitrary history queries. In preparation for using `DataPortal` in notebooks, remove restriction on the `HistoryLoader` to dates that are monotonically increasing. Notebook usage of the `DataPortal` is more useful when the end of the history window can be arbitrary dates without having to restart the notebook kernel. Due to the implementation of the prefetch and caching logic, the end date of history calls could previously only increase. e.g. `2016-11-01`, `2016-11-02`, `2016-11-03`. This pattern was sufficient for backtesting and live simulations, since the current time of the algorithm only ever increases. With this change, which resets the underlying sliding window when the last fetched idx is greater than the Now calls to history in the same process with end dates such `2016-11-01`, `2016-10-31`, `2015-11-02` should work.	2016-11-07 16:40:51 -05:00
Andrew Daniels	993b694d49	BUG: Allows 'contract' in get_spot_value with daily frequency (#1582 ) Also removes duplicate check in test_current_contract.	2016-11-07 16:28:48 -05:00
Eddie Hebert	abc4f55f64	ENH: Allow configurable history prefetch length. To support using a `DataPortal` and `HistoryLoader` in a notebook, allow the prefetch length to be configurable, so that it can be set to 0. Unlike backtesting where the prefetch is useful for repeated history windows viewed from datetimes which are monotonically increasing by a small amount, the notebook usage of history windows needs only to retrieve the exact data needed for the window specified. This patch also fixes some boundary conditions related to rolls and adjustments which were uncovered by querying for the adjustments with an end date near the end of the window.	2016-11-04 13:30:30 -04:00
Scott Sanderson	1b9b3c1a29	MAINT: Rename `SpecificAssets` to `StaticAssets`. This better reflects the intended usage of accepting a fixed set of predetermined assets.	2016-11-01 11:01:36 -04:00
Joe Jevnik	dd191f4ab2	STY: cleanup	2016-10-28 15:04:18 -04:00
Scott Sanderson	bb2d25e67c	ENH: Allow windows of NumericalExpression. If all the inputs to an expr are window-safe, then the expr is itself window-safe.	2016-10-28 15:04:18 -04:00
Scott Sanderson	582c44a4aa	TEST: Add more populate_initial_workspace tests. - Tests different pipeline lengths and window lengths. - Tests a term that depends on a window of a term that's been precomputed.	2016-10-28 15:04:18 -04:00
Joe Jevnik	d07f133579	STY: remove unused imports and method, clean up docs	2016-10-28 15:04:18 -04:00
Joe Jevnik	af3e1016a0	TST: add tests for postprocess and to_workspace_value	2016-10-28 15:04:18 -04:00
Joe Jevnik	bae4088cc0	ENH: cleanup branch based on feedback	2016-10-28 15:04:18 -04:00
Joe Jevnik	0123bb8a97	ENH: prune the graph based on the initial workspace	2016-10-28 15:04:18 -04:00
Joe Jevnik	e3e4ad2735	ENH: update aliasedmixin repr	2016-10-28 15:04:18 -04:00
Joe Jevnik	c8e40a3736	ENH: Make aliases filters, factors, and classifiers to give them their methods	2016-10-28 15:04:18 -04:00
Scott Sanderson	13c8139d45	WIP: Add Alias expression.	2016-10-28 15:04:18 -04:00
Joe Jevnik	67b35168db	ENH: provide a hook for prepopulating the initial workspace	2016-10-28 15:04:18 -04:00
Eddie Hebert	9a51efc7d2	Merge pull request #1565 from quantopian/fix-offset-history BUG: Fix continuous future history with offsets.	2016-10-28 09:44:34 -04:00
Eddie Hebert	575a8cf048	BUG: Protect against contract offset at end of range. (#1564 ) This boundary case was exposed with internal fixture data which used a continuous future with a contract chain of size one.	2016-10-27 16:48:34 -04:00
Eddie Hebert	4235dbd758	BUG: Fix continuous future history with offsets. Apply offset value when writing out the rolls in a continuous future which is offset from the primary.	2016-10-27 16:23:03 -04:00
Eddie Hebert	aa021531d9	BUG: Use proxy for settlement on future adjustments. Instead of using the difference between the session close of the front contract before the roll and and the open of back contract on the beginning of the roll, use the close of both at the end of the session before the roll. The closes of the session prior to roll is in lieu of settlement data.	2016-10-27 12:40:59 -04:00
Eddie Hebert	f1929342d9	Merge pull request #1556 from quantopian/volume-based-rolls ENH: Volume based rolls for futures.	2016-10-25 15:21:41 -04:00
Eddie Hebert	afbe3cdcd7	ENH: Volume based rolls for futures. Add roll style which takes the volume of the contracts into account. If the volume moves from the front to the back before the auto close date, the roll is put at that session. Also, factors out some of the common logic shared with calendar based rolls.	2016-10-25 14:08:21 -04:00
Eddie Hebert	321afc1626	Merge pull request #1555 from quantopian/return-nan-from-bcolz-daily-reader MAINT: Return nan from daily bcolz get_value.	2016-10-25 11:49:51 -04:00
Eddie Hebert	098d38ac76	MAINT: Return nan from daily bcolz get_value. Match the behavior of the minute bar reader, now that the session and minute bar readers share a common interface. isnull is slightly slower than checking against -1; however, n cases where we check against illiquid trades in a tight loop, volume is checked which is not using nan. The change here should be marginal with regards to performance.	2016-10-25 11:25:09 -04:00
Eddie Hebert	3c7df3e997	TST: Make test dates aligned with test data. Move dates queried near beginning of test data so that the range of data covered does not extend beyond the beginning of the range. i.e. the windows were covering 2016-01-25, which had no test data generated. (Does not matter for the calendar based rolls, but is needed for volume based rolls.) Also, make room for having the first roll to be a day before the first auto close by moving the first contracts auto close date back a day. In preparation for testing volume rolls.	2016-10-25 10:21:23 -04:00
Eddie Hebert	fccbae25ed	BUG: Fix session from minute reader's last traded. The last traded dt provided from the session bar reader which resamples from minutes should provide a dt that is a session label, not one that is at the minute frequency.	2016-10-24 13:58:58 -04:00
Eddie Hebert	a4205a0500	PERF: Speedup minute to session sampling. The minute to session sampling reading was creating two DataFrame objects, the first to hold the minute data, and then a second returned by the `DataFrame.groupby` to sample down to sessions. Instead use the arrays returned by the minute readers `load_raw_arrays` and implement sampling logic which takes advantage that the minutes being passed start with the first minute of the first session and end with the last minute of the last session. On my machine this takes the tests in `test/test_continuous_futures` from ~4.0 to about ~0.1 seconds.	2016-10-24 09:59:22 -04:00
Maya Tydykov	e1f008edcd	TST: update adjustment tests - add gaps between sids TST: add a seed for permuting	2016-10-21 16:53:56 -04:00
Eddie Hebert	5b425d54d0	ENH: Add adjusted history for continuous futures. Add `.adj('mul')` and `.adj('add')` methods on ContinuousFuture, which when used with `history`, will calculate and apply adjustments so that the values are adjusted to account for discounts and premiums during rolls. Example usage in an algo: ``` from zipline.api import continuous_future def initialize(context): context.cl_add = continuous_future('CL', offset=0, roll='calendar').adj('add') context.cl_mul = continuous_future('CL', offset=0, roll='calendar').adj('mul') context.cl = continuous_future('CL', offset=0, roll='calendar') schedule_function(print_history) def print_history(context, data): frame = data.history([context.cl, context.cl_add, context.cl_mul], ['price', 'sid'], 20, '1d') print 'unadjusted' print frame.loc[:, :, context.cl] print 'adjusted add' print frame.loc[:, :, context.cl_add] print 'adjusted mul' print frame.loc[:, :, context.cl_mul] ```	2016-10-21 10:18:12 -04:00
Eddie Hebert	a5f0df30dd	TST: Encode minutes in continuous future tests. Include minutes (in addition to the days) in the price encoding for continuous futures tests. Need for different values minute to minute arose when working on tests for adjusted values.	2016-10-19 11:40:53 -04:00
Eddie Hebert	a1a99dd9aa	MAINT: Limit perspective offset. Limit the perspective offset to 1. There is a possibility that if a consumer of the AdjustedArrayWindow does not fetch adjustments between the end of the data window and the vantage points beyond the end of the window. Until that case has a solution, e.g. having the consumer of the AdjustedArrayWindow include the perspective offset when calculating the query for adjustments, limit the offsets to 1.	2016-10-17 15:08:11 -04:00
Scott Sanderson	0244f03411	DOC: Fix typo in comment.	2016-10-17 14:23:39 -04:00
Scott Sanderson	9738c14271	MAINT: Use perspective_offset in more tests. - Refactor `test_adjusted_array` to test a range of perspective_offsets in all tests. - Make perspective_offset a parameter to `AdjustedArray.traverse` instead of `AdjustedArray`.	2016-10-17 14:23:39 -04:00
Eddie Hebert	7049d11c1f	MAINT: Perspective offset for load adjustments. Add a perspective offset to `AdjustedArrayWindow` and `AdjustedArray`, so that `HistoryLoader` does not need to twiddle with offsets to support viewing the data from the bar after end of the window, (Which is the case when a '1d' history window is retrieved in minute mode, which is explained in the docstring for `HistoryLoader.history`) Presently, this simplifies the logic in `HistoryLoader._get_adjustments_in_range`, and other incoming AdjustmentReader's, (e.g. the roll based adjustment reader for continous futures.) This patch should also make it easier for history and pipeline to converge on a singular `load_adjustments` method.	2016-10-17 14:23:39 -04:00
Eddie Hebert	73b03de63e	ENH: Add history for continuous futures. Enable unadjusted history for continuous futures. The history array is filled by the values for the underlying contracts, where the contract used changes based on rolls. e.g., if a `1d` history window was over the range `2016-01-20` -> `2016-02-29` with contracts with a suffix of `F16` that rolls at the beginning of the session on `2016-01-26`, `G16` on `2016-02-26`, and `H16` on `2016-03-26`. The `2016-01-20` -> `2016-01-25` portion would use the values for `F16', the `2016-01-26` -> `2016-02-25` portion would use `G16` and the `2016-02-26` -> `2016-02-29` portion would use `H16`. Using the same contracts as above, a `1m` history window over the range (using a timezone of US/Eastern) `2016-01-25 4:00PM` -> `2016-01-25 7:00PM` would fill the `4:00PM` -> `6:00PM` portion with data for `F16` and the `6:01PM` -> `7:00PM` portion with data for `G16`, since the beginning of the `2016-01-26` session is `2016-01-25 6:01PM`. Supports `1d` and `1m`. Also adds the `sid` field to `history` to assist in showing the active contract at each dt in the window.	2016-10-16 22:40:08 -04:00
Joe Jevnik	abcb6276ff	BUG: fix blaze query in ffill_query_in_range to correct issue in events loader	2016-10-13 15:27:44 -04:00
Scott Sanderson	053206327b	ENH: Name overrides in preprocessor factories. Allows ``__funcname`` to be passed to preprocessors like expect_types and expect_dtypes to override the name displayed in error messages. This is useful for providing clearer errors for ``__init__`` and ``__new__`` methods in classes.	2016-10-12 15:50:10 -04:00
Eddie Hebert	ca8950bf9c	ENH: Add current chain for continuous futures. Add `chain`field to current, as well as supporting methods in DataPortal and OrderedContracts. Enables the following example: ``` from zipline.api import continuous_future def initialize(context): context.primary_cl = continuous_future('CL', offset=0, roll='calendar') schedule_function(print_current_chain) def print_current_chain(context, data): chain = data.current_chain(context.primary_cl) print 'datetime={0}'.format(get_datetime()) print 'primary={0}'.format(chain[0]) print 'secondary={0}'.format(chain[1]) print 'tertiary={0}'.format(chain[2]) ``` ``` datetime=2015-12-23 14:31:00+00:00 primary=Future(1058201602 [CLG16]) secondary=Future(1058201603 [CLH16]) tertiary=Future(1058201604 [CLJ16]) ``` Also: - make return types of OrderedContracts methods compatible across architectures. (Noticed while adding `active_chain` method.) - Add year suffix to future contract names in test data.	2016-10-11 16:16:16 -04:00
Maya Tydykov	fe00452b7b	Merge pull request #1525 from quantopian/fix_estimates_overwrites_bug Fix estimates overwrites bug	2016-10-10 09:02:40 -04:00
Maya Tydykov	4efe99017a	TST: add test condition to check for bug	2016-10-10 08:41:02 -04:00
Scott Sanderson	acc46f5fe3	Merge pull request #1530 from quantopian/add-specific-assets ENH: Add `SpecificAssets` filter.	2016-10-09 14:09:14 -04:00
Eddie Hebert	3ab251c90c	Merge pull request #1529 from quantopian/current-contract ENH: Add continuous future current contract.	2016-10-07 23:39:01 -04:00
Eddie Hebert	ec6f298972	ENH: Add continuous future current contract. Add the ability for an algorithm to request the current contract for a future chain via `data.current`. e.g.: ``` data.current(ContinuousFuture('CL', offset=0, roll='calendar'), 'contract') ```	2016-10-07 18:26:23 -04:00
Scott Sanderson	f874e670c4	ENH: Add `SpecificAssets` filter. Adds a filter that matches a set of assets. Mainly useful for testing and debugging.	2016-10-07 18:11:01 -04:00
Andrew Daniels	eba02da271	ENH: Adds last_available_{session, minute} args to DataPortal (#1528 ) This allows optionally setting the last available dts in the DataPortal explicitly. If these args aren't provided, we fall back to inferring these from the underlying readers, which was the previous behavior.	2016-10-06 20:46:54 -04:00
Scott Sanderson	8465ea18d0	TEST: Simplify testing of restriction orderings.	2016-10-05 14:42:17 -04:00

1 2 3 4 5 ...

1338 Commits