catalyst

mirror of https://github.com/wassname/catalyst.git synced 2026-07-01 06:14:29 +08:00

Author	SHA1	Message	Date
Andrew Daniels	b501ecc736	ENH: Adds support for supplementary asset mappings (#1612 ) * ENH: Adds support for supplementary asset mappings - Adds a supplementary_mappings table to the assets.db, to hold point- in-time mappings of sids to arbitrary categories of values, e.g. alternative identifiers. This bumps ASSET_DB_VERSION. - Adds supplementary_map and supplementary_map_by_sid to AssetFinder, caches of the underlying table that are fully populated on first access, which map the supplementary values to sids and vice versa, respectively. - Adds lookup_by_supplementary_mapping method, which fronts supplementary_map to query for the asset last known to have held a value at a given dt. - Add get_supplementary_field method, which fronts supplementary_map_by_sid to query for the last known value held by an asset at a given dt.	2016-12-16 15:20:53 -05:00
Joe Jevnik	de7c32b22c	ENH: Allow custom chunks iterators for attach_pipeline	2016-12-15 19:58:19 -05:00
Eddie Hebert	a19ec84a1d	BUG: Allow rolls to skip over contracts. For futures that behave like GC, use the latest roll as the back contract when walking backwards over the window, so that when the front contract is skipped because it never has more volume between its auto close date and the previous auto close date, the back contract which did have volume is still used when making comparisons to construct the chain.	2016-12-05 22:33:03 -05:00
Joe Jevnik	c1063eac26	BUG: fix blaze pipeline queries for asof_date	2016-12-01 14:35:59 -05:00
Eddie Hebert	1f71c8d068	ENH: Allow future chains to only use certain delivery months. To support contracts such as `PL` which should roll from F->J->N->V, add the ability to pass a predicate function to the ordered contract chain contstrution which returns `True` if the contract is allowed in the chain.	2016-12-01 13:26:07 -05:00
Eddie Hebert	d217b275fb	BUG: Fix 1m history for volume rolls with adjustments. Convert the end minute to the its session label before calling `_active_contract`, otherwise the volume roll finder's attempt to use the session bar reader fails due to a non-session label Timestamp.	2016-11-30 13:13:53 -05:00
Eddie Hebert	c3004b2529	BUG: Support futures which do not roll month to month. Fix multiple errors when attempting to generate rolls for futures which do not roll month to month, e.g. the Eurodollar. These errors were caused by logic that always incremented from contract to contract by delivery month, with errors when the next contract was not part of the quarterly roll chain and thus had not yet begun trading even though the previous contract had autoclosed. Instead, filter out these contracts and only allow contracts that have begun trading before the previous contract's autoclose. This is in lieu of a more explicit specification of quarterly rolls.	2016-11-30 11:23:43 -05:00
Eddie Hebert	69b35a131a	Merge pull request #1604 from quantopian/use-linked-list-for-contracts MAINT: Use a doubly linked list for contract chain.	2016-11-30 06:54:28 -05:00
Eddie Hebert	518f3d1f4b	MAINT: Use a doubly linked list for contract chain. Instead of requiring the roll finder to juggle the indices into the ordered contracts, use a doubly linked list where the nodes element is the contract with members pointing to the previous and next contracts in the chain. Besides improving legibility in the roll finder code, this change is on the path to adding a predicate to exclude contracts from the chain, e.g. contracts in ED which are not in the roll schedule. Change test results for primary chain, since new implementaton does not stop at contract in which has not yet started when constructing the chain.	2016-11-30 06:01:59 -05:00
Scott Sanderson	e5c941e592	Merge pull request #1588 from quantopian/randc-built-in-factors ENH: Add MACD, MA, and AnnVol as built in factors	2016-11-28 16:18:58 -05:00
Scott Sanderson	9c05e5edfe	MAINT: Tweaks/cleanups in technical.py. - Use `expect_bounded` to check inputs. - Add tests for expected failures from `MACDSignal`. - Use `float64` instead of `float` in a few places. This prevents diverging behavior on 32-bit systems. - Docstring edits.	2016-11-28 13:02:40 -05:00
Scott Sanderson	5e8cc00625	STY: Put whitespace between operators.	2016-11-28 12:57:03 -05:00
Scott Sanderson	9b8d6202f0	TEST: Use parameter_space for randomized tests. - Use a RandomState with a seed so that we have repeatible results. - Use `randint` instead of `random_integers.` `random_integers` is deprecated. - Use `parameter_space` to test multiple period lengths.	2016-11-28 12:57:03 -05:00
Ana Ruelas	f1254ea79a	ENH: Add MACDSignal, test with random input	2016-11-23 11:43:29 -05:00
Eddie Hebert	40605e3c5d	BUG: Fix bounds errors in roll finder. Fix common error condition which was triggered whenever the session at the end of the prefetched history window was a session where the back contract was active. When the back contract was the active contract, the next contract for consideration was the front contract at the end of the window, which definitionally always has an autoclose after the end of the window. Instead, just start seeking backwards from the end of the window. Also prevent lookahead bias in volume rolls, which was caused by the using the volume for a session to determine whether that session had rolled. Information that would not have been available at the beginning of the session. This change makes the volume rolls overly conservative, and may be improved by looking at vectors of the preceding volume and making the roll off of momentum.	2016-11-23 10:35:45 -05:00
Ana Ruelas	9aa6a585c4	TST: Add test for annualized volatility factor	2016-11-21 14:24:28 -05:00
Ana Ruelas	66dd10a50c	ENH: Actually use rolling windows for EWMA in MACD	2016-11-21 14:24:28 -05:00
Maya Tydykov	bfc01546ae	ENH: allow estimates to be split-adjusted. This modificaiton to the estimates loader allows the caller to pass in an equity pricing loader which can then be used to get split data for sids. That split data is then used to do point-in-time adjustments of estimates data. TST: add test for multiple estimates columns TST: add test for multiple datasets requesting different columns TST: add blaze versions for all next/previous tests	2016-11-18 09:39:54 -05:00
Eddie Hebert	5624e0f391	BUG: Fix minute bar last traded after half day. When the following conditions occur, - a `nan` occurred after a half day (e.g. on the Monday after Thanksgiving, where the Friday would be a half day.) -data was written to the span between the early close and where the market close would have been if it were not an early close session - a `nan` also occured on the last minute of the early market session. the exisitng implementation would incorrectly return a `nan` when requesting a forward filled price. The steps that caused this error were. 1. Request for `'price'` on the market open of the day after the early close. 2. `nan` is found for that minute 3. `get_last_traded_dt` is called, and finds a volume that occurs after the early close. e.g. `18:47` when the market close was `18:00`. 4. The minute position for `18:47` is used, when calling `find_positon_of_minute`, since that value is after the `market_close` the minute is set to the position of `18:00`` due to the delta logic in 5. Since there is also no data in at `18:00`, a `nan` is returned, even though there were valid minutes earlier in the session. e.g. a non-zero volume at `16:47` should have been used, but was not. Fix by checking the current minute against the minute close when searching for the last traded minute. If the minute is greater than the market close for the corresponding day, continue the search until the minute position is within the trading session. This could also be fixed by enforcing that only zeros can be written between an early close and the minute where the close would have been, but this fix allows the reader to work with existing data.	2016-11-15 15:09:19 -05:00
Eddie Hebert	48c360727b	BUG: Fix continuous future end dates. The end date of the last contract with a sufficient start date was being used for the continuous future overall end date; however the end date of that contract (which is the last day for which there is data for the contract) is not necessarily the greatest end date out of all contracts. It is possible for the furthest out contract to have some, but very few, trades before it is more actively traded. Which would give it a start date within in the range of the simulation, but an end date is earlier than the other contracts which are active during the simulation. This bug would result in `nan`s when getting the current price because of the `end_date` check in `get_spot_value`. When the current simulation time was greater than the `end_date` of the last contract the condition which guards against attempting to get data for an instrument past its end date would return a `nan`, even when the current underlying contract did have data for that date. Use max end date of all contracts instead of the last one, to ensure that the continuous future last date is always great enough to allow access to all contracts with in the chain. Also, use min start date to accurately mirror the end date behavior.	2016-11-09 16:19:19 -05:00
Eddie Hebert	57d35f6aac	BUG: Fix bad attribute lookup on session continuous future reader. Use `roll_style` not `roll`. Also, add test case to cover using the session bar reader `get_value`, by adding a test which uses `close`, since only `contract` was being exercised, which does not exercise the session daily bar reader.	2016-11-08 15:48:28 -05:00
Eddie Hebert	f7fdc56777	Merge pull request #1583 from quantopian/allow-sliding-window-to-reset ENH: Allow arbitrary history queries.	2016-11-07 22:31:13 -05:00
Eddie Hebert	6ff1d55504	ENH: Allow arbitrary history queries. In preparation for using `DataPortal` in notebooks, remove restriction on the `HistoryLoader` to dates that are monotonically increasing. Notebook usage of the `DataPortal` is more useful when the end of the history window can be arbitrary dates without having to restart the notebook kernel. Due to the implementation of the prefetch and caching logic, the end date of history calls could previously only increase. e.g. `2016-11-01`, `2016-11-02`, `2016-11-03`. This pattern was sufficient for backtesting and live simulations, since the current time of the algorithm only ever increases. With this change, which resets the underlying sliding window when the last fetched idx is greater than the Now calls to history in the same process with end dates such `2016-11-01`, `2016-10-31`, `2015-11-02` should work.	2016-11-07 16:40:51 -05:00
Andrew Daniels	f94a161c7a	BUG: Allows 'contract' in get_spot_value with daily frequency (#1582 ) Also removes duplicate check in test_current_contract.	2016-11-07 16:28:48 -05:00
Eddie Hebert	a3df1e3cef	ENH: Allow configurable history prefetch length. To support using a `DataPortal` and `HistoryLoader` in a notebook, allow the prefetch length to be configurable, so that it can be set to 0. Unlike backtesting where the prefetch is useful for repeated history windows viewed from datetimes which are monotonically increasing by a small amount, the notebook usage of history windows needs only to retrieve the exact data needed for the window specified. This patch also fixes some boundary conditions related to rolls and adjustments which were uncovered by querying for the adjustments with an end date near the end of the window.	2016-11-04 13:30:30 -04:00
Scott Sanderson	24931bd4dd	MAINT: Rename `SpecificAssets` to `StaticAssets`. This better reflects the intended usage of accepting a fixed set of predetermined assets.	2016-11-01 11:01:36 -04:00
Joe Jevnik	ad7ddd17ea	STY: cleanup	2016-10-28 15:04:18 -04:00
Scott Sanderson	66a214fae9	ENH: Allow windows of NumericalExpression. If all the inputs to an expr are window-safe, then the expr is itself window-safe.	2016-10-28 15:04:18 -04:00
Scott Sanderson	0f57dac4ab	TEST: Add more populate_initial_workspace tests. - Tests different pipeline lengths and window lengths. - Tests a term that depends on a window of a term that's been precomputed.	2016-10-28 15:04:18 -04:00
Joe Jevnik	7f40f7a99d	STY: remove unused imports and method, clean up docs	2016-10-28 15:04:18 -04:00
Joe Jevnik	d5b901df3f	TST: add tests for postprocess and to_workspace_value	2016-10-28 15:04:18 -04:00
Joe Jevnik	0eb461bd16	ENH: cleanup branch based on feedback	2016-10-28 15:04:18 -04:00
Joe Jevnik	92516e6609	ENH: prune the graph based on the initial workspace	2016-10-28 15:04:18 -04:00
Joe Jevnik	499680553f	ENH: update aliasedmixin repr	2016-10-28 15:04:18 -04:00
Joe Jevnik	f9ce7f060e	ENH: Make aliases filters, factors, and classifiers to give them their methods	2016-10-28 15:04:18 -04:00
Scott Sanderson	7fc2588571	WIP: Add Alias expression.	2016-10-28 15:04:18 -04:00
Joe Jevnik	4f9c6bd254	ENH: provide a hook for prepopulating the initial workspace	2016-10-28 15:04:18 -04:00
Eddie Hebert	e93087e452	Merge pull request #1565 from quantopian/fix-offset-history BUG: Fix continuous future history with offsets.	2016-10-28 09:44:34 -04:00
Eddie Hebert	8876092d29	BUG: Protect against contract offset at end of range. (#1564 ) This boundary case was exposed with internal fixture data which used a continuous future with a contract chain of size one.	2016-10-27 16:48:34 -04:00
Eddie Hebert	c25da8f442	BUG: Fix continuous future history with offsets. Apply offset value when writing out the rolls in a continuous future which is offset from the primary.	2016-10-27 16:23:03 -04:00
Eddie Hebert	e1bafe1ecc	BUG: Use proxy for settlement on future adjustments. Instead of using the difference between the session close of the front contract before the roll and and the open of back contract on the beginning of the roll, use the close of both at the end of the session before the roll. The closes of the session prior to roll is in lieu of settlement data.	2016-10-27 12:40:59 -04:00
Eddie Hebert	642e404982	Merge pull request #1556 from quantopian/volume-based-rolls ENH: Volume based rolls for futures.	2016-10-25 15:21:41 -04:00
Eddie Hebert	473c8fddba	ENH: Volume based rolls for futures. Add roll style which takes the volume of the contracts into account. If the volume moves from the front to the back before the auto close date, the roll is put at that session. Also, factors out some of the common logic shared with calendar based rolls.	2016-10-25 14:08:21 -04:00
Eddie Hebert	7e3e00c772	Merge pull request #1555 from quantopian/return-nan-from-bcolz-daily-reader MAINT: Return nan from daily bcolz get_value.	2016-10-25 11:49:51 -04:00
Eddie Hebert	a823cceabc	MAINT: Return nan from daily bcolz get_value. Match the behavior of the minute bar reader, now that the session and minute bar readers share a common interface. isnull is slightly slower than checking against -1; however, n cases where we check against illiquid trades in a tight loop, volume is checked which is not using nan. The change here should be marginal with regards to performance.	2016-10-25 11:25:09 -04:00
Eddie Hebert	fa63fa6dd5	TST: Make test dates aligned with test data. Move dates queried near beginning of test data so that the range of data covered does not extend beyond the beginning of the range. i.e. the windows were covering 2016-01-25, which had no test data generated. (Does not matter for the calendar based rolls, but is needed for volume based rolls.) Also, make room for having the first roll to be a day before the first auto close by moving the first contracts auto close date back a day. In preparation for testing volume rolls.	2016-10-25 10:21:23 -04:00
Eddie Hebert	18096f750a	BUG: Fix session from minute reader's last traded. The last traded dt provided from the session bar reader which resamples from minutes should provide a dt that is a session label, not one that is at the minute frequency.	2016-10-24 13:58:58 -04:00
Eddie Hebert	e82fef41dd	PERF: Speedup minute to session sampling. The minute to session sampling reading was creating two DataFrame objects, the first to hold the minute data, and then a second returned by the `DataFrame.groupby` to sample down to sessions. Instead use the arrays returned by the minute readers `load_raw_arrays` and implement sampling logic which takes advantage that the minutes being passed start with the first minute of the first session and end with the last minute of the last session. On my machine this takes the tests in `test/test_continuous_futures` from ~4.0 to about ~0.1 seconds.	2016-10-24 09:59:22 -04:00
Maya Tydykov	086ea6be6b	TST: update adjustment tests - add gaps between sids TST: add a seed for permuting	2016-10-21 16:53:56 -04:00
Eddie Hebert	ce37ea64a9	ENH: Add adjusted history for continuous futures. Add `.adj('mul')` and `.adj('add')` methods on ContinuousFuture, which when used with `history`, will calculate and apply adjustments so that the values are adjusted to account for discounts and premiums during rolls. Example usage in an algo: ``` from zipline.api import continuous_future def initialize(context): context.cl_add = continuous_future('CL', offset=0, roll='calendar').adj('add') context.cl_mul = continuous_future('CL', offset=0, roll='calendar').adj('mul') context.cl = continuous_future('CL', offset=0, roll='calendar') schedule_function(print_history) def print_history(context, data): frame = data.history([context.cl, context.cl_add, context.cl_mul], ['price', 'sid'], 20, '1d') print 'unadjusted' print frame.loc[:, :, context.cl] print 'adjusted add' print frame.loc[:, :, context.cl_add] print 'adjusted mul' print frame.loc[:, :, context.cl_mul] ```	2016-10-21 10:18:12 -04:00

1 2 3 4 5 ...

1355 Commits