- backend:
- tensor: add
resize
to SharedTensor (f3bb3b44) - feature_flag: add alternative unstable method to allocate native memory (usable via
unstable_alloc
feature flag) (ebaf3dac) - export: add prelude module (import most of collenchyma via
use collenchyma::prelude::*;
) (4aea78df, 60c1a395) - license: change license to dual MIT/Apache-2.0 (af5c08a0)
- cuda_context: fix repeating CUDA context creation (0bf616f8)
- lib: various fixes concerning impl of plugins (bec27ca1)
- tensor: rename SharedMemory -> Tensor (acc3cbb7)
- travis: fix feature build flags (f9861a73)
- use-size-types: use target dependent size types (4e4a5cd3)
- bench: add benchmarks for memory synchronization (762b87ca)
- features: add native and opencl feature (3609fea1)
- cuda:
- cudnn: add cudnn ffi (0bbbff83)
- library: remove last pieces of library (38dcd6a6)
- opencl: implement shared_memory for OpenCL (be47d6ba)
- perf: make error messages static strings (430c4ed6)
- plugin: move library out; replace with thin plugin mod (3bbebe9a)
- shared_memory: add dimensionality to shared_memory (13cd0905)
- blas: reduce overhead of calling blas (8b7a7aee)
- shared_memory: use linear_map for SharedMemory.copies (44ea377d)
- compilation: make cuda optional (1f933977)
- windows:
- blas:
- computation: add basic design for backend-agnostic computation (a3f9534f)
- cuda:
- add cuda structure (d42430c1)
- dot: add working dot computation (6572c010)
- extern: add backend traits for extern support (f3d50172)
- flatbox: provide slice accessors and give more allocation responsibility to shared_memory (cfbb5b13)
- memory:
- ci: change clippy to be optional (753dfb02)
- collenchyma: outline design (876ac4b0)
- computation: add basic design for backend-agnostic computation (e43f947a)
- dot: add working dot computation (f5c8fdaa)
- flatbox: provide slice accessors and give more allocation responsibility to shared_memory (a31dd493)
- lib: add foundation of the library design (f5fd0235)
- memory:
- ci: change clippy to be optional (db13da29)