Performant, differentiable reinforcement learning

Related tags

Deep Learningdeluca
Overview

deluca

Performant, differentiable reinforcement learning

Notes

  1. This is pre-alpha software and is undergoing a number of core changes. Updates to follow.
  2. Please see the examples for guidance on how to use deluca

pypi pyversions security: bandit Code style: black License: Apache 2.0

build coverage Documentation Status doc_coverage

deluca

Comments
  • Exception error during installing deluca

    Exception error during installing deluca

    Hi.

    I am trying to install deluca and I get an Exception error. I am using

    Ubuntu 64 on a virtual machine Pycharm CE 2021.2, Python 3.8 pip 212.1.2

    I tried to install deluca with the package manager in Pycharm, the terminal in Pycharm and also the Ubuntu terminal. The error is the same. Note that I can install other normal packages like Numpy, Scipy, etc with no problem. Thanks in advance and I am looking forward to using this amazing package!

    pip install deluca
    Collecting deluca
       Using cached deluca-0.0.17-py3-none-any.whl (52 kB)
    Collecting flax
       Using cached flax-0.3.4-py3-none-any.whl (183 kB)
    Collecting brax
       Using cached brax-0.0.4-py3-none-any.whl (117 kB)
    Processing
    ./.cache/pip/wheels/78/ae/07/bd3adac873fa80efc909c09331831905ac657dbb8d1278235e/jax-0.2.19-py3-none-any.whl
    Collecting optax
       Using cached optax-0.0.9-py3-none-any.whl (118 kB)
    Collecting scipy
       Using cached
    scipy-1.7.1-cp38-cp38-manylinux_2_5_x86_64.manylinux1_x86_64.whl (28.4 MB)
    Collecting numpy
       Using cached
    numpy-1.21.2-cp38-cp38-manylinux_2_12_x86_64.manylinux2010_x86_64.whl
    (15.8 MB)
    Collecting matplotlib
       Using cached matplotlib-3.4.3-cp38-cp38-manylinux1_x86_64.whl (10.3 MB)
    Collecting msgpack
       Using cached msgpack-1.0.2-cp38-cp38-manylinux1_x86_64.whl (302 kB)
    Collecting grpcio
       Using cached grpcio-1.39.0-cp38-cp38-manylinux2014_x86_64.whl (4.3 MB)
    Collecting clu
       Using cached clu-0.0.6-py3-none-any.whl (77 kB)
    Collecting gym
       Using cached gym-0.19.0.tar.gz (1.6 MB)
    Collecting absl-py
       Using cached absl_py-0.13.0-py3-none-any.whl (132 kB)
    Collecting tfp-nightly[jax]<=0.13.0.dev20210422
       Using cached tfp_nightly-0.13.0.dev20210422-py2.py3-none-any.whl (5.3 MB)
    Collecting jaxlib
       Using cached jaxlib-0.1.70-cp38-none-manylinux2010_x86_64.whl (46.9 MB)
    Collecting dataclasses
       Using cached dataclasses-0.6-py3-none-any.whl (14 kB)
    Collecting opt-einsum
       Using cached opt_einsum-3.3.0-py3-none-any.whl (65 kB)
    Collecting chex>=0.0.4
       Using cached chex-0.0.8-py3-none-any.whl (57 kB)
    Requirement already satisfied: pillow>=6.2.0 in
    /usr/lib/python3/dist-packages (from matplotlib->flax->deluca) (7.0.0)
    Collecting cycler>=0.10
       Using cached cycler-0.10.0-py2.py3-none-any.whl (6.5 kB)
    Collecting pyparsing>=2.2.1
       Using cached pyparsing-2.4.7-py2.py3-none-any.whl (67 kB)
    Collecting kiwisolver>=1.0.1
       Using cached kiwisolver-1.3.1-cp38-cp38-manylinux1_x86_64.whl (1.2 MB)
    Requirement already satisfied: python-dateutil>=2.7 in
    /usr/lib/python3/dist-packages (from matplotlib->flax->deluca) (2.7.3)
    Requirement already satisfied: six>=1.5.2 in
    /usr/lib/python3/dist-packages (from grpcio->brax->deluca) (1.14.0)
    Collecting tensorflow-datasets
       Using cached tensorflow_datasets-4.4.0-py3-none-any.whl (4.0 MB)
    Collecting packaging
       Using cached packaging-21.0-py3-none-any.whl (40 kB)
    Collecting ml-collections
       Using cached ml_collections-0.1.0-py3-none-any.whl (88 kB)
    Collecting tensorflow
       Downloading tensorflow-2.6.0-cp38-cp38-manylinux2010_x86_64.whl
    (458.4 MB)
          |▋                               | 8.4 MB 16 kB/s eta
    7:44:54ERROR: Exception:
    Traceback (most recent call last):
       File
    "/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
    line 425, in _error_catcher
         yield
       File
    "/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
    line 507, in read
         data = self._fp.read(amt) if not fp_closed else b""
       File
    "/usr/share/python-wheels/CacheControl-0.12.6-py2.py3-none-any.whl/cachecontrol/filewrapper.py",
    line 62, in read
         data = self.__fp.read(amt)
       File "/usr/lib/python3.8/http/client.py", line 455, in read
         n = self.readinto(b)
       File "/usr/lib/python3.8/http/client.py", line 499, in readinto
         n = self.fp.readinto(b)
       File "/usr/lib/python3.8/socket.py", line 669, in readinto
         return self._sock.recv_into(b)
       File "/usr/lib/python3.8/ssl.py", line 1241, in recv_into
         return self.read(nbytes, buffer)
       File "/usr/lib/python3.8/ssl.py", line 1099, in read
         return self._sslobj.read(len, buffer)
    socket.timeout: The read operation timed out
    
    During handling of the above exception, another exception occurred:
    
    Traceback (most recent call last):
       File
    "/usr/lib/python3/dist-packages/pip/_internal/cli/base_command.py", line
    186, in _main
         status = self.run(options, args)
       File
    "/usr/lib/python3/dist-packages/pip/_internal/commands/install.py", line
    357, in run
         resolver.resolve(requirement_set)
       File
    "/usr/lib/python3/dist-packages/pip/_internal/legacy_resolve.py", line
    177, in resolve
         discovered_reqs.extend(self._resolve_one(requirement_set, req))
       File
    "/usr/lib/python3/dist-packages/pip/_internal/legacy_resolve.py", line
    333, in _resolve_one
         abstract_dist = self._get_abstract_dist_for(req_to_install)
       File
    "/usr/lib/python3/dist-packages/pip/_internal/legacy_resolve.py", line
    282, in _get_abstract_dist_for
         abstract_dist = self.preparer.prepare_linked_requirement(req)
       File
    "/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
    line 480, in prepare_linked_requirement
         local_path = unpack_url(
       File
    "/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
    line 282, in unpack_url
         return unpack_http_url(
       File
    "/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
    line 158, in unpack_http_url
         from_path, content_type = _download_http_url(
       File
    "/usr/lib/python3/dist-packages/pip/_internal/operations/prepare.py",
    line 303, in _download_http_url
         for chunk in download.chunks:
       File "/usr/lib/python3/dist-packages/pip/_internal/utils/ui.py", line
    160, in iter
         for x in it:
       File "/usr/lib/python3/dist-packages/pip/_internal/network/utils.py",
    line 15, in response_chunks
         for chunk in response.raw.stream(
       File
    "/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
    line 564, in stream
         data = self.read(amt=amt, decode_content=decode_content)
       File
    "/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
    line 529, in read
         raise IncompleteRead(self._fp_bytes_read, self.length_remaining)
       File "/usr/lib/python3.8/contextlib.py", line 131, in __exit__
         self.gen.throw(type, value, traceback)
       File
    "/usr/share/python-wheels/urllib3-1.25.8-py2.py3-none-any.whl/urllib3/response.py",
    line 430, in _error_catcher
         raise ReadTimeoutError(self._pool, None, "Read timed out.")
    urllib3.exceptions.ReadTimeoutError:
    HTTPSConnectionPool(host='files.pythonhosted.org', port=443): Read timed
    out.
    
    opened by FarnazAdib 4
  • Internal change

    Internal change

    Internal change

    FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

    opened by copybara-service[bot] 0
  • Internal change

    Internal change

    Internal change

    FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

    opened by copybara-service[bot] 0
  • Internal change

    Internal change

    Internal change

    FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

    cla: yes 
    opened by copybara-service[bot] 0
  • Internal change

    Internal change

    Internal change

    FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

    cla: yes 
    opened by copybara-service[bot] 0
  • Consider dependency on OpenAI Gym

    Consider dependency on OpenAI Gym

    • Not clear what the benefits of compatibility are since existing agents that work on OpenAI Gym environments have no guarantee of working on deluca environments
    • OpenAI Gym bundles environment with initialization and task. Not necessarily something we want to do.
    opened by danielsuo 0
  • Changes to _adaptive.py

    Changes to _adaptive.py

    Hello! I made some modifications to AdaGPC (in _adaptive.py). In the existing implementation, GPC outperforms AdaGPC in the known LDS setting, which is the opposite of what one should expect. Based on some preliminary experiments, I believe AdaGPC is now working properly (at least in the known dynamics version). (I also made some miscellaneous changes in other files, e.g., to the imports in some of the agent files -- I think there might have been some file restructuring across different versions of deluca, but the imports were not updated to reflect this change, causing some errors at runtime.) Please let me know if you have any questions/concerns. Thanks!

    opened by simran135 1
  • [JAX] Avoid private implementation detail _ScalarMeta.

    [JAX] Avoid private implementation detail _ScalarMeta.

    [JAX] Avoid private implementation detail _ScalarMeta.

    The closest public approximation to type(jnp.float32) is type[Any]. Nothing is ever actually an instance of one of these types, either (they build DeviceArrays if instantiated.)

    opened by copybara-service[bot] 0
  • [JAX] Avoid private implementation detail _ScalarMeta.

    [JAX] Avoid private implementation detail _ScalarMeta.

    [JAX] Avoid private implementation detail _ScalarMeta.

    The closest public approximation to type(jnp.float32) is type[Any]. Nothing is ever actually an instance of one of these types, either (they build DeviceArrays if instantiated.)

    opened by copybara-service[bot] 0
  • Internal change

    Internal change

    Internal change

    FUTURE_COPYBARA_INTEGRATE_REVIEW=https://github.com/google/deluca/pull/57 from google:inverse_map baa4932444495538d91151653165cdcb386b52fc

    opened by copybara-service[bot] 0
  • Implementation of drc

    Implementation of drc

    Hi

    Thanks for providing this interesting package.

    I am trying to test drc on a simple setup and I notice that the current implementation of drc does not work. I mean when I try it for a simple partially observable linear system with A = np.array([[1.0 0.95], [0.0, -0.9]]), B = np.array([[0.0], [1.0]]) C = np.array([[1.0, 0]]) Q , R = I gaussian process noise, zero observation noise which is open loop stable, the controller acts like a zero controller. I tried to get a different response by setting the hyperparameters but they are mostly the same. Then I looked at the implementation at the deluca github and I noticed that the counterfactual cost is not defined correctly (if I am not wrong). According to Algorithm 1 in [1], we need to use M_t to compute y_t (which depends on the previous controls (u) using again M_t) but in the implementation, the previous controls based on M_{t-i} are used. Anyway, I implemented the algorithm using M_t but what I get after the simulation is either close to zero control or an unstable one.

    I was wondering if you have any code example for the DRC algorithm that works? [1] Simchowitz, Max and Singh, Karan and Hazan, Elad, "Improper learning for non-stochastic control", COLT 2020.

    Thanks a lot, Sincerely, Farnaz

    opened by FarnazAdib 4
Releases(v0.0.17)
Owner
Google
Google ❤️ Open Source
Google
Vehicles Counting using YOLOv4 + DeepSORT + Flask + Ngrok

A project for counting vehicles using YOLOv4 + DeepSORT + Flask + Ngrok

Duong Tran Thanh 37 Dec 16, 2022
Learnable Boundary Guided Adversarial Training (ICCV2021)

Learnable Boundary Guided Adversarial Training This repository contains the implementation code for the ICCV2021 paper: Learnable Boundary Guided Adve

DV Lab 27 Sep 25, 2022
Visualization toolkit for neural networks in PyTorch! Demo -->

FlashTorch A Python visualization toolkit, built with PyTorch, for neural networks in PyTorch. Neural networks are often described as "black box". The

Misa Ogura 692 Dec 29, 2022
SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

SubOmiEmbed: Self-supervised Representation Learning of Multi-omics Data for Cancer Type Classification

Sayed Hashim 3 Nov 15, 2022
Machine-in-the-Loop Rewriting for Creative Image Captioning

Machine-in-the-Loop Rewriting for Creative Image Captioning Data Annotated sources of data used in the paper: Data Source URL Mohammed et al. Link Gor

Vishakh P 6 Jul 24, 2022
The Official TensorFlow Implementation for SPatchGAN (ICCV2021)

SPatchGAN: Official TensorFlow Implementation Paper "SPatchGAN: A Statistical Feature Based Discriminator for Unsupervised Image-to-Image Translation"

39 Dec 30, 2022
Official PyTorch implementation of "Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient".

Edge Rewiring Goes Neural: Boosting Network Resilience via Policy Gradient This repository is the official PyTorch implementation of "Edge Rewiring Go

Shanchao Yang 4 Dec 12, 2022
[CVPR2021 Oral] UP-DETR: Unsupervised Pre-training for Object Detection with Transformers

UP-DETR: Unsupervised Pre-training for Object Detection with Transformers This is the official PyTorch implementation and models for UP-DETR paper: @a

dddzg 430 Dec 23, 2022
Official PyTorch implementation of "RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on" (IJCAI-ECAI 2022)

RMGN-VITON RMGN: A Regional Mask Guided Network for Parser-free Virtual Try-on In IJCAI-ECAI 2022(short oral). [Paper] [Supplementary Material] Abstra

27 Dec 01, 2022
Cross View SLAM

Cross View SLAM This is the associated code and dataset repository for our paper I. D. Miller et al., "Any Way You Look at It: Semantic Crossview Loca

Ian D. Miller 99 Dec 09, 2022
bio_inspired_min_nets_improve_the_performance_and_robustness_of_deep_networks

Code Submission for: Bio-inspired Min-Nets Improve the Performance and Robustness of Deep Networks Run with docker To build a docker environment, chan

0 Dec 09, 2021
Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

Query Variation Generators This repository contains the code and annotation data for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelin

Gustavo Penha 12 Nov 20, 2022
Implementation of a Transformer, but completely in Triton

Transformer in Triton (wip) Implementation of a Transformer, but completely in Triton. I'm completely new to lower-level neural net code, so this repo

Phil Wang 152 Dec 22, 2022
Code for Active Learning at The ImageNet Scale.

Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training with torch's DDP.

Zeyad Emam 47 Dec 12, 2022
Public Models considered for emotion estimation from EEG

Emotion-EEG Set of models for emotion estimation from EEG. Composed by the combination of two deep-learing models learning together (RNN and CNN) with

Victor Delvigne 21 Dec 23, 2022
Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CVPR 2021)

Semi-supervised Semantic Segmentation with Directional Context-aware Consistency (CAC) Xin Lai*, Zhuotao Tian*, Li Jiang, Shu Liu, Hengshuang Zhao, Li

DV Lab 137 Dec 14, 2022
Meta Self-learning for Multi-Source Domain Adaptation: A Benchmark

Meta Self-Learning for Multi-Source Domain Adaptation: A Benchmark Project | Arxiv | YouTube | | Abstract In recent years, deep learning-based methods

CVSM Group - email: <a href=[email protected]"> 188 Dec 12, 2022
Few-Shot-Intent-Detection includes popular challenging intent detection datasets with/without OOS queries and state-of-the-art baselines and results.

Few-Shot-Intent-Detection Few-Shot-Intent-Detection is a repository designed for few-shot intent detection with/without Out-of-Scope (OOS) intents. It

Jian-Guo Zhang 73 Dec 26, 2022
Self-Guided Contrastive Learning for BERT Sentence Representations

Self-Guided Contrastive Learning for BERT Sentence Representations This repository is dedicated for releasing the implementation of the models utilize

Taeuk Kim 16 Dec 04, 2022
PromptDet: Expand Your Detector Vocabulary with Uncurated Images

PromptDet: Expand Your Detector Vocabulary with Uncurated Images Paper Website Introduction The goal of this work is to establish a scalable pipeline

103 Dec 20, 2022