Commit Graph

39 Commits

Author SHA1 Message Date
9aa07fed6a Adding Environment Wrapper and including index randomization for trajectory selection 2023-04-13 18:41:15 +02:00
233ca77aa4 Completing initial model and treating memory leak 2023-04-13 18:39:55 +02:00
3e9d8f7a9c Replacing Relu with LeakyRelu 2023-04-12 18:22:02 +02:00
c8fdd11d8c Outputting only 3 channels from the decoder 2023-04-12 17:30:20 +02:00
a83149f61e Keeping channels as 3 2023-04-12 17:29:50 +02:00
1f4667a08d Adding preprocessing function 2023-04-12 17:16:47 +02:00
7d7387bd5d Adding target value function updates and momentum updates 2023-04-12 17:15:17 +02:00
9085abe684 Correcting UB loss 2023-04-12 09:34:11 +02:00
5ded7bc8f1 Adding actor and value learners 2023-04-12 09:33:42 +02:00
cc48b0b0f8 Adding Rewards 2023-04-12 09:33:19 +02:00
ac714e3495 Correct history with detach 2023-04-10 20:18:39 +02:00
de17cab9f5 Add MOCO to introduce lower bound loss 2023-04-10 20:18:17 +02:00
05dd20cdfa Add a class to freeze parameters 2023-04-10 20:17:44 +02:00
8fd56ba94d Adding model architecture for Reward, Value and Target Value 2023-04-10 13:18:41 +02:00
47090449d1 Adding Reward, Value and Target Value models 2023-04-10 13:18:08 +02:00
c4283ced6f Changing CLUB loss and Tensor stacking 2023-04-09 18:23:16 +02:00
6b4762d5fc Changing Upper Bound loss 2023-04-09 18:22:41 +02:00
5caea7695a Changing variable reshaping strategy 2023-04-09 18:22:12 +02:00
ada3cadf0c Adding momentum encoder 2023-04-02 18:52:46 +02:00
d9d350e191 Adding Contrastive learning models 2023-04-02 18:52:26 +02:00
7c9e75030b Updating value model to be stochastic 2023-03-31 19:12:46 +02:00
e6e11f90b7 Adding value model 2023-03-31 18:38:51 +02:00
4e1ef89924 Adding action network 2023-03-31 18:00:07 +02:00
13765c2f9e Adding action decoder 2023-03-31 17:59:42 +02:00
47a0772c9d Replacing seed with version name variable in environment id naming 2023-03-28 20:22:27 +02:00
d558b9f558 Changing names for clean and noisy environments via version 2023-03-28 20:21:58 +02:00
41dcf22262 Collecting dataset from noiseless environment 2023-03-28 20:21:26 +02:00
11f00ad695 Add encoder loss and include tqdm for visualization 2023-03-27 19:23:42 +02:00
a1fe81f018 Grouping for actions too 2023-03-27 19:22:47 +02:00
38cc645253 Update models to give distribution as well in the output 2023-03-27 19:22:17 +02:00
a351134f08 Deleting unnecessary files 2023-03-25 17:53:04 +01:00
ab2b6599c1 File formatting 2023-03-25 17:51:34 +01:00
4515c6a6b7 Detecting and removing corrupt video files from the dataset 2023-03-25 17:36:58 +01:00
25c2853ba6 Adding high noise by randomising the frames 2023-03-25 17:07:07 +01:00
f2aa9baebb Removing ground plane from observations 2023-03-25 14:19:56 +01:00
8464503dd8 Adding new background videos for each episode 2023-03-25 14:18:07 +01:00
43f862ee6d Minor editing changes in comments 2023-03-24 20:41:28 +01:00
641c9bd57c Implementing ICLUB 2023-03-24 20:39:14 +01:00
abaca2bea9 Adding Denoised Predictive Imagination 2023-03-23 15:05:28 +01:00