https://cran.r-project.org/web/packages/pomdp/vignettes/POMDP.html#defining-a-pomdp-problem.
Madani, Omid. On the Computability of Infinite-Horizon Partially Observable Markov Decision Processes. 1999.
Müller, Vincent C. Ethics of Artificial Intelligence and Robotics. Stanford Encyclopedia of Philosophy. 2020.
Nevins, Daniel. Halakhic Responses to Artificial Intelligence and Autonomous Machines. 2019.
Perfectly Natural. Victor Alonso-Berbel. DUST, 2018. Film.
The Social Dilemma. Jeff Orlowski. Netflix, 2020. Film.
Abel, David et al. Reinforcement Learning As a Framework for Ethical Decision Making. AAAI Workshop: AI, Ethics, and Society. 2016. ↩︎
Abel pg 6. Observations and actions yield further actions via a Bayesian update process where new observations inform a “belief” of the environment’s true state. This is the difference between the POMDP and the canonical MDP, where actions depend on full knowledge of the actual state of the environment. ↩︎
Abel pg 5. The paper does not make explicit what “companion” means. This could be another agent, an honest and ethical human, or some kind of nefarious actor. ↩︎
Madani, Omid. On the Computability of Infinite-Horizon Partially Observable Markov Decision Processes. 1999. ↩︎
RLE, pgs 4, 5 ↩︎
Kamalzadeh, H. & Hahsler, M. POMDP: Introduction to Partially Observable Markov Decision Processes. R. 2019. https://cran.r-project.org/web/packages/pomdp/vignettes/POMDP.html#defining-a-pomdp-problem. ↩︎
Arguably, this is where our less deterministic “gut feeling” originates. In The Eden Project, Jungian psychologist James Hollis claims that our understanding of the world is shaped quite explicitly at an early age. ↩︎