the agent learns skills that increase in complexity
How do they know what skills the agent learns?
σ
What is sigma here?
−V*
Why care about this?
(1 − λ)v(s_{t+1}) + λV^λ_{t+1}
If lambda is between 0 and 1, then this part of the equation is just equal to v(s_{t+1}), right? In that case, why do we need lambda at all?
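The excerpt looks like one term of a λ-return recursion (Dreamer-style notation, assumed): V^λ_t = r_t + γ((1 − λ)v(s_{t+1}) + λV^λ_{t+1}), with a bootstrap v(s_H) at the horizon. A minimal sketch, with illustrative names (`rewards`, `values`, `lam`) not taken from the paper:

```python
def lambda_returns(rewards, values, gamma=0.99, lam=0.95):
    """Compute lambda-returns backwards over a trajectory.

    rewards[t] is the reward at step t; values has one extra entry,
    the bootstrap value v(s_H) at the final state.
    """
    H = len(rewards)
    returns = [0.0] * H
    next_return = values[H]  # bootstrap with v(s_H)
    for t in reversed(range(H)):
        returns[t] = rewards[t] + gamma * (
            (1 - lam) * values[t + 1] + lam * next_return
        )
        next_return = returns[t]
    return returns
```

Note that with lam=0 the target collapses to the one-step TD target r_t + γv(s_{t+1}), while with lam=1 it becomes the full Monte Carlo return with a final bootstrap; intermediate λ interpolates between the two rather than reducing to v(s_{t+1}).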
z^k_t, z^k_{t+1}
How do they make sure that the object embeddings are aligned e.g. that the k-th object in z_t is the same as the k-th object in z_t+1?
z^k_t + T^k(z_t, a_t), z^k_{t+1}
Won't this always be 0, since the first and second terms in the distance function are calculated in the same way? The definition of z_t+1 (as defined earlier) is "z_t+1 = z_t + T(z_t,a_t)"
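If this is the C-SWM-style contrastive loss (Kipf et al., 2020 — an assumption based on the notation), the second argument of the distance is the encoder's embedding of the *observed* next frame, while the first is the transition model's prediction, so the energy is zero only when the prediction is exact. A minimal sketch with illustrative names:

```python
import numpy as np

def transition_energy(z_t, delta, z_next_encoded):
    """Squared Euclidean distance between predicted and encoded next state."""
    pred = z_t + delta  # delta stands in for T(z_t, a_t)
    return float(np.sum((pred - z_next_encoded) ** 2))

# perfect prediction -> zero energy; imperfect -> positive
e_good = transition_energy(np.array([1.0, 2.0]), np.array([0.5, 0.0]),
                           np.array([1.5, 2.0]))
e_bad = transition_energy(np.array([1.0, 2.0]), np.array([0.5, 0.0]),
                          np.array([2.0, 2.0]))
```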
Each feature map m^k_t = [E_ext(s_t)]_k can be interpreted as an object mask corresponding to one particular object slot, where [...]_k denotes selection of the k-th feature map
How do they make sure that each feature map actually represents an object, and not something else from the image (e.g. just a fixed section of the image)?
For each target
What is the target?
On each iteration, the single predicate that improves J_surr the most is added to the set
What if none of the invented predicates makes the problem solvable? Then every predicate should have the same evaluation?
number of blocks (5 for Easy, 6 for Medium, and 7 for Hard)
7 blocks seems incredibly hard
spatial reasoning capabilities of VLMs
Do they exist?
as illustrated in Fig. 7
Isn't this the same as the original model?
This implies that generating I and G is more challenging than O
They also depend on a correct O, so this should be expected?
We found from the outputs that ViLaIn tends to omit some propositions in this domain, making the PDs invalid
Any specific ones?
recall
Why only recall?
For the Hanoi domain, L is identical through all problems.
This NL instruction is actually interesting, and perhaps quite complex for the model to handle. But if the instruction is the same across all tasks, it can essentially copy the prompt examples and use that PDDL.
The three pegs are named by the number from left to right (e.g., peg1, peg2, and peg3)
What if the pegs would not be named in order? Would that reduce performance? Now the model can rely on the object name index to determine the order.
All four PDDL problem descriptions represent the planning problem of stacking one block onto another
However, not all of the goals reflect the goal state shown in the image; only the first one does.
Objects can be picked up, dropped and moved around by the agent.
How does the agent know if it is holding an object?
we call a PDDL Planner
What is the domain and instance in this case?
In real life, the environment is often described with natural language texts.
No it is not. The environment is observed through sensory readings e.g. vision or touch.
Another limitation stems from the inherent randomness within LLMs, c
What randomness?
The decoder D reconstructs an image x̂_0 = D(z_0), from which the policy π predicts the action a_0.
Why is not the latent embedding z used as the input to the policy?
setting a new state of the art for methods without lookahead search
Isn't the world model used to do search?
Somewhat surprisingly, the lowest scores in Blocksworld are associated with BlockAmbiguity and KStacksColor; these two problems require the LLM to associate objects based on their color and we had a priori expected the LLM to be capable of such associations and perform well on this task.
This kind of makes sense, because the colors are not explicitly modelled but are part of the object's name, e.g. "red_block_1" rather than "red(block_1)". The latter would be a more natural way to express colors, since a color is a property of an object.
fan-in of the layer.
What is "fan-in"?
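"Fan-in" is the number of input connections feeding a unit of the layer (for a dense layer with weight matrix of shape (fan_in, fan_out), it is the first dimension). Variance-scaling initializers divide by it so the pre-activation variance stays roughly constant across layers. A minimal sketch of a He/Kaiming-style fan-in initializer (an assumed, common scheme, not necessarily the paper's exact one):

```python
import numpy as np

def he_init(fan_in, fan_out, seed=0):
    """Sample weights with std sqrt(2 / fan_in) (He-style normal init)."""
    rng = np.random.default_rng(seed)
    std = np.sqrt(2.0 / fan_in)
    return rng.normal(0.0, std, size=(fan_in, fan_out))

W = he_init(512, 256)  # fan-in = 512 inputs per output unit
```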
q_{s_t}, q_{a_t} are positive over S and A respectively
What does this mean?
z_e(x)
What is the dimensions of this?
3 × 3 blocks
What does "3 x 3 blocks" mean?
Our proposal distribution q(z = k|x) is deterministic, and by defining a simple uniform prior over z we obtain a KL divergence constant and equal to log K.
What?
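The claim follows directly from the KL definition: if q puts all its mass on a single code k* and the prior is uniform, KL(q‖p) = Σ_k q_k log(q_k/p_k) = 1·log(1/(1/K)) = log K, independent of x. A numerical check (K and the chosen code are arbitrary):

```python
import math

K = 512
q = [0.0] * K
q[7] = 1.0            # deterministic proposal: all mass on one codebook entry
p = [1.0 / K] * K     # uniform prior over the K codes

# KL(q || p), skipping zero-mass terms (0 * log 0 -> 0)
kl = sum(qk * math.log(qk / pk) for qk, pk in zip(q, p) if qk > 0)
assert math.isclose(kl, math.log(K))
```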
forces the autoencoder to focus on object positions
This is still unclear to me
This is not directly feasible with conventional policy gradient formulations
Why not?
Â_t is an estimator of the advantage function at timestep t
How is this calculated?
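PPO-style papers usually instantiate this estimator with Generalized Advantage Estimation (GAE): Â_t = Σ_l (γλ)^l δ_{t+l}, where δ_t = r_t + γv(s_{t+1}) − v(s_t). A sketch assuming that choice (the excerpt's exact estimator may differ):

```python
def gae(rewards, values, gamma=0.99, lam=0.95):
    """Generalized Advantage Estimation over one trajectory.

    values has one more entry than rewards (bootstrap value at the end).
    """
    T = len(rewards)
    advantages = [0.0] * T
    running = 0.0
    for t in reversed(range(T)):
        # one-step TD error
        delta = rewards[t] + gamma * values[t + 1] - values[t]
        # discounted sum of future TD errors
        running = delta + gamma * lam * running
        advantages[t] = running
    return advantages
```

With lam=0 this reduces to the one-step TD error; with lam=1 it is the full discounted sum of TD errors.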
quality
How is the quality measured?
The difference is that we only record objects that are either action arguments or in contact with them.
How do you know, when the model is not learned yet?
A “world” frame serves as a default frame of reference for every object in the environment
Is this what the dataset consists of? Sequences of world frames?
However, these approaches assume high-level actions to be provided as input.
No they don't; at least not Asai et al. (2022).
A is an uncountably infinite set of primitive deterministic actions.
So actions are continuous and not discrete?
One can view the noise vector z in such a GAN as a feature vector, containing some representation of the transition to o′ from o.
How can it contain a representation of the transition if it is just noise?
We define action predicates P_A = {left(1), left(2), right(1), right(2), jump(1), idle(1), ...} and state predicates P_S = {type, closeby, ...}
How did they come up with these?
This dataset will contain a set of tuples, (s, a, s′), of states, actions, and next states
What is a state?
More recently, Chen et al. (2022) explored a variant of DreamerV2 where a Transformer replaces the recurrent network in the RSSM
Then what is the novelty in this paper?
straight-through estimator
?
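The straight-through estimator handles non-differentiable operations (rounding, argmax, quantization): the forward pass applies the operation, while the backward pass pretends its Jacobian is the identity. In autodiff frameworks this is commonly written as y = x + stop_gradient(round(x) − x), so the value equals round(x) but gradients flow as if y = x. A framework-free sketch (illustrative only):

```python
def straight_through_round(x):
    """Forward: quantize x. Backward: treat dy/dx as 1 instead of the
    true derivative of round(), which is 0 almost everywhere and would
    block all gradient flow."""
    value = round(x)   # non-differentiable forward pass
    grad = 1.0         # surrogate gradient used in the backward pass
    return value, grad

y, dy_dx = straight_through_round(2.7)
```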
object state vectors
Where do the object state vectors come from?
DeepMind Lab dataset
How is this dataset structured? There is no "fixed" dataset in the DeepMind Lab repo
showing good variability over the irrelevant factors
Not really. For the "white suitcase" scene it only differs in wall colors and floor colors, but the "black and white" representation of the scene is the same. Essentially there could be a way larger range of scenes where a white suitcase appears.
blue wall
Is "blue wall" a compositional concept or an atomic one?
small, round, red
Are these "features" hand-crafted?
few example images of an apple paired with the symbol “apple”
This is not unsupervised data
unlabeled set of image pairs
It's kind of labelled because they know that an action has taken place between the images, just not what action it is.
v_ψ(s_τ)
What is the difference between this and \(V_\lambda\)?
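The notation matches Dreamer-style actor-critic learning; under that (assumed) reading, v_ψ(s_τ) is the learned value network and V_λ is the multi-step λ-return target it is regressed toward:

```latex
% Assumed Dreamer-style reading (the excerpt's paper may differ):
% v_\psi is the value model; V_\lambda is the imagined multi-step target.
\min_{\psi} \; \mathbb{E}\!\left[ \tfrac{1}{2}
  \big( v_\psi(s_\tau) - V_\lambda(s_\tau) \big)^2 \right]
```

That is, v_ψ is the function approximator, while V_λ is a (stop-gradient) regression target built from imagined rewards plus v_ψ bootstraps.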
dataset of past experience
Where does this data come from? Random exploration?
finite imagination horizon
What's the alternative, infinite imagination horizon? Seems impossible
blocks1-5 (arm, 5 blocks)
Why only up to 5 blocks?
in many cases optimally
What does it mean to solve them optimally?
In one case, the input data corresponds to one or more state graphs G_i assumed to originate from hidden planning instances P_i = ⟨D, I_i⟩ that need to be uncovered
Isn't the domain needed in order to generate the state graph?
The latter approaches are less likely to generate crisp representations due to the dependence on images
Why?
a latent policy via behavior cloning
How is this done?
π(o_t)
How is this value known?
Moveover
Moreover?
is
Remove
before and after the action of interest is taken
Does every next observation depend on an action, or can the environment change "by itself"?
which predicts which action a_t was taken by the agent between consecutive observations o_t and o_{t+1}
How is this trained when the action is not known?
prior distribution over programs likely to solve tasks in the domain
What does this prior distribution mean? The probability of the program to solve any task in the domain? Is there even any programs that would solve multiple tasks?
positive probability
What does it mean that a program has a "positive probability" of solving a task?
Search for programs
Search for programs where? How are these programs created?
find best program
How is best defined?
We selected the tasks on which Tassa et al. (2018) report non-zero performance from image inputs
Why?
Finally, we call a PDDL Planner as the deterministic solver to obtain A, a plan to accomplish the goal CSL under the predefined scenario.
With what PDDL domain?
Task descriptions are constructed using PDDL and symbolic plans are generated using the FAST-DOWNWARD planner
To generate a symbolic plan, an initial state (problem file) needs to be given. What does this look like? Are there only three problem files (one for each problem) representing some "general" state? Shouldn't the initial plan depend on the initial state?
Our set-up automatically parses LLM-generated language into a program using our synthetic grammar
How?
Also, how do they handle cases where the parser generates incorrect PDDL? Wouldn't that give the LLM-as-planner a worse score than it actually should have?
The P+S model outputs executable PDDL actions
How do you make sure of this?
Even with high Exec, some task GCR are low, because some tasks have multiple appropriate goal states, but we only evaluate against a single “true” goal
This seems like an unfair way to evaluate the model
SR is the fraction of executions that achieved all task-relevant goal-conditions
How are the goal-conditions specified and where do they come from?
We provide the available objects in the environment as a list of strings
How are these objects retrieved? Automatically or manually?
“The bowl can also be a container to fill water”, will be added to the task planner.
Where does this come from? The LLM? Template?
perfectly match gold visual semantic plans using only the text directives as input
Where do they say how they provide the state representation to the model?
Generated strings from all models are post-processed for common errors in sequence-to-sequence models, including token doubling, completing missing bigrams (e.g. “pick <arg1>” → “pick up <arg1>”), and heuristics for adding missing argument tags
Probably won't generalize well to new domains
The ALFRED dataset contains 6,574 gold command sequences
Didn't the "Understanding Language in Context" paper mention that it was around 8k data samples?
Lastly, the goal predicates for each problem were generated from the "PDDL parameters" field of every data sample.
What is this field?
Thus, we have created a PDDL domain file using our knowledge of the objects and actions in the ALFRED world and a PDDL problem file for each sample
I assume that the domain file is created manually, but are the problem files also created by hand? If so, that seems like a lot of work: the dataset has 8,055 visual samples, so the same number of problem files would need to be hand-coded.
Since in our task we ignore the vision part of the data, we might encounter some duplicates between our datasets
How do they get the scene representation from the visual data? Is this included in the ALFRED dataset?
ALFWorld uses PDDL - Planning Domain Definition Language (McDermott et al., 1998) to describe each scene from ALFRED and to construct an equivalent text game using the TextWorld engine.
How is the PDDL created?