
Implement on-Spot batch VLM predicate evaluation pipeline #285

Open · wants to merge 48 commits into master

Conversation

@lf-zhao (Collaborator) commented May 8, 2024

Implement a new pipeline for VLM predicate evaluation (a VLM-based classifier).
(Note: the previous PR #284 hasn't been merged; its changes carry over here. That PR was mainly for record keeping; this one is what actually runs on Spot.)

Pipeline:

  • Initialize VLM predicates in the env (currently, e.g., On and Inside).
  • In the Spot env reset, Spot builds the initial observation and needs to see all objects. We take all VLM predicates, compute every GroundAtom query the VLM classifiers must evaluate, and save the results as a dictionary.
  • In the Spot env step, Spot takes one step (at the skill level) and observes a set of new images; we update the VLM ground atoms of all VLM predicates, but only evaluate the VLMGroundAtoms whose objects are currently visible.
  • The evaluation in reset and step is batched: one batch contains all relevant VLM ground atoms plus a collection of Spot camera images (e.g., from 6 cameras), and the VLM returns a list of answers to parse (see the sketch after this list).
  • In the Spot perceiver (obs -> object-centric state), save the VLM predicates and ground atoms into the state.
  • In abstract (object-centric state -> ground atoms / symbolic state), VLM ground atoms are read directly from the state.
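
Roughly, the batched evaluation in reset/step works as sketched below. This is a minimal illustration only; the names (`VLMQueryAtom`, `query_vlm`, `evaluate_atoms_in_batch`) are hypothetical stand-ins, not the exact API added in this PR.

```python
# Illustrative sketch of batched VLM ground-atom evaluation; the real
# classes and function signatures in this PR may differ.
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class VLMQueryAtom:
    """One (predicate, objects) pair to check against the camera images."""
    predicate_name: str
    object_names: tuple


def evaluate_atoms_in_batch(
    atoms: Sequence[VLMQueryAtom],
    camera_images: Sequence[bytes],
    query_vlm: Callable[[str, Sequence[bytes]], List[str]],
) -> Dict[VLMQueryAtom, bool]:
    """Evaluate all visible VLM ground atoms with a single VLM call.

    The prompt lists every atom once; the VLM is expected to return one
    answer per atom, which is parsed back into a truth value.
    """
    prompt = "\n".join(
        f"{i}. Is {a.predicate_name}({', '.join(a.object_names)}) true?"
        for i, a in enumerate(atoms)
    )
    # One query covers all atoms and all camera views (e.g., 6 Spot cameras).
    answers = query_vlm(prompt, camera_images)
    return {
        atom: answer.strip().lower().startswith("yes")
        for atom, answer in zip(atoms, answers)
    }
```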

Command:
python predicators/main.py --spot_robot_ip 192.168.80.3 --spot_graph_nav_map b45-621 --env lis_spot_block_floor_env --approach spot_wrapper[oracle] --bilevel_plan_without_sim True --seed 0 --perceiver spot_perceiver --spot_vlm_eval_predicate True --vlm_eval_verbose True --num_train_tasks 0 --num_test_tasks 1

Modifications:

  • Structs: add VLMPredicate and VLMGroundAtom classes (see the sketch after this list). The partial perception state and Observation classes now carry additional information: visible objects, images, ground atoms from the previous state, and more.
  • Model: add an OpenAI VLM class, along with some demo VLM predicates and their prompts, e.g., for On, Inside, and Blocking.
  • VLM predicate evaluation: add a function that evaluates all visible ground atoms in one batched query.
  • Spot VLM object perception: generate ground atoms over the visible objects for all VLM predicates.
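
For reference, a minimal sketch of what the new structs might look like; the actual VLMPredicate / VLMGroundAtom classes in predicators/structs.py may differ, and the prompts below are illustrative only.

```python
# Hypothetical sketch of the VLM predicate structs; not the exact code in this PR.
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class VLMPredicate:
    """A predicate whose truth value is decided by a VLM query rather
    than a hand-written classifier; `prompt` describes what to check."""
    name: str
    arity: int
    prompt: str


@dataclass(frozen=True)
class VLMGroundAtom:
    """A VLM predicate applied to concrete objects, plus the truth value
    from the most recent VLM evaluation (None before any query)."""
    predicate: VLMPredicate
    objects: Tuple[str, ...]
    value: Optional[bool] = None

    def question(self) -> str:
        """Render the natural-language question sent to the VLM."""
        return self.predicate.prompt.format(*self.objects)


# Demo predicates in the spirit of this PR (prompts are made up here).
ON = VLMPredicate("On", 2, "Is {0} resting on top of {1}?")
INSIDE = VLMPredicate("Inside", 2, "Is {0} inside {1}?")
BLOCKING = VLMPredicate("Blocking", 2, "Is {0} blocking access to {1}?")
```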

…; we need to figure out a better way to sync later!
…e-eval

# Conflicts:
#	predicators/pretrained_model_interface.py
#	setup.py