| 0 | Questionperception | What are objects to the front of the ego car? |
| GT Answer | There are many trucks, one car, and three pedestrians in front of the ego car. |
| InternVL2-2B | To the front of the ego car, there are several objects and features visible in the image:
1. **Pedestrian Crossing**: There is a pedestrian crossing sign on the left side of the road.
2. **Road Markings**: The road has white lane markings, including a pedestrian crossing line.
3. **Traffic Sign**: There is a traffic sign indicating "Do Not Enter" on the right side of the road.
4. **Construction Fence**: A chain-link fence is visible on the right side of the road, indicating construction or maintenance work.
5. **Parking Area**: There is a parking area with several cars parked in front of the building.
6. **Building**: The building in the background has large windows and a sign that reads "Harbor Place."
7. **Pedestrian**: A pedestrian is walking on the sidewalk in front of the building.
8. **Road Signs**: There are multiple road signs visible, including a "Do Not Enter" sign and a "S" sign on the right side of the road.
These objects and features are essential for navigation and understanding the environment in front of the ego car. |
| Mini-InternVL2-2B-DA-DriveLM | There are two pedestrians and one car in front of the ego car. |
| llama_adapter_v2_multimodal7b | There are no objects to the front of the ego car in the image. |
| InternVL2-2b-LoRA-25k-drivelm | There is one car, one pedestrian, and one traffic cone to the front of the ego car. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | There are many traffic cones in front of the ego car. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | There are many cars, one pedestrian, and one truck to the front of the ego car. |
| InternVL2-2b-LoRA-300k-drivelm | There are many cars, one truck, and one pedestrian in front of the ego car. |
| llava-v1.6-mistral-7b-hf | The image shows a series of four photos, each depicting a different scene. In the first photo, there is a car parked on the side of the road. In the second photo, there is a car parked in front of a building. In the third photo, there is a car driving on a road with a construction zone ahead. In the fourth photo, there is a car parked behind a construction barrier. |
| 1 | Questionperception | What is the moving status of object <c1,CAM_FRONT_RIGHT,1014.2,516.7>? Please select the correct answer from the following options: A. Turn left. B. Going ahead. C. Drive backward. D. Stopped. |
| GT Answer | B |
| InternVL2-2B | The moving status of object <c1,CAM_FRONT_RIGHT,1014.2,516.7> is D. Stopped. |
| Mini-InternVL2-2B-DA-DriveLM | B |
| llama_adapter_v2_multimodal7b | The moving status of object <c1,CAM_FRONT_RIGHT,1014.2,516.7> is that it is stopped. |
| InternVL2-2b-LoRA-25k-drivelm | B |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | B |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | B |
| InternVL2-2b-LoRA-300k-drivelm | D. Stopped. |
| llava-v1.6-mistral-7b-hf | D. Stopped |
| 2 | Questionprediction | What object should the ego vehicle notice first when the ego vehicle is getting to the next possible location? What is the state of the object that is first noticed by the ego vehicle and what action should the ego vehicle take? What object should the ego vehicle notice second when the ego vehicle is getting to the next possible location? What is the state of the object perceived by the ego vehicle as second and what action should the ego vehicle take? What object should the ego vehicle notice third? What is the state of the object perceived by the ego vehicle as third and what action should the ego vehicle take? |
| GT Answer | Firstly, notice that <c5,CAM_FRONT,627.1,366.6>. The object is a traffic sign, so the ego vehicle should keep going ahead at the same speed. Secondly, notice that <c2,CAM_FRONT,1019.2,485.8>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. Thirdly, notice that <c1,CAM_FRONT_RIGHT,1014.2,516.7>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. |
| InternVL2-2B | When the ego vehicle is getting to the next possible location, it should first notice the "Do Not Enter" sign on the right side of the road. The state of the object perceived by the ego vehicle as the first object to be noticed is the "Do Not Enter" sign. The action the ego vehicle should take is to proceed straight ahead.
When the ego vehicle is getting to the next possible location, it should notice the "Do Not Enter" sign on the right side of the road. The state of the object perceived by the ego vehicle as the second object to be noticed is the "Do Not Enter" sign. The action the ego vehicle should take is to proceed straight ahead.
When the ego vehicle is getting to the next possible location, it should notice the "Do Not Enter" sign on the right side of the road. The state of the object perceived by the ego vehicle as the third object to be noticed is the "Do Not Enter" sign. The action the ego vehicle should take is to proceed straight ahead. |
| Mini-InternVL2-2B-DA-DriveLM | Firstly, notice that the object in front of the ego vehicle is going ahead. The ego vehicle should continue going ahead at the same speed. Secondly, notice that the object in front of the ego vehicle is going ahead. The ego vehicle should continue going ahead at the same speed. Thirdly, notice that the object in front of the ego vehicle is going ahead. The ego vehicle should continue going ahead at the same speed. |
| llama_adapter_v2_multimodal7b | When the ego vehicle is getting to the next possible location, it should first notice the traffic light, which is in the state of "red." The ego vehicle should then take the appropriate action, which is to stop at the red light. Next, the ego vehicle should notice the fire hydrant, which is in the state of "closed." The ego vehicle should then take the appropriate action, which is to proceed cautiously, as it may be necessary to avoid any obstacles or hazards in the area. Finally, the ego vehicle should notice the street sign, which is in the state of "front." The ego vehicle should then take the appropriate action, which is to continue driving in the direction indicated by the street sign. |
| InternVL2-2b-LoRA-25k-drivelm | Firstly, notice that <c3,CAM_FRONT,1080.8,480.8>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. Secondly, notice that <c2,CAM_FRONT,1080.8,480.8>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. Thirdly, notice that <c1,CAM_FRONT,1080.8,480.8>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | Firstly, notice <c5,CAM_FRONT,627.1,366.6>. The object is a traffic sign, so the ego vehicle should remain stationary. Secondly, notice <c2,CAM_FRONT,1019.2,485.8>. The object is going ahead, so the ego vehicle should remain stationary. Thirdly, notice <c3,CAM_FRONT_RIGHT,495.1,490.0>. The object is going ahead, so the ego vehicle should remain stationary. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | Firstly, notice <c3,CAM_FRONT,1080.0,525.0>. The object is a traffic sign, so the ego vehicle should continue ahead at the same speed. Secondly, notice <c1,CAM_FRONT,1080.0,525.0>. The object is a traffic sign, so the ego vehicle should continue ahead at the same speed. Thirdly, notice <c2,CAM_FRONT,1080.0,525.0>. The object is a traffic sign, so the ego vehicle should continue ahead at the same speed. |
| InternVL2-2b-LoRA-300k-drivelm | Firstly, notice <c3,CAM_FRONT,1080.0,490.8>. It is a traffic sign, so the ego vehicle should slow down and proceed. Secondly, notice <c2,CAM_FRONT,1080.0,490.8>. It is a traffic sign, so the ego vehicle should slow down and proceed. Thirdly, notice <c1,CAM_BACK,1035.0,625.0>. It is a traffic sign, so the ego vehicle should slow down and proceed. |
| llava-v1.6-mistral-7b-hf | The image you've provided appears to be a collage of different street scenes, each with various objects that the ego vehicle should notice when approaching the next possible location. Here's a breakdown of what the ego vehicle should notice and what actions it should take:
1. First object: The "CAM_FRONT_LEFT" sign. The state of this object is that it is a sign indicating a camera is in front of the vehicle. The ego vehicle should be aware of the camera's presence and adjust its speed and behavior accordingly to ensure compliance with traffic regulations.
2. Second object: The "CAM_BACK_RIGHT" sign. The state of this object is that it is a sign indicating a camera is behind the vehicle. The ego vehicle should be aware of the camera's presence and adjust its speed and behavior accordingly to ensure compliance with traffic regulations.
3. Third object: The construction area. The state of this object is that it is a construction zone with barriers and possibly workers. The ego vehicle should slow down, be prepared to stop if necessary, and follow any instructions given by construction personnel or traffic control devices.
4. Fourth object: The "CAM_FRONT_RIGHT" sign. The state of this object is that it is a sign indicating a camera is in front of the vehicle. The ego vehicle should be aware of the camera's presence and adjust its speed and behavior accordingly to ensure compliance with traffic regulations.
5. Fifth object: The "CAM_BACK_LEFT" sign. The state of this object is that it is a sign indicating a camera is behind the vehicle. The ego vehicle should be aware of the camera's presence and adjust its speed and behavior accordingly to ensure compliance with traffic regulations.
6. Sixth object: The "CAM_FRONT_LEFT" sign. The state of this object is that it is a sign indicating a camera is in front of the vehicle. The ego vehicle should be aware of the camera's presence and adjust its speed and behavior accordingly to ensure compliance with traffic regulations.
7. Seventh object: The "CAM_BACK_RIGHT" sign. The state of this object is that it is a sign indicating a camera is behind the vehicle. The ego vehicle should be aware of the camera's presence and adjust its speed and behavior accordingly to ensure compliance with traffic regulations.
8. Eighth object: The "CAM_FRONT_LEFT" sign. |
| 3 | Questionprediction | Is <c1,CAM_FRONT_RIGHT,1014.2,516.7> a traffic sign or a road barrier? |
| GT Answer | No. |
| InternVL2-2B | To determine if the coordinates provided (c1, CAM_FRONT_RIGHT, 1014.2, 516.7) are a traffic sign or a road barrier, let's analyze the image in detail:
1. **Location and Orientation**:
- The coordinates are given in a format that typically represents a point in a 2D space, with the x-coordinate (c1) and y-coordinate (CAM_FRONT_RIGHT) provided.
- The coordinates are within the image, and the coordinates are relative to the image's dimensions.
2. **Visual Inspection**:
- **Traffic Sign**: A traffic sign typically has a distinct shape, color, and text that clearly indicates a specific rule or regulation. In this image, there is no such distinct shape or text.
- **Road Barrier**: A road barrier is typically a physical structure that separates two roads or areas, often with a distinct shape and color. In this image, there is no visible barrier structure.
3. **Contextual Analysis**:
- The coordinates are located in the middle of the image, near the intersection of two roads.
- The coordinates are not within the boundaries of any visible road or intersection, which suggests they are not part of the road itself.
4. **Conclusion**:
- Given the coordinates are within the image and not within the boundaries of any visible road or intersection, it is reasonable to conclude that these coordinates represent a point within the image, likely a point of interest or a point of interest within the image itself, rather than a traffic sign or a road barrier.
In conclusion, the coordinates provided (c1, CAM_FRONT_RIGHT, 1014.2, 516.7) are not a traffic sign but rather a point of interest within the image. |
| Mini-InternVL2-2B-DA-DriveLM | No. |
| llama_adapter_v2_multimodal7b | The image does not provide enough information to confidently determine whether the <c1,CAM_FRONT_RIGHT,1014.2,516.7> is a traffic sign or a road barrier. It could be either, depending on the context and the specific location within the image. |
| InternVL2-2b-LoRA-25k-drivelm | No. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | No. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | No. |
| InternVL2-2b-LoRA-300k-drivelm | No. |
| llava-v1.6-mistral-7b-hf | The image you've provided appears to show a traffic sign rather than a road barrier. The sign is labeled "CAM_FRONT_RIGHT," which suggests it is a camera monitoring traffic from the front right side of the road. This type of sign is typically used to indicate the presence of a traffic camera for enforcement purposes. |
| 4 | Questionplanning | What actions could the ego vehicle take based on <c1,CAM_FRONT_RIGHT,1014.2,516.7>? Why take this action and what's the probability? |
| GT Answer | The action is to keep going at the same speed. The reason is that there is no safety issue. The probability of this action is high. |
| InternVL2-2B | The ego vehicle, represented by the car in the image, is currently positioned at the intersection of Cam Front Right. The action it can take is to proceed straight through the intersection.
### Reasoning:
1. **Proximity to the Intersection**: The car is directly in front of the intersection, making it a safe and straightforward path to take.
2. **No Obstacles**: There are no visible obstacles or pedestrians in the immediate vicinity that would impede the car's path.
3. **Traffic Rules**: The car is within the right-of-way, as indicated by the "Do Not Enter" sign, and there are no other vehicles or pedestrians blocking the way.
### Probability:
Given the current position and the clear visibility of the intersection, the probability of proceeding straight through the intersection is high. The car is in a safe and unobstructed area, and there are no immediate hazards or restrictions that would prevent it from proceeding. |
| Mini-InternVL2-2B-DA-DriveLM | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| llama_adapter_v2_multimodal7b | Based on the given coordinate values, the ego vehicle is at the intersection of a one-way street and a two-way street. The vehicle is facing the front right direction, and the coordinates indicate that it is at a position of 1014.2 and 516.7. The vehicle could take several actions, such as:1. Continue straight ahead on the one-way street, as it is currently on the correct side of the road.2. Turn right onto the two-way street, as it is facing the front right direction and the coordinates indicate that it is at the intersection.3. Turn left onto the one-way street, as it is currently on the correct side of the road.The probability of the vehicle taking any of these actions depends on the vehicle's driver's intentions and the traffic rules in the area. If the vehicle is following traffic rules and the driver intends to continue straight ahead, the most likely action would be to continue straight ahead on the one-way street. If the driver intends to turn right onto the two-way street, the most likely action would be to turn right. If the driver intends to turn left onto the one-way street, the most likely action would be to turn left. |
| InternVL2-2b-LoRA-25k-drivelm | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | The action is to remain stationary. The reason for this action is that the object has little reference value for the ego vehicle, and the probability is high. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| InternVL2-2b-LoRA-300k-drivelm | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| llava-v1.6-mistral-7b-hf | The image you've provided appears to be a sequence of images showing a vehicle's perspective at different points in time, likely from a camera mounted on the front right of the vehicle. The sequence seems to be capturing the vehicle's movement through an intersection.
Based on the image <c1,CAM_FRONT_RIGHT,1014.2,516.7>, the ego vehicle is likely to be in the process of making a right turn. The vehicle is positioned at the intersection, and the camera angle suggests that the driver is looking towards the right side of the road.
The probability of the vehicle making a right turn is high, as the vehicle is already at the intersection and the driver's line of sight is directed towards the right. The driver would typically look in the direction they intend to turn to ensure it's safe to do so.
The sequence of images suggests that the vehicle has already started its turn, as the position of the vehicle and the camera angle indicate that the vehicle is moving towards the right side of the road. The driver would need to ensure that the intersection is clear of oncoming traffic and pedestrians before completing the turn. |
| 5 | Questionplanning | What actions taken by the ego vehicle can lead to a collision with <c2,CAM_FRONT,1019.2,485.8>? |
| GT Answer | Accelerating and going straight. |
| InternVL2-2B | The ego vehicle is approaching a pedestrian crossing, and there are several actions that can lead to a collision with the vehicle in front of it:
1. **Driving too fast**: If the ego vehicle is driving at a high speed, it may not have enough time to react to the pedestrian crossing and could potentially collide with the vehicle in front.
2. **Not paying attention**: The ego vehicle may not be paying attention to the pedestrian crossing, failing to notice the pedestrian's presence and the need to stop.
3. **Not stopping**: The ego vehicle may not have stopped at the pedestrian crossing, failing to yield to the pedestrian's right-of-way.
4. **Not signaling**: The ego vehicle may not have activated its turn signal or braking signals, failing to alert other drivers of its intention to stop.
5. **Driving recklessly**: The ego vehicle may be driving recklessly, such as speeding or swerving, which could lead to a collision with the vehicle in front.
6. **Not following traffic rules**: The ego vehicle may not be following traffic rules, such as not stopping at a pedestrian crossing or not yielding to the pedestrian, leading to a collision.
7. **Not being aware of the pedestrian**: The ego vehicle may not be aware of the pedestrian's presence, failing to anticipate their movement and react accordingly.
8. **Not being prepared for the pedestrian crossing**: The ego vehicle may not be prepared for the pedestrian crossing, failing to slow down or stop in time to avoid a collision.
9. **Not being prepared for the pedestrian's actions**: The pedestrian may be crossing the road at an unexpected time or in a way that the ego vehicle is not prepared for, leading to a collision.
10. **Not being prepared for the pedestrian's actions**: The pedestrian may be crossing the road in a way that the ego vehicle is not prepared for, such as crossing in front of the ego vehicle or crossing in a way that requires the ego vehicle to stop.
In summary, the ego vehicle must be aware of the pedestrian crossing, slow down and stop, and signal to yield to the pedestrian to avoid a collision. |
| Mini-InternVL2-2B-DA-DriveLM | Accelerating and going straight. |
| llama_adapter_v2_multimodal7b | The ego vehicle is driving down the street, and it is approaching a red traffic light at an intersection. If the ego vehicle continues to drive forward without stopping, it may collide with the red traffic light. This can lead to accidents, damage to the vehicle, and potential injuries to the occupants or other road users. It is crucial for drivers to obey traffic signals and follow traffic rules to ensure road safety for everyone. |
| InternVL2-2b-LoRA-25k-drivelm | Accelerating and going straight. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | Accelerating and going straight. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | Accelerating and going straight. |
| InternVL2-2b-LoRA-300k-drivelm | Accelerating and going straight. |
| llava-v1.6-mistral-7b-hf | The image you've provided shows a sequence of actions that can lead to a collision with the vehicle labeled as "CAM_FRONT" in the image. Here's a breakdown of the actions shown:
1. **CAM_BACK_LEFT**: The vehicle is turning left at an intersection.
2. **CAM_FRONT_LEFT**: The vehicle is now in the front view of the camera, indicating that it has turned left and is now in the path of the vehicle labeled "CAM_FRONT."
3. **CAM_BACK_RIGHT**: The vehicle is now turning right at the same intersection.
4. **CAM_FRONT_RIGHT**: The vehicle is now in the front view of the camera from the right side, indicating that it has turned right and is now in the path of the vehicle labeled "CAM_FRONT."
5. **CAM_BACK_LEFT**: The vehicle is again turning left at the intersection.
6. **CAM_FRONT_LEFT**: The vehicle is now in the front view of the camera from the left side, indicating that it has turned left and is now in the path of the vehicle labeled "CAM_FRONT."
7. **CAM_BACK_RIGHT**: The vehicle is now turning right at the intersection.
8. **CAM_FRONT_RIGHT**: The vehicle is now in the front view of the camera from the right side, indicating that it has turned right and is now in the path of the vehicle labeled "CAM_FRONT."
9. **CAM_BACK_LEFT**: The vehicle is again turning left at the intersection.
10. **CAM_FRONT_LEFT**: The vehicle is now in the front view of the camera from the left side, indicating that it has turned left and is now in the path of the vehicle labeled "CAM_FRONT."
11. **CAM_BACK_RIGHT**: The vehicle is now turning right at the intersection.
12. **CAM_FRONT_RIGHT**: The vehicle is now in the front view of the camera from the right side, indicating that it has turned right and is now in the path of the vehicle labeled "CAM_FRONT."
13. **CAM_BACK_LEFT**: The vehicle is again turning left at the intersection.
14. **CAM_FRONT_LEFT**: The vehicle is now in the front view of the |
| 6 | Questionplanning | In this scenario, what are safe actions to take for the ego vehicle? |
| GT Answer | Keep going at the same speed and turn left. |
| InternVL2-2B | Based on the image, the safe actions for the ego vehicle would include maintaining a safe following distance from the vehicle in front of it, as indicated by the "CAM_FRONT" sign. The vehicle should also be prepared to stop if necessary, as suggested by the "CAM_FRONT" sign. Additionally, the vehicle should be cautious of the construction area, as indicated by the "CAM_FRONT" sign, and should not enter the construction zone. The "CAM_FRONT" sign also suggests that the vehicle should be aware of the pedestrian crossing, as indicated by the "CAM_FRONT" sign, and should yield to pedestrians. |
| Mini-InternVL2-2B-DA-DriveLM | Slightly offset to the left, slightly offset to the right, and brake gently to a stop are safe actions to take for the ego vehicle in this scenario. |
| llama_adapter_v2_multimodal7b | In the image, there are multiple cars and trucks on the road, and a traffic light is present. To ensure safety, the ego vehicle should follow the traffic light signals, which are designed to regulate traffic flow and minimize the risk of accidents. The vehicle should also maintain a safe distance from other vehicles, especially larger ones like trucks, to avoid collisions. Additionally, the driver should be attentive to the road conditions, pedestrians, and other potential hazards, and adjust their speed accordingly. By following these safe actions, the ego vehicle can navigate the busy street and minimize the risk of accidents or traffic violations. |
| InternVL2-2b-LoRA-25k-drivelm | Keep going at the same speed, accelerate, and proceed. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | Brake gently to a stop, slightly offset to the left. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | Keep going at the same speed, decelerate gradually without braking. |
| InternVL2-2b-LoRA-300k-drivelm | Keep going at the same speed, decelerate gradually without braking. |
| llava-v1.6-mistral-7b-hf | In the image you've provided, it appears to be a collage of different traffic situations. Here are safe actions to take for the ego vehicle in each scenario:
1. **CAM_BACK_LEFT**:
- The ego vehicle should maintain a safe following distance from the vehicle in front of it.
- It should be prepared to slow down or stop if necessary, as indicated by the traffic light.
- The driver should be aware of the pedestrian crossing the street and yield to them if they are crossing legally.
2. **CAM_FRONT_LEFT**:
- The ego vehicle should maintain a safe speed and be prepared to stop if necessary, as indicated by the traffic light.
- The driver should be aware of the pedestrian crossing the street and yield to them if they are crossing legally.
3. **CAM_BACK_RIGHT**:
- The ego vehicle should maintain a safe following distance from the vehicle in front of it.
- It should be prepared to slow down or stop if necessary, as indicated by the traffic light.
- The driver should be aware of the pedestrian crossing the street and yield to them if they are crossing legally.
4. **CAM_FRONT_RIGHT**:
- The ego vehicle should maintain a safe speed and be prepared to stop if necessary, as indicated by the traffic light.
- The driver should be aware of the pedestrian crossing the street and yield to them if they are crossing legally.
5. **CAM_BACK_LEFT**:
- The ego vehicle should maintain a safe following distance from the vehicle in front of it.
- It should be prepared to slow down or stop if necessary, as indicated by the traffic light.
- The driver should be aware of the pedestrian crossing the street and yield to them if they are crossing legally.
6. **CAM_FRONT_LEFT**:
- The ego vehicle should maintain a safe speed and be prepared to stop if necessary, as indicated by the traffic light.
- The driver should be aware of the pedestrian crossing the street and yield to them if they are crossing legally.
7. **CAM_BACK_RIGHT**:
- The ego vehicle should maintain a safe following distance from the vehicle in front of it.
- It should be prepared to slow down or stop if necessary, as indicated by the traffic light. |
| 7 | Questionbehavior | Predict the behavior of the ego vehicle. Please select the correct answer from the following options: A. The ego vehicle is going straight. The ego vehicle is driving fast. B. The ego vehicle is going straight. The ego vehicle is driving slowly. C. The ego vehicle is slightly steering to the right. The ego vehicle is driving slowly. D. The ego vehicle is slightly steering to the left. The ego vehicle is driving slowly. |
| GT Answer | D |
| InternVL2-2B | Based on the image, the ego vehicle is driving on a one-way street. The sign indicating "CAM_FRONT_LEFT" suggests that the vehicle is on the left side of the road. The presence of construction barriers and the direction of the traffic light (indicating "CAM_FRONT") further support that the vehicle is on a one-way street.
Given the options:
A. The ego vehicle is going straight. The ego vehicle is driving fast.
B. The ego vehicle is going straight. The ego vehicle is driving slowly.
C. The ego vehicle is slightly steering to the right. The ego vehicle is driving slowly.
D. The ego vehicle is slightly steering to the left. The ego vehicle is driving slowly.
The correct answer is:
C. The ego vehicle is slightly steering to the right. The ego vehicle is driving slowly. |
| Mini-InternVL2-2B-DA-DriveLM | B |
| llama_adapter_v2_multimodal7b | The ego vehicle is driving slowly, as it is seen in the image. |
| InternVL2-2b-LoRA-25k-drivelm | A |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | B |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | B |
| InternVL2-2b-LoRA-300k-drivelm | C. The ego vehicle is slightly steering to the right. The ego vehicle is driving slowly. |
| llava-v1.6-mistral-7b-hf | A. The ego vehicle is going straight. The ego vehicle is driving slowly. |