| 0 | Questionperception | What are the important objects in the current scene? Those objects will be considered for the future reasoning and driving decision. |
| GT Answer | There is a big truck to the front right of the ego vehicle, a black sedan to the back of the ego vehicle, and a yellow truck to the front of the ego vehicle. The IDs of these objects are <c1,CAM_FRONT_RIGHT,1016.6,416.5>, <c2,CAM_BACK,675.8,446.7>, and <c3,CAM_FRONT,694.2,406.7>. |
| InternVL2-2B | In the current scene, the important objects include the large truck with the "Truly Freshier Salmon" advertisement on its trailer, the yellow truck in the background, and the industrial building with multiple large doors. These objects are crucial for the future reasoning and driving decision as they provide context and potential routes for the truck to follow. |
| Mini-InternVL2-2B-DA-DriveLM | There is a white truck to the front of the ego vehicle, a white truck to the front of the ego vehicle, a white truck to the front of the ego vehicle, a black sedan to the back of the ego vehicle, and a yellow truck to the front of the ego vehicle. The IDs of these objects are 1, 2, 3, 4, and 5. |
| llama_adapter_v2_multimodal7b | In the current scene, the important objects to consider for future reasoning and driving decisions are the trucks and the fence. The trucks are parked in a parking lot, and the fence is located in front of the trucks. The driver should be aware of the trucks' positions and movements to avoid collisions or accidents. Additionally, the fence could be a potential obstacle or boundary that the driver should be cautious of when navigating the parking lot. |
| InternVL2-2b-LoRA-25k-drivelm | There is a white truck to the front of the ego vehicle, a white truck to the back of the ego vehicle, a blue truck to the front right of the ego vehicle, and a blue truck to the front of the ego vehicle. The IDs of these objects are <c1,CAM_FRONT,1080.0,495.8>, <c2,CAM_BACK,1080.0,495.8>, <c3,CAM_FRONT_RIGHT,1080.0,495.8>, and <c4,CAM_FRONT,1080.0,495.8>. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | There is a white truck to the front right of the ego vehicle, a black sedan to the back of the ego vehicle, and a yellow truck to the front of the ego vehicle. The IDs of these objects are <c1,CAM_FRONT_RIGHT,1016.6,416.5>, <c2,CAM_BACK,675.8,446.7>, and <c3,CAM_FRONT,694.2,406.7>. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | There is a white truck to the front of the ego vehicle, a white truck to the front of the ego vehicle, a yellow truck to the front of the ego vehicle, a red truck to the front of the ego vehicle, a black sedan to the front of the ego vehicle, and a black sedan to the front of the ego vehicle. The IDs of these objects are <c1,CAM_FRONT,1010.0,495.8>, <c2,CAM_FRONT,1010.0,495.8>, <c3,CAM_FRONT,1010.0,495.8>, <c4,CAM_FRONT,1010.0,495.8>, <c5,CAM_FRONT,1010.0,495.8>, and <c6,CAM_FRONT,1010.0,495.8>. |
| InternVL2-2b-LoRA-300k-drivelm | There is a white truck to the front of the ego vehicle, a white truck to the front of the ego vehicle, a blue truck to the front of the ego vehicle, a blue truck to the back of the ego vehicle, a blue truck to the front of the ego vehicle, and a white truck to the front of the ego vehicle. The IDs of these objects are <c1,CAM_FRONT,1080.0,450.8>, <c2,CAM_FRONT,1080.0,450.8>, <c3,CAM_FRONT,1080.0,450.8>, <c4,CAM_BACK,1080.0,520.0>, <c5,CAM_FRONT,1080.0,450.8>, and <c6,CAM_FRONT,1080.0,450.8>. |
| llava-v1.6-mistral-7b-hf | In the current scene, the important objects to consider for future reasoning and driving decisions include:
1. The large truck on the right side of the image.
2. The smaller truck in the middle of the image.
3. The large truck on the left side of the image.
4. The building in the background.
5. The fence in the foreground.
6. The road itself.
7. The parked cars on the left side of the image.
8. The traffic lights and signs, if any.
These objects will be relevant for understanding the traffic flow, potential obstacles, and the overall layout of the area. |
| 1 | Questionperception | What is the moving status of object <c2,CAM_BACK,675.8,446.7>? Please select the correct answer from the following options: A. Going ahead. B. Drive backward. C. Reverse parking. D. Turn right. |
| GT Answer | A |
| InternVL2-2B | The moving status of object <c2,CAM_BACK,675.8,446.7> is C. Reverse parking. |
| Mini-InternVL2-2B-DA-DriveLM | A |
| llama_adapter_v2_multimodal7b | The moving status of object <c2,CAM_BACK,675.8,446.7> is "Drive backward." |
| InternVL2-2b-LoRA-25k-drivelm | A |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | A |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | A |
| InternVL2-2b-LoRA-300k-drivelm | A. Going ahead. |
| llava-v1.6-mistral-7b-hf | B. Drive backward |
| 2 | Questionprediction | What object should the ego vehicle notice first when the ego vehicle is getting to the next possible location? What is the state of the object that is first noticed by the ego vehicle and what action should the ego vehicle take? What object should the ego vehicle notice second when the ego vehicle is getting to the next possible location? What is the state of the object perceived by the ego vehicle as second and what action should the ego vehicle take? What object should the ego vehicle notice third? What is the state of the object perceived by the ego vehicle as third and what action should the ego vehicle take? |
| GT Answer | Firstly, notice that <c3,CAM_FRONT,694.2,406.7>. The object is stationary, so the ego vehicle should keep going ahead at the same speed. Secondly, notice that <c2,CAM_BACK,675.8,446.7>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. Thirdly, notice that <c1,CAM_FRONT_RIGHT,1016.6,416.5>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. |
| InternVL2-2B | When the ego vehicle is getting to the next possible location, it should notice the truck in the middle of the image. The state of the object perceived by the ego vehicle as the first object is a large truck with a white and red cab. The action the ego vehicle should take is to continue driving straight ahead. The state of the object perceived by the ego vehicle as the second object is a large truck with a blue cab. The action the ego vehicle should take is to continue driving straight ahead. The state of the object perceived by the ego vehicle as the third object is a large truck with a yellow cab. The action the ego vehicle should take is to continue driving straight ahead. |
| Mini-InternVL2-2B-DA-DriveLM | Firstly, notice that the object in front of the ego vehicle is stationary. The ego vehicle should continue ahead at the same speed. Secondly, notice that the object in front of the ego vehicle is stationary. The ego vehicle should continue ahead at the same speed. Thirdly, notice that the object in front of the ego vehicle is stationary. The ego vehicle should continue ahead at the same speed. |
| llama_adapter_v2_multimodal7b | When the ego vehicle is getting to the next possible location, it should first notice the truck parked in front of the building. The ego vehicle should then take the appropriate action, such as slowing down or stopping, to ensure the safety of the truck and the people around it. Next, the ego vehicle should notice the fence and the building, and take appropriate action to avoid any potential obstacles or hazards. Finally, the ego vehicle should notice the traffic light and follow the traffic light signals to ensure safe navigation through the area. |
| InternVL2-2b-LoRA-25k-drivelm | Firstly, notice that <c3,CAM_FRONT,1080.8,525.0>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. Secondly, notice that <c2,CAM_FRONT,1080.8,500.8>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. Thirdly, notice that <c1,CAM_FRONT,1080.8,485.8>. The object is going ahead, so the ego vehicle should keep going ahead at the same speed. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | Firstly, notice that <c3,CAM_FRONT,694.2,406.7>. The object is stationary, so the ego vehicle should continue ahead at the same speed. Secondly, notice that <c1,CAM_FRONT_RIGHT,1016.6,416.5>. The object is stationary, so the ego vehicle should continue ahead at the same speed. Thirdly, notice that <c2,CAM_BACK,675.8,446.7>. The object is moving ahead, so the ego vehicle should continue ahead at the same speed. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | Firstly, notice <c1, CAM_FRONT, 1200.0, 600.0>. The object is approaching, so the ego should slow down. Secondly, notice <c2, CAM_FRONT, 1200.0, 600.0>. The object is approaching, so the ego should slow down. Thirdly, notice <c3, CAM_FRONT, 1200.0, 600.0>. The object is approaching, so the ego should slow down. |
| InternVL2-2b-LoRA-300k-drivelm | Firstly, notice that <c3,CAM_FRONT,1080.0,450.8>. The object is stationary, so the ego vehicle should continue ahead at the same speed. Secondly, notice that <c1,CAM_FRONT,1080.0,450.8>. The object is stationary, so the ego vehicle should continue ahead at the same speed. Thirdly, notice that <c2,CAM_FRONT,1080.0,450.8>. The object is stationary, so the ego vehicle should continue ahead at the same speed. |
| llava-v1.6-mistral-7b-hf | The image you've provided appears to be a collage of different scenes, likely from a surveillance or traffic camera system. The scenes include a variety of vehicles and infrastructure, such as trucks, cars, and buildings.
When the ego vehicle is getting to the next possible location, it should first notice the traffic light. The state of the traffic light is not visible in the image, but the ego vehicle should be prepared to stop or proceed depending on the color of the light.
The second object the ego vehicle should notice is the pedestrian crossing. The state of the pedestrian crossing is not visible in the image, but the ego vehicle should be prepared to yield to pedestrians if they are present and if the pedestrian crossing is activated.
The third object the ego vehicle should notice is the road sign. The state of the road sign is not visible in the image, but the ego vehicle should be prepared to follow the directions indicated by the sign.
In each case, the ego vehicle should be prepared to react appropriately to the state of the object it is approaching. This could involve stopping, slowing down, changing lanes, or following the direction of the road sign. The exact action will depend on the specific state of the object and the rules of the road. |
| 3 | Questionprediction | Is <c1,CAM_FRONT_RIGHT,1016.6,416.5> a traffic sign or a road barrier? |
| GT Answer | No. |
| InternVL2-2B | To determine if the coordinates <c1,CAM_FRONT_RIGHT,1016.6,416.5> represent a traffic sign or a road barrier, let's break down the coordinates step by step:
1. **Coordinates Analysis**:
- **c1**: This is the starting point of the coordinates.
- **CAM_FRONT_RIGHT**: This is the target location.
- **1016.6**: This is the x-coordinate.
- **416.5**: This is the y-coordinate.
2. **Understanding the Coordinates**:
- The coordinates are given in a format that is typical for GPS coordinates, where the first part (c1) is the latitude, the second part (CAM_FRONT_RIGHT) is the longitude, and the third part (1016.6, 416.5) is the altitude.
3. **Interpreting the Coordinates**:
- **Latitude**: The latitude is 1016.6, which is a positive value, indicating it is north of the origin.
- **Longitude**: The longitude is 416.5, which is a positive value, indicating it is east of the origin.
- **Altitude**: The altitude is 0, which means it is at the same level as the ground.
4. **Conclusion**:
- **Coordinates <c1,CAM_FRONT_RIGHT,1016.6,416.5>** represent a point on the ground, not a traffic sign or a road barrier. It is a point on the ground level, likely representing a location on the ground.
In conclusion, the coordinates <c1,CAM_FRONT_RIGHT,1016.6,416.5> represent a point on the ground, not a traffic sign or a road barrier. |
| Mini-InternVL2-2B-DA-DriveLM | No. |
| llama_adapter_v2_multimodal7b | The image does not provide enough information to definitively determine whether the <c1,CAM_FRONT_RIGHT,1016.6,416.5> is a traffic sign or a road barrier. However, given that it is a part of a traffic sign, it is more likely to be a traffic sign rather than a road barrier. |
| InternVL2-2b-LoRA-25k-drivelm | No. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | No. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | No. |
| InternVL2-2b-LoRA-300k-drivelm | No. |
| llava-v1.6-mistral-7b-hf | The image you've provided appears to show a series of photographs, likely taken from a vehicle's rear-view camera, showing different views of a truck and its surroundings. The image labeled as <c1,CAM_FRONT_RIGHT,1016.6,416.5> is the front-right view of the truck. It is not a traffic sign or a road barrier; it is a photograph of the truck's front-right side taken from the perspective of the vehicle's rear-view camera. |
| 4 | Questionplanning | What actions could the ego vehicle take based on <c1,CAM_FRONT_RIGHT,1016.6,416.5>? Why take this action and what's the probability? |
| GT Answer | The action is to keep going at the same speed; the reason is that there is no safety issue. The probability is high. |
| InternVL2-2B | The ego vehicle, indicated by the camera's position and the text "CAM_FRONT_RIGHT," is currently facing the right side of the road. Based on the image, the ego vehicle could take the following actions:
1. **Driving straight ahead**: The ego vehicle could continue straight ahead, following the road's direction.
2. **Turning right**: The ego vehicle might need to make a right turn to navigate around the truck in the middle of the image.
3. **Driving past the truck**: The ego vehicle could pass the truck on its right side, as it is positioned in the middle of the image.
4. **Driving past the warehouse**: The ego vehicle could continue past the warehouse on its right side, as it is not obstructing the path.
The probability of taking this action can be inferred from the image. The truck is positioned in the middle of the image, and the ego vehicle is facing the right side of the road. This suggests that the ego vehicle is likely to turn right to avoid the truck. The probability of this action is relatively high, as the truck is not blocking the path and the ego vehicle is facing the right side of the road. However, the exact probability would depend on the ego vehicle's speed, the maneuverability of the ego vehicle, and the traffic conditions. |
| Mini-InternVL2-2B-DA-DriveLM | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| llama_adapter_v2_multimodal7b | Based on the given coordinates, the ego vehicle is likely to take the action of turning right. The coordinates indicate that the vehicle is at a position where it can see the truck in the background, and turning right would allow the vehicle to move away from the truck and continue its journey. The probability of the vehicle taking this action is high, as the vehicle is already in a position where it can see the truck and turn right, making it a logical choice. |
| InternVL2-2b-LoRA-25k-drivelm | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| InternVL2-2b-LoRA-300k-drivelm | The action is to keep going at the same speed. The reason is that there is no safety issue, and the probability is high. |
| llava-v1.6-mistral-7b-hf | The image you've provided appears to be a sequence of images showing a vehicle's perspective as it moves through a series of actions:
1. **CAM_FRONT_LEFT**: The vehicle is facing left, likely indicating that it is turning left.
2. **CAM_FRONT_RIGHT**: The vehicle is facing right, suggesting that it is turning right.
3. **CAM_BACK_LEFT**: The vehicle is facing left from the back, which could mean it is backing up or turning around.
4. **CAM_BACK_RIGHT**: The vehicle is facing right from the back, which could mean it is backing up or turning around.
The actions the ego vehicle could take based on the given camera angles are:
1. **Turn Left**: The vehicle is already in the process of turning left, as indicated by the first image.
2. **Turn Right**: The vehicle is not currently turning right, as indicated by the second image.
3. **Back Up**: The vehicle is not currently backing up, as indicated by the third and fourth images.
4. **Turn Around**: The vehicle is not currently turning around, as indicated by the third and fourth images.
The probability of the vehicle taking each action is as follows:
1. **Turn Left**: The vehicle is already in the process of turning left, so the probability of it continuing to turn left is 100%.
2. **Turn Right**: The vehicle is not currently turning right, so the probability of it turning right is 0%.
3. **Back Up**: The vehicle is not currently backing up, so the probability of it backing up is 0%.
4. **Turn Around**: The vehicle is not currently turning around, so the probability of it turning around is 0%.
The images do not provide enough information to determine the exact probability of the vehicle performing any of these actions in the future. The probabilities would depend on the vehicle's intended route, the driver's intentions, and the current traffic and road conditions. |
| 5 | Questionplanning | What actions taken by the ego vehicle can lead to a collision with <c1,CAM_FRONT_RIGHT,1016.6,416.5>? |
| GT Answer | Slight right turn. |
| InternVL2-2B | The ego vehicle, which is the car in the image, has several actions that could potentially lead to a collision with the truck on the right side of the road. Here are some of the key actions:
1. **Driving too close to the truck**: The truck is parked on the right side of the road, and the ego vehicle is driving towards it. If the ego vehicle were to drive too close to the truck, it could potentially collide with the truck's rear end or side.
2. **Driving in the wrong lane**: The ego vehicle is driving in the right lane, while the truck is in the left lane. If the ego vehicle were to change lanes to the left, it could potentially collide with the truck if it were to cross over into the left lane.
3. **Speeding**: The ego vehicle is driving at a moderate speed. If the ego vehicle were to exceed the speed limit, it could potentially collide with the truck if it were to overtake or pass it.
4. **Driving recklessly**: The ego vehicle is driving at a steady speed, but if it were to make sudden turns or swerve, it could potentially collide with the truck.
5. **Driving in the wrong direction**: The ego vehicle is driving straight ahead. If it were to turn left or change direction, it could potentially collide with the truck if it were to cross over into the left lane.
6. **Driving too fast for the road conditions**: The road appears to be a two-lane highway with no visible traffic. If the ego vehicle were to drive too fast for the road conditions, it could potentially collide with the truck if it were to overtake or pass it.
In summary, the ego vehicle's actions that could lead to a collision with the truck include driving too close to the truck, driving in the wrong lane, speeding, driving recklessly, driving in the wrong direction, or driving too fast for the road conditions. |
| Mini-InternVL2-2B-DA-DriveLM | Slight right turn. |
| llama_adapter_v2_multimodal7b | The ego vehicle is driving down the street, and its actions, such as changing lanes, making sudden turns, or not paying attention to the surroundings, can lead to a collision with the <c1,CAM_FRONT_RIGHT,1016.6,416.5> object. The ego vehicle is also driving close to the camera, which might make it difficult for the driver to see the object in time to avoid a collision. Additionally, the ego vehicle is driving near a truck, which could obstruct the driver's view of the object or make it more challenging to react in time to avoid a collision. |
| InternVL2-2b-LoRA-25k-drivelm | Sharp right turn. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | Slight right turn. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | Moderate right turn. |
| InternVL2-2b-LoRA-300k-drivelm | Sharp right turn. |
| llava-v1.6-mistral-7b-hf | The image you've provided shows a sequence of actions that can lead to a collision with the vehicle labeled as "CAM_FRONT_RIGHT" in the image. Here's a breakdown of the actions:
1. **CAM_BACK_LEFT**: The ego vehicle is moving backward, which means it is facing away from the camera.
2. **CAM_BACK_LEFT**: The vehicle continues to move backward, and it appears to be turning to the left.
3. **CAM_BACK_LEFT**: The vehicle is still moving backward and is now making a sharp turn to the left.
4. **CAM_FRONT_RIGHT**: The vehicle has now turned 180 degrees and is facing the camera.
5. **CAM_FRONT_RIGHT**: The vehicle is still facing the camera, but it seems to be moving forward.
6. **CAM_FRONT_RIGHT**: The vehicle is still moving forward, and it appears to be turning to the right.
7. **CAM_BACK_RIGHT**: The vehicle has now turned 180 degrees again and is facing away from the camera.
8. **CAM_BACK_RIGHT**: The vehicle is still moving forward and is now making a sharp turn to the right.
9. **CAM_FRONT_RIGHT**: The vehicle has turned 180 degrees once more and is facing the camera again.
10. **CAM_FRONT_RIGHT**: The vehicle is still facing the camera, but it seems to be moving forward.
11. **CAM_FRONT_RIGHT**: The vehicle is still moving forward, and it appears to be turning to the right.
12. **CAM_BACK_RIGHT**: The vehicle has turned 180 degrees again and is facing away from the camera.
13. **CAM_FRONT_RIGHT**: The vehicle is still moving forward and is now making a sharp turn to the right.
14. **CAM_FRONT_RIGHT**: The vehicle has turned 180 degrees once more and is facing the camera again.
15. **CAM_FRONT_RIGHT**: The vehicle is still facing the camera, but it seems to be moving forward.
16. **CAM_FRONT_RIGHT**: The vehicle is still moving forward, and it appears to be turning to the right.
1 |
| 6 | Questionplanning | In this scenario, what are safe actions to take for the ego vehicle? |
| GT Answer | Keep going at the same speed, and brake gently to a stop. |
| InternVL2-2B | Based on the image, the safe actions for the ego vehicle would include maintaining a safe following distance from the trailing vehicle, ensuring that the trailing vehicle is not overtaking the ego vehicle, and being aware of the presence of the other vehicle. The ego vehicle should also be prepared to react to any sudden changes in the trailing vehicle's speed or direction, as well as any potential hazards that may arise from the proximity of the trailing vehicle. Additionally, the ego vehicle should be vigilant for any other vehicles or obstacles that may be present on the road. |
| Mini-InternVL2-2B-DA-DriveLM | Keep going at the same speed, decelerate gradually without braking. |
| llama_adapter_v2_multimodal7b | In the image, a truck is driving down a street with a fence and a building in the background. To ensure the safety of the ego vehicle, the driver should follow these actions:1. Maintain a safe distance from the fence and building to avoid any potential collisions or damage to the vehicle.2. Be cautious of the truck's blind spots, as it may be difficult for the truck driver to see the ego vehicle in certain areas.3. Keep an eye on the truck's movements and brake lights to anticipate any sudden stops or changes in direction.4. Be prepared for the truck to make a turn or change lanes, as it may require the ego vehicle to adjust its position accordingly.5. Stay vigilant and be prepared to react to any unexpected situations that may arise.By following these safe actions, the ego vehicle driver can minimize the risk of accidents and ensure a smooth and safe driving experience. |
| InternVL2-2b-LoRA-25k-drivelm | Keep going at the same speed, accelerate, and proceed. |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | Keep going at the same speed, decelerate gradually without braking. |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | Keep going at the same speed, decelerate gradually without braking. |
| InternVL2-2b-LoRA-300k-drivelm | Keep going at the same speed, decelerate gradually without braking. |
| llava-v1.6-mistral-7b-hf | In the image provided, it appears to be a series of photos showing a vehicle's perspective while navigating through a construction zone with various types of equipment and vehicles. Here are some safe actions that the ego vehicle should take in such a scenario:
1. **Slow Down**: The vehicle should reduce its speed to match the posted speed limits or the speed of the construction zone, which is often lower than regular roads.
2. **Stay Alert**: The driver should be extra vigilant, paying attention to the road, signs, and any workers or equipment in the area.
3. **Follow Traffic Signs**: The driver should obey all traffic signs and signals, including any temporary signs that may be in place due to the construction.
4. **Maintain a Safe Following Distance**: Keeping a safe distance from the vehicle in front can help prevent collisions in case of sudden stops or changes in traffic flow.
5. **Be Prepared for Unexpected Changes**: Construction zones can be unpredictable, with workers moving around, equipment being moved, or temporary changes in road layout. The driver should be prepared for sudden changes in traffic patterns.
6. **Use Turn Signals**: If the vehicle needs to make a turn, it should use its turn signals well in advance to alert other drivers and workers in the area.
7. **Avoid Distractions**: The driver should avoid using mobile devices or engaging in any activity that could distract them from the road.
8. **Be Patient**: Construction zones can cause delays, so the driver should be patient and allow for extra travel time.
9. **Use Caution When Merging**: If the vehicle is merging into a lane that is being used by construction vehicles, it should merge slowly and carefully, ensuring it is visible to the construction vehicle operators.
10. **Stay in the Lane**: The driver should stay in the lane they are in and avoid making sudden lane changes that could put them in the path of construction equipment or vehicles.
By following these safety guidelines, the driver can help ensure a safe passage through the construction zone. |
| 7 | Questionbehavior | Predict the behavior of the ego vehicle. Please select the correct answer from the following options: A. The ego vehicle is slightly steering to the left. The ego vehicle is driving very fast. B. The ego vehicle is slightly steering to the left. The ego vehicle is driving fast. C. The ego vehicle is steering to the right. The ego vehicle is driving with normal speed. D. The ego vehicle is going straight. The ego vehicle is driving with normal speed. |
| GT Answer | B |
| InternVL2-2B | Based on the image, the ego vehicle is driving straight. The image shows the rear of the vehicle with no visible signs of steering to the left or right, nor any indication of the vehicle being driven at high speeds. The vehicle is also not shown to be driving very fast, as there is no visible motion blur or excessive speed indicators. Therefore, the correct answer is D. The ego vehicle is going straight. The ego vehicle is driving with normal speed. |
| Mini-InternVL2-2B-DA-DriveLM | C |
| llama_adapter_v2_multimodal7b | The ego vehicle is driving with normal speed, and it is going straight. |
| InternVL2-2b-LoRA-25k-drivelm | D |
| InternVL2-2b-LoRA-25k-drivelm-offline-redcircle-ctag-bkgd | D |
| InternVL2-2b-LoRA-25k_plus_DL-PL-10pct | D |
| InternVL2-2b-LoRA-300k-drivelm | D. The ego vehicle is going straight. The ego vehicle is driving with normal speed. |
| llava-v1.6-mistral-7b-hf | D. The ego vehicle is going straight. The ego vehicle is driving with normal speed. |