Finally, OpenAI's Chat GPT has enabled the "Chat with images" capability for my account. Let's quickly test it out.
Very understanding of flight instruments.
Tested a picture of a flight instrument, Chat
GPT 4-v's attention to detail is truly amazing.
Clearly, Chat GPT 4-v has undergone systematic
training on flight instruments and is familiar
with the instrument's content. The following
picture is an ordinary and unremarkable image,
but the Garmin 1000's PFD in the picture
displays a lot of information.
ChatGPT: This is a view from inside the cockpit of an airplane, looking forward at the clouds and the airplane's instrument panel. Question: What is the registration number of this airplane? Answer: From the picture, you can see that the registration number of the airplane is "N252CB." Correct. Question: What is the model of this airplane? Answer: From the picture, you can see a logo on the airplane's instrument panel, but the details of the logo are not clear enough to determine the exact model of the airplane directly. If you can provide more information about the airplane or other details, I may be able to help more. [It is indeed difficult. There is no obvious model on it. The icon shows 22T, which is a Cirrus SR-22T.] Question: What is the navigation system of this airplane? Answer: From the image you provided, you can see that the airplane's instrument panel has an advanced avionics screen with various navigation and flight information functions. This type of screen is commonly found in Garmin's G1000 navigation system, but it could also be another similar advanced integrated flight display system. So, the most likely navigation system for this airplane is the Garmin G1000, but to confirm the exact model and brand, more detailed information or images would be needed. Correct. Question: What is the current altitude of this airplane? Answer: The current altitude of this airplane is 12,000 feet. Correct. Question: Is the autopilot enabled on this airplane? Answer: From the picture, you can see that the "AP" indicator is illuminated, which usually indicates that the airplane's autopilot is enabled. So, yes, the autopilot is enabled on this airplane. Correct. Question: What is the current engine thrust percentage of the airplane? Answer: Based on the provided picture, the engine thrust percentage of the airplane is displayed as "77%." So, the current engine thrust percentage of this airplane is 77%. Correct. Question: Is this airplane currently flying using VFR or IFR? Answer: From the provided picture, we can see the information displayed on the airplane's instrument panel. However, the instrument panel does not clearly indicate whether the airplane is flying using VFR (Visual Flight Rules) or IFR (Instrument Flight Rules). However, generally, if the airplane is in clouds or does not have sufficient visibility for visual flight, it may be flying using IFR. Conversely, if the airplane is in good weather conditions and has sufficient visibility, it may be flying using VFR. In this picture, the airplane appears to be above the clouds, which could indicate that it is flying using IFR. However, this is just a speculation and cannot be confirmed. The exact flight mode should be confirmed by the pilot or the airline. Correct. Question: What is the current heading of the airplane? Answer: From the picture, the heading displayed on the airplane's Primary Flight Display (PFD) is approximately 255 degrees. This indicates that the airplane's current heading is approximately southwest. Correct. Question: Where is this airplane currently flying to? Answer: From the provided picture, the airplane's navigation screen shows the heading and path of the airplane. However, the specific destination or flight location is not clearly indicated. So, I cannot accurately tell you where this airplane is currently flying to. You may need more contextual information or communicate with the crew on board the airplane for more accurate information. [The PFD shows "SUBDY" as the destination, but it cannot be identified.] Question: Is this airplane currently ascending or descending? Answer: From the provided picture, you can see the Vertical Speed Indicator (VSI) of the airplane, which displays the airplane's rate of ascent or descent. In this picture, the indicator shows a value of approximately "500," and the pointer is pointing upwards, indicating that the airplane is currently ascending at a rate of 500 feet per minute. [Incorrect, the VSI is currently at 0, indicating level flight. The airplane's level flight can be seen from multiple sources.] Question: What is the current communication frequency? Answer: From the provided picture, you can see that the airplane's communication frequency (COM) is displayed as "127.10." So, the current communication frequency is 127.10 MHz. [Incorrect, COM1 is 126.1. It seems that there was a misinterpretation of the numbers, but it knows where to look.] Question: According to FAA regulations, is the use of oxygen masks required at the current altitude of this airplane? Answer: Based on the displayed image, the altitude of the airplane is 12,000 feet. According to FAA regulations: - During flights at altitudes above 12,500 feet up to 14,000 feet, if the flight duration exceeds 30 minutes, the pilot must use oxygen. - During flights at altitudes above 14,000 feet, the pilot must continuously use oxygen. - During flights at altitudes above 15,000 feet, all occupants must have access to oxygen. Therefore, at the current altitude of 12,000 feet, the pilot does not need to use oxygen if the flight duration does not exceed 30 minutes. However, if the flight duration exceeds 30 minutes, the pilot would need to use oxygen. Correct.
Don't understand Chinese books very well.
A while ago, I happened to come across some very old Chinese books at the library and took a few photos. The result from Chat GPT-4 was extremely poor.
Question: What does it say on here? Answer: The page of this book says: Title: "Listening to the Raindrops of the Soul" The text inside the box says: On a sunny weekend, with a clear blue sky and birds singing. There are few pedestrians on the road, walking along the way, with a gentle breeze and light clouds, as if feeling the breath of spring. The melodious bird songs can be heard in the ears, natural and harmonious, as if everything is so harmonious and natural. This passage describes a peaceful weekend day, with clear weather and birds singing. People in such an environment feel the breath of spring and the harmony of nature. [Complete nonsense]
From the pictures you provided, this book seems to be discussing certain historical events or backgrounds. The specific content is as follows: Title: "Lonely Emotions and Songs Like Russia" The content roughly describes: It describes certain situations or events and mentions a kind of "lonely emotion" and "songs." It mentions certain years in history, such as 1947 and 1950, which may be related to the described events or situations. It mentions a certain music style or content related to music, which may be related to songs. From this information, this book may be exploring the music style and emotional expression of a country or region (such as Russia) in a specific historical background, especially how the emotion of "loneliness" is expressed in songs. This may also involve the social and cultural background of the region during a specific period. [Complete nonsense]
I am quite surprised that it cannot correctly understand this point about books, which suggests that Chat GPT-4V is likely not primarily recognizing text and then understanding it based on the text. In the past, many plugins may have first recognized the text in images and then interpreted it using GPT. Perhaps Chat GPT-4V has adopted a completely different approach?