A collection of articles on the research and applications of Generative AI
Ensemble GenAI
GenAI for the Enterprises
Navigating the Challenges of Large-scale Chatbot Deployment: When considering the integration of GenAI chatbots within an enterprise’s operations, it’s crucial to recognize the potential challenges. Key insights include:
Published: April 4, 2024
Brainstorming with Chatbots on building a Universal Information Worker Chatbot: How to use an ensemble of debating chatbot to design and implement a Universal Information Worker chatbot for enterprises.
Published: March 14, 2024
The Manchurian Chatbot Problem: Are Chatbot Viruses Coming Our Way?: Large Language Model (LLM) chatbots are becoming more advanced, but this progress could also lead to the “Manchurian Candidate” problem: the risk of adversaries implanting malicious data into LLMs’ long-term memories, which could be triggered to cause harm.
Published: March 12, 2024
Building Enterprise GenAI Chatbots: It is important to recognize that enterprise GenAI chatbots are essentially the latest iteration of traditional enterprise software applications. As such, many of the same challenges faced during their development cycles should not be overlooked or underestimated.
Published: Feb 27, 2024
GenAI Vision Applications
Navigating the Pitfalls of Vision Language Models:
this is a story about how a winning Claude-3 lose an image classification debate to GPT-4V, due to its weak personality. It is also about a cougar on house deck, and how AI ecould mistaken it as just a dog.
Published: March 18, 2024
Is Anthropic’s Claude-3 Ready for AGI?: Does Anthropic’s multimodal Claude-3 have enough visual common sense to support AGI (Artificial General Intelligence)? It is actually kind of close.
Published: March 4, 2024
Is OpenAI’s GPT-4V Ready for AGI?: Does OpenAI’s vision model GPT-4V have enough visual common sense to support AGI (Artificial General Intelligence)? It is actually kind of close.
We ran 17 types of visual common sense tests against GPT-4V to find out how well it can deal with the real world, and here are the results.
Published: Feb 29, 2024
Hidden Problems in GenAI
Creative Imaginator: upload an image, select one of many predefined style, and the app automatically generate a new image that is different from the original in creative ways, but still follow the main theme in the original image.
This is basically an image-to-text-to-image tool with support for style injection, which is easier to manage than text-to-image system (which requires substantial prompt engieering skill), or the image-to-image approach (which is best for making small changes).
This tool is ideal for anyone who just need to have some illustrations. A gallery is available here for viewing.
Automatic Backseat Driver: upload a road scene, and have the scene analyzed in detail along with the recommended high-level action of whether to proceed, stop, or turn around.
This is a tool to test how well OpenAI’s vision model GPT-4V can be used as a high-level component for driving a Level 5 Autonomous Vehicle.
See this gallery for a long list of road scenes tested with this tool, which demonstrated that GPT-4V has performed surprising well even in highly challenging situations.
Radiologist V2: