Hello,
I’ve integrated the ChatGPT Plus API into my website, leveraging it to generate image prompts. This allows users to create images based on the complexity of their language, like adding “curves” to influence the output. To enhance prompt quality, I’ve developed an assistant and utilized threading.
This setup enables me to submit less refined prompts to the assistant, which then optimizes them for better clarity or mandates photorealism in the generated images. However, the cost of using the API is substantial, reflecting its position in ongoing beta and research phases. An interesting aspect of Dalle-3, given its experimental nature, is its occasional free reworking of prompts to include a broader representation of ethnicities and identities, should the prompts lack specificity.
Despite Dalle-3’s ability to produce visually stunning outputs, it often diverges from my exact requirements, leading me to explore alternatives like Midjourney for more predictable results.
Regarding the assistant, initial attempts at sticking to a single thread for multiple prompt revisions resulted in recycled prompts. I’ve since shifted to generating a new thread for each request. As of the latest update, there’s also support for a single-threaded assistant via GPT-4, though I’m still evaluating its effectiveness.
Using ChatGPT Plus and GPT-4 simplifies interactions with Dalle-3, albeit with some limitations. Direct API use offers more control over parameters like image size and quality, which are constrained within the ChatBot interface. Notably, accessing Dalle-3 through alternative means provides leniency in generating images with public figures or trademarks, a flexibility not found when using GPT-4 due to OpenAI’s content policies.
These policies, while designed to avoid copyright issues, can feel overly restrictive, hindering the full exploration of the API’s capabilities. For instance, requests for artwork resembling specific pieces by known artists are outright refused, pushing users towards more generic creative directions.
In conclusion, while the technology offers incredible potential for creative expression, the combination of high costs and restrictive content policies presents significant challenges. The journey with OpenAI’s tools is a complex blend of innovation and limitation, requiring a balance between creative ambition and the practicalities of policy compliance.