<?xml version="1.0" encoding="UTF-8"?><rss xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom" version="2.0" xmlns:itunes="http://www.itunes.com/dtds/podcast-1.0.dtd" xmlns:googleplay="http://www.google.com/schemas/play-podcasts/1.0"><channel><title><![CDATA[Andrew Roberts’ Substack]]></title><description><![CDATA[Enterpreneurship and AI]]></description><link>https://www.andrewroberts.blog</link><image><url>https://substackcdn.com/image/fetch/$s_!79Db!,w_256,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fd9f6536d-0d47-4d0f-b543-ca29473e1ee8_1024x1024.png</url><title>Andrew Roberts’ Substack</title><link>https://www.andrewroberts.blog</link></image><generator>Substack</generator><lastBuildDate>Wed, 13 May 2026 10:23:45 GMT</lastBuildDate><atom:link href="https://www.andrewroberts.blog/feed" rel="self" type="application/rss+xml"/><copyright><![CDATA[Andrew Roberts]]></copyright><language><![CDATA[en]]></language><webMaster><![CDATA[androb@substack.com]]></webMaster><itunes:owner><itunes:email><![CDATA[androb@substack.com]]></itunes:email><itunes:name><![CDATA[Andrew Roberts]]></itunes:name></itunes:owner><itunes:author><![CDATA[Andrew Roberts]]></itunes:author><googleplay:owner><![CDATA[androb@substack.com]]></googleplay:owner><googleplay:email><![CDATA[androb@substack.com]]></googleplay:email><googleplay:author><![CDATA[Andrew Roberts]]></googleplay:author><itunes:block><![CDATA[Yes]]></itunes:block><item><title><![CDATA[Interview Playbook Generator]]></title><description><![CDATA[Have you ever made a hiring mistake? Most of us have. Here is an AI to help you level up your game.]]></description><link>https://www.andrewroberts.blog/p/interview-playbook-generator</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/interview-playbook-generator</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Fri, 12 Jan 2024 23:10:49 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/959dbf5b-430e-4bae-98c2-12bcce06786b_1456x816.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>A key to getting better at hiring decisions is getting better at interviews. </p><p>And getting better at interviews means adding structure. Time and time again, research has shown that structured interviews better predict job performance and reduce bias.</p><p>Yet almost no one does them. Why?</p><p>One barrier is the effort required for job analysis and interview question selection. Big companies are good at this and can afford to be because they recruit many people for similar roles. Smaller company interviewers are often frantically googling "interview questions for X" 10 minutes before an interview.</p><p>To level the playing field, I'm open-sourcing an AI to generate a complete 18-page interview playbook based on a job description and a company's values. This is a sample guide generated from a job description for an infrastructure software engineer on the Y Combinator website.</p><p>How can you use this?</p><ul><li><p><strong>ChatGPT</strong>. Available on the GPT Store here: <a href="https://lnkd.in/gYKaAWaf">https://lnkd.in/gYKaAWaf</a></p></li><li><p><strong>Python Notebook</strong>. Open source and available on GitHub here: <a href="https://lnkd.in/gRrUPgbi">https://lnkd.in/gRrUPgbi</a></p></li></ul><p>It is incredible to see the quality of the output. &lt;$1 of API calls to produce something that would have taken hours. And, in my tests, it does a better job than most of us could do even if we had the time.</p><p>I would have loved this as a founder; I hope you find it useful. In return, I'd ask you for feedback - how could it be improved? Is this something you could use? I'm happy to burn some API calls and generate examples for your current job openings. Hit me up!</p><p>Also, kudos to <a href="https://www.linkedin.com/in/mhideo/">Michael Hideo Smith</a>, who introduced a form of structured interviews called Targeted Selection at <a href="https://www.linkedin.com/company/jointiny/">TinyMCE</a> and lifted our hiring game.</p>]]></content:encoded></item><item><title><![CDATA[The Life Journey Question]]></title><description><![CDATA[Get better at culture fit interviews with just one question]]></description><link>https://www.andrewroberts.blog/p/the-life-journey-question</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/the-life-journey-question</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Mon, 11 Dec 2023 18:40:33 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/4913bfc8-38a6-45bc-b2ad-6a5f37bd323a_2912x1632.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>As the CEO of <a href="https://www.tiny.cloud">TinyMCE</a>, I often wasn&#8217;t the direct hiring manager but the last &#8220;culture fit&#8221; interviewer, together with our CFO and head of HR, <a href="https://www.linkedin.com/in/amy-chen-cfo?miniProfileUrn=urn%3Ali%3Afs_miniProfile%3AACoAAAAQd4oBdq2S7LdvATkBIcl07i6tachPGu0">Amy Chen</a>. She has an intuition for people that is second to none, but I&#8217;ve had to develop mine over the years.</p><p>To elicit insights in these sorts of interviews, my cornerstone question is:</p><blockquote><p><strong>Take us back to high school and your interests then. What subjects did you take? What did you do after graduating? Take us on your journey from that point to sitting here today.</strong></p></blockquote><p>I call this the<strong> Life Journey Question</strong>.</p><p>The Life Journey Question allows the candidate to make the connection between their past and what led them to this role, highlighting their continuity in motivations and personality. People are generally great at telling their story and love the opportunity to share.</p><p>I&#8217;ve also found it a good ice breaker that helps calm candidates' nerves; everyone knows how to tell their story. It is a fantastic way to get to know candidates, understand their motivations and get a sense of them as a whole human.&nbsp;It is a way better way to learn about their experience than going back through their job history. Our lives move forward, not backward.</p><p>I hate repeating questions I know others on the hiring team have asked them, perhaps multiple times. I&#8217;m also often not the best person to evaluate specialized technical skills. By asking the Life Journey Question and paying keen attention to their answers, I hope to surprise them a little. I want to leave them the feeling that they would be valued and viewed in three dimensions if they were to join our company.</p><p>Clear communication skills are also essential for any role. A candidate who struggles to convey their personal story logically raises doubts about effectively sharing job-related ideas. Threading one's life journey into a coherent narrative demonstrates clear thinking. At a minimum, candidates should communicate a personal story that flows.</p><p>Finally, I&#8217;ve found this question makes it clear whether they are motivated by the opportunity. You can feel the misalignment of the job with their life journey. It just isn&#8217;t the next chapter in their story.</p><p>What are your favorite &#8220;fit&#8221; interview questions?</p>]]></content:encoded></item><item><title><![CDATA[Takeaways from the AI Engineer Summit]]></title><description><![CDATA[Last week I attended the inaugural AI Engineer Summit, a highly curated event by Shawn Wang (aka swyx) and Benjamin Dunphy.]]></description><link>https://www.andrewroberts.blog/p/takeaways-from-the-ai-engineer-summit</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/takeaways-from-the-ai-engineer-summit</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Mon, 16 Oct 2023 23:01:40 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/b10b56ed-7ed4-4c71-9659-90c8e9e40d30_1456x816.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Last week I attended the inaugural<a href="https://www.linkedin.com/company/ai-engineer-summit/"> AI Engineer Summit</a>, a highly curated event by Shawn Wang (aka<a href="https://www.swyx.io/"> swyx</a>) and Benjamin Dunphy. The 30+ short talks and 500 attendees discussed the capabilities, challenges and future of building real-world applications with large language models (LLMs.) Here's what I took away.</p><h2><strong>The Transformative Power of LLM</strong></h2><p>LLM-based apps have already demonstrated immense potential.<a href="https://www.linkedin.com/in/mariorodriguez3?miniProfileUrn=urn%3Ali%3Afs_miniProfile%3AACoAAABhGskB91oU2LArcGdxU0qyLvDAOoMHmMw"> Mario Rodriguez</a>, VP of Product at GitHub, revealed in his talk that Copilot now has over 1 million users and generates over $100 million in annual recurring revenue. And the release of ChatGPT in November 2022 brought conversational interfaces into the mainstream.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.andrewroberts.blog/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Andrew Roberts&#8217; Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div><p>However, there are inherent tradeoffs with these models to consider. Massive models like GPT-4 unlock advanced capabilities but are expensive and slow. Smaller domain-specific models are faster and cheaper to run but less flexible. Cloud-based models offer cutting-edge scale, while local models provide more control and privacy. There is no one-size-fits-all solution yet - the best approach depends on the context. Advanced applications will work with many different models.</p><p>Exciting LLM applications span code generation, task automation, and various forms of creative expression, like text, images, and music. Multimodal models that understand connections between vision, audio, and semantics open even more possibilities.</p><h2><strong>Real-World Deployment Challenges</strong></h2><p>While the demos at the conference were impressive, presenters repeatedly emphasized that building real-world systems using LLMs involves significant challenges like:</p><ul><li><p>Rigorously testing unpredictable model behaviors</p></li><li><p>Implementing guardrails and verification for reliability</p></li><li><p>Overcoming issues in data and prompt engineering</p></li><li><p>Improving retrieval algorithms and metadata usage for relevance</p></li><li><p>Establishing monitoring and benchmarks to refine components</p></li></ul><p>Techniques like running evaluations (or "evals") using synthetic data, adding constraints, and leveraging human feedback loops are important to make these applications robust and trustworthy enough.</p><h2><strong>The Vibrant AI Community</strong></h2><p>Events like this play an incredibly valuable role in strengthening the nascent AI engineering community. I was struck by the brainpower in the room and how freely everyone shared their best ideas.</p><p>User feedback is also critical, as evidenced by the success of open ecosystems like<a href="https://www.linkedin.com/company/autogpt/"> AutoGPT</a> and<a href="https://www.linkedin.com/company/langchain/"> LangChain</a> which have attracted huge community engagement. Feedback within LLM-based apps identifies edge cases and opportunities for improving prompts and fine-tuning models.</p><p>There is also a commitment across the field to pursue AI development ethically. This encompasses assessing risks, establishing governance principles, and educating practitioners on pitfalls like bias.</p><h2><strong>The Future</strong></h2><p>Some promising trends I saw at the conference included:</p><ul><li><p>Emerging multimodal applications combining language, vision, and audio. GPT-4V is seriously impressive; you should play with it on ChatGPT today.</p></li><li><p>Innovative contextual reasoning and user-adaptive interfaces</p></li><li><p>Agent training via reinforcement learning in simulated environments</p></li><li><p>Synthetic dataset creation by distilling knowledge from large models</p></li></ul><p>Going forward, key goals for AI-enhanced engineering include moving to more abstract, outcomes-based programming paradigms. Interpretability of model behaviors will grow in importance. There will also be a focus on mainstreaming access to AI and assessing potential risks.</p><h2><strong>Next Steps</strong></h2><p>The AI Engineer Summit will be back in San Francisco next year in a larger venue, and I highly recommend it. In the meantime, you can watch the replays of<a href="https://www.youtube.com/watch?v=veShHxQYPzo"> Day 1</a> and<a href="https://www.youtube.com/watch?v=qw4PrtyvJI0"> Day 2</a>.</p><div class="subscription-widget-wrap-editor" data-attrs="{&quot;url&quot;:&quot;https://www.andrewroberts.blog/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe&quot;,&quot;language&quot;:&quot;en&quot;}" data-component-name="SubscribeWidgetToDOM"><div class="subscription-widget show-subscribe"><div class="preamble"><p class="cta-caption">Thanks for reading Andrew Roberts&#8217; Substack! Subscribe for free to receive new posts and support my work.</p></div><form class="subscription-widget-subscribe"><input type="email" class="email-input" name="email" placeholder="Type your email&#8230;" tabindex="-1"><input type="submit" class="button primary" value="Subscribe"><div class="fake-input-wrapper"><div class="fake-input"></div><div class="fake-button"></div></div></form></div></div>]]></content:encoded></item><item><title><![CDATA[AI's take off in 2023. Why now?]]></title><description><![CDATA["Why now?" is a classic investor question for founders. I attempted to answer it for this current wave of AI, and it led me to some great stories on the origins of GPT-3.]]></description><link>https://www.andrewroberts.blog/p/ais-take-off-in-2023-why-now</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/ais-take-off-in-2023-why-now</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Tue, 03 Oct 2023 22:37:44 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/ca353719-62f1-4140-9ac0-9f4e033fcbd3_1920x1080.jpeg" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p></p><p class="button-wrapper" data-attrs="{&quot;url&quot;:&quot;https://www.andrewroberts.blog/subscribe?&quot;,&quot;text&quot;:&quot;Subscribe now&quot;,&quot;action&quot;:null,&quot;class&quot;:null}" data-component-name="ButtonCreateButton"><a class="button primary" href="https://www.andrewroberts.blog/subscribe?"><span>Subscribe now</span></a></p>]]></content:encoded></item><item><title><![CDATA[Why PLG Isn't Enough]]></title><description><![CDATA[The Case for Product-Led Sales]]></description><link>https://www.andrewroberts.blog/p/why-plg-isnt-enough</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/why-plg-isnt-enough</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Mon, 28 Aug 2023 21:05:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/9dcf32fa-67ab-49ad-8608-a96bacfed4e2_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>This newsletter is about &#8220;AI and Entrepreneurship,&#8221; so, for a change, I&#8217;m pivoting from large language models to SaaS business models and, ironically, the need for a human touch.</p><h3>Moving Beyond PLG to PLS</h3><p>The concept of product-led growth (PLG) took the SaaS world by storm a few years ago. The idea is irresistible - build a product so great that it sells itself through word-of-mouth, virality, and organic adoption. A shiny new object that renders traditional sales teams obsolete.</p><p>It's a compelling vision but also an incomplete one. While PLG has proven a powerful model for efficient acquisition, conversion, and launch, few companies have scaled into sustainable, high-growth businesses.</p><p>Congratulations if you&#8217;ve gotten your SaaS off the ground on PLG alone; it is not easy by any means. But to tap into the majority of revenue available in your market, you will likely need to layer in a complementary product-led sales (PLS) motion. One that combines the organic, bottom-up power of PLG with the benefits of sales assistance.</p><p>Done right, PLS reduces friction, boosts conversions, expands deal sizes, and enables efficient scaling into enterprise accounts. It brings together the best of both worlds - product-led virality and higher-touch sales relationships.</p><p>In this post, I'll explore the benefits of PLS, common myths, identifying PLS opportunities, implementation advice, and why this hybrid approach unlocks the most growth.</p><h3>The Benefits of Adopting Product-Led Sales</h3><p>What makes the PLS model so effective for growth? Throughout my career, I've consistently found that 70% of revenue comes from the top 10% of our customers. These large enterprise accounts typically require higher-touch sales assistance when making purchasing decisions. A "no sales" mantra misses out on tapping into this vast revenue pool. PLS provides customized guidance and account management that unlocks enterprise spend you would not otherwise get.</p><p>By having sales reps build closer customer relationships, PLS also improves retention and maximizes the expansion revenue driven by product usage and cross-selling. Our sales-managed accounts regularly achieved over 100% net dollar retention, while self-service customers churned as much as 5% a month. This increased lifetime value is challenging to match with PLG alone.</p><p>Large enterprise accounts expect white-glove service from sales reps when they make purchasing decisions. PLS provides enterprises with the customized assistance they want and stops them from going elsewhere.</p><h3>Common Myths About Product-Led Sales</h3><p>Given the novelty of PLS, there are some common myths worth addressing.&nbsp;</p><p>Many assume sales reps need to be located near customers. With tools like Zoom, I&#8217;ve seen that location matters less than cultural alignment. Sales can happen efficiently from anywhere. Indeed, our sister company, CKSource, has built a great business selling worldwide with its sales team entirely based in Poland.</p><p>You might even think PLG and sales are incompatible. On the contrary, they&#8217;re complementary. PLG seeds opportunities for sales to expand upon. Sales then fuel further virality and referrals.</p><p>Some believe integrating sales will erode company culture. But with careful hiring focused on cultural fit, sales and PLG teams can mutually thrive. I found that a stellar sales team made us even more customer-centric.</p><p>The biggest myth is that PLG precludes ever needing sales. However, even PLG pioneers like Atlassian, Slack, Zoom and Dropbox now have sales teams. They realized sales unlock additional growth at scale.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!a5f_!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!a5f_!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 424w, https://substackcdn.com/image/fetch/$s_!a5f_!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 848w, https://substackcdn.com/image/fetch/$s_!a5f_!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 1272w, https://substackcdn.com/image/fetch/$s_!a5f_!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!a5f_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png" width="789" height="256" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:256,&quot;width&quot;:789,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!a5f_!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 424w, https://substackcdn.com/image/fetch/$s_!a5f_!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 848w, https://substackcdn.com/image/fetch/$s_!a5f_!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 1272w, https://substackcdn.com/image/fetch/$s_!a5f_!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F1ea1d5a6-99e9-4fdd-be1d-2f41b2cf1873_789x256.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">According to LinkedIn, Dropbox has 280 people in sales and another 229 in business development.</figcaption></figure></div><h3>Identifying Product-Led Sales Opportunities</h3><p>When should your sales team reach out to product-qualified leads? Many of the best prospects will self-identify through your &#8220;Contact Us&#8221; form, but there are also some key signals to watch for:</p><p>Look for power users who are deeply engaged with the product. They can become internal champions who advocate for purchasing and expansion. If users ask complex technical questions online or via support, they likely need the customized guidance a sales rep can provide.</p><p>When users submit feature requests, it may indicate they've outgrown current self-serve plans and need capabilities from higher tiers. I often refer product leaders to <a href="https://www.enterpriseready.io/">EnterpriseReady</a> for inspiration for enterprise-level features.</p><p>When users exceed thresholds like the number of projects or collaborators in self-serve plans, it means they're primed for upselling. If users seek to integrate your product with other tools via APIs, they may be ready for customized enterprise plans.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!tFbK!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!tFbK!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 424w, https://substackcdn.com/image/fetch/$s_!tFbK!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 848w, https://substackcdn.com/image/fetch/$s_!tFbK!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 1272w, https://substackcdn.com/image/fetch/$s_!tFbK!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!tFbK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png" width="1079" height="422" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/caa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:422,&quot;width&quot;:1079,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!tFbK!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 424w, https://substackcdn.com/image/fetch/$s_!tFbK!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 848w, https://substackcdn.com/image/fetch/$s_!tFbK!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 1272w, https://substackcdn.com/image/fetch/$s_!tFbK!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcaa2b70f-eba0-476c-9690-f54d0645a064_1079x422.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Features like SSO and audit logs can help take you up-market.</figcaption></figure></div><h3>Advice for Implementing Product-Led Sales</h3><p>When hiring sales reps, prioritize finding people who align culturally and embrace a product-led mindset over those who just want to "hunt." Invest in resources and training to educate users on the evolving value as your product expands.</p><p>Use pricing tiers like "Free, Good, Better, Best" with a "Contact Sales" CTA on the Best tier to encourage upgrades. Have leaders personally engage promising inbound leads to build knowledge and ensure they pay attention to what is being said via Gong.io (or whatever you use.)</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!b6u-!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!b6u-!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 424w, https://substackcdn.com/image/fetch/$s_!b6u-!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 848w, https://substackcdn.com/image/fetch/$s_!b6u-!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 1272w, https://substackcdn.com/image/fetch/$s_!b6u-!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!b6u-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png" width="1101" height="1000" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/eac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:1000,&quot;width&quot;:1101,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="" title="" srcset="https://substackcdn.com/image/fetch/$s_!b6u-!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 424w, https://substackcdn.com/image/fetch/$s_!b6u-!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 848w, https://substackcdn.com/image/fetch/$s_!b6u-!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 1272w, https://substackcdn.com/image/fetch/$s_!b6u-!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Feac79572-7353-484c-ac01-4b21bf368fc4_1101x1000.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Free Good Better Best on the TinyMCE pricing page.</figcaption></figure></div><p>When transitioning sales reps from traditional techniques to a product-led sales approach, keep these training tips in mind:</p><ul><li><p>Prospects are often highly informed about the product already. Avoid overselling or relying on scripts.&nbsp;</p></li><li><p>Focus on efficiently solving the specific problems or use cases that brought prospects in. Don't distract them. You may not need a demo, and our reps haven&#8217;t used a PowerPoint presentation for years.</p></li><li><p>Leverage tools like Gong.io and Outreach to refine approaches for optimizing inbound leads.</p></li><li><p>Prioritize cultural alignment and a customer-centric mindset over being hard-charging&#8212;structure compensation to reward expansion within accounts, not just net new logos.</p></li><li><p>Highlight how sales can tap into revenue potential from enterprise accounts missed by PLG.</p></li><li><p>Encourage collaboration with product and marketing to educate users on new value continually.</p></li></ul><p>I found it always helpful to share examples of other companies succeeding with a hybrid PLG + sales approach. Our exec team often discussed MongoDB (this <a href="https://www.youtube.com/watch?v=n-UFdGvTFMs">video</a> in particular.)</p><h3>The Hybrid Model is the New Normal</h3><p>In today's world of self-service software, product-led growth is an alluring vision of viral acquisition and frictionless expansion. But very few companies have managed to sustain growth on PLG alone.</p><p>To tap into the majority of revenue potential in your market, you'll likely need to layer in a complementary product-led sales approach. One that reduces friction for larger buyers, expands deal sizes, and enables expansion.</p><p>PLS brings together the best of both worlds - viral product-led motion with higher-touch sales relationships. Done right, it unlocks growth from both expanding existing accounts and penetrating untapped enterprise opportunities.</p><p>Further reading:</p><ul><li><p><a href="https://openviewpartners.com/blog/your-guide-to-product-led-sales/">Guide to Product-Led Sales</a> from OpenView</p></li><li><p>McKinsey's <a href="https://www.mckinsey.com/industries/technology-media-and-telecommunications/our-insights/from-product-led-growth-to-product-led-sales-beyond-the-plg-hype">From product-led growth to product-led sales: Beyond the PLG hype</a></p></li><li><p><a href="https://www.arthurventures.com/blog/introduction-to-product-led-salesnbsp">Introduction to Product Led Sales</a> by Arthur Ventures</p></li><li><p><a href="https://www.youtube.com/watch?v=n-UFdGvTFMs">Video: Consumerizing MongoDB to Grow to $100M+</a> by Sahir Azam</p></li></ul>]]></content:encoded></item><item><title><![CDATA[The Limits of Narrow AI ]]></title><description><![CDATA[And the Path to More Human-like Intelligence]]></description><link>https://www.andrewroberts.blog/p/the-limits-of-narrow-ai</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/the-limits-of-narrow-ai</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Tue, 08 Aug 2023 21:01:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/416a81cc-bc4a-4ad0-819f-d953e1e7edf2_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Every few months, a new AI system surpasses human ability in yet another domain, and the buzz around ChatGPT has reached a fever pitch. So we must be on the cusp of creating human-level artificial general intelligence, right?</p><p>Not so fast, says Meta&#8217;s Chief AI Scientist, <a href="https://www.linkedin.com/in/yann-lecun?miniProfileUrn=urn%3Ali%3Afs_miniProfile%3AACoAAAADFk0BbiOeu2Wrer11SaPH_5m1GM8pG6Q">Yann LeCun</a>. When I get excited about new technology, it is valuable to hear the counterarguments, and, as a winner of the Turing Award - essentially the Nobel Prize of computing - Yann knows what he's talking about. Under his guidance, Meta is also a leading developer of large language models, so it is worth paying attention when he criticizes them.</p><p>In a <a href="https://www.youtube.com/watch?v=vyqXLJsmsrk&amp;list=PLKemzYMx2_Ot1MZ_er2vFiINdJEgDO8Hg">keynote address at MIT</a> last week, LeCun discussed the fundamental limitations of current approaches and his vision for moving beyond what he calls &#8220;narrow AI&#8221; to achieve more robust, flexible, and human-like intelligence. Given the many open challenges, he says it may take "years to decades" to reach human-level AI.&nbsp;</p><div id="youtube2-vyqXLJsmsrk" class="youtube-wrap" data-attrs="{&quot;videoId&quot;:&quot;vyqXLJsmsrk&quot;,&quot;startTime&quot;:null,&quot;endTime&quot;:null}" data-component-name="Youtube2ToDOM"><div class="youtube-inner"><iframe src="https://www.youtube-nocookie.com/embed/vyqXLJsmsrk?rel=0&amp;autoplay=0&amp;showinfo=0&amp;enablejsapi=0" frameborder="0" loading="lazy" gesture="media" allow="autoplay; fullscreen" allowautoplay="true" allowfullscreen="true" width="728" height="409"></iframe></div></div><p>In this article, I'll explore why LeCun feels current methods fall short and what different architectures and training techniques may get us to the next level. He said that LLMs &#8220;suck&#8221; and are &#8220;doomed.&#8221; Are they?</p><h2>The Limits of Narrow AI</h2><p>LeCun used autoregressive language models like ChatGPT as a prime example of impressive, narrow AI. Autoregressive means these models generate text or data sequentially, with each new token based only on the previous ones. As a result, they cannot reason about the overall meaning of what they're generating. LeCun compared them to <a href="https://www.scientificamerican.com/article/kahneman-excerpt-thinking-fast-and-slow/">Kahneman's System 1 thinking</a> - quick, subconscious reactions rather than conscious reasoning.</p><p>While they seem amazingly fluent and knowledgeable at first glance, language models like ChatGPT suffer from significant shortcomings:</p><ul><li><p>They hallucinate plausible-sounding but false or nonsensical answers</p></li><li><p>Without broader reasoning, they lack common sense and knowledge of how the world works</p></li><li><p>They can't incorporate recent information outside their training data</p></li><li><p>Making them 100% safe is impossible - they can always be "jailbroken."</p></li></ul><p>As LeCun said, "Their intelligence is very limited and really nothing like human intelligence.&#8221;</p><p>What's more, LeCun explained that these models have no connection to physical reality. They acquire knowledge purely through language rather than observing and interacting with the world as we do. Consider how babies learn object permanence, gravity, and basic physics through observation months before they can even speak. Animals likewise navigate the world adaptively without using language. Autoregressive models cannot access this intuitive "background knowledge" we take for granted.</p><h2>Moving Beyond Narrow AI</h2><p>Given the fundamental limits of today's AI, LeCun argues that new approaches are needed to create more capable, general architectures. He believes the key is to move beyond narrow pattern recognition and text generation tasks towards AI that can reason, plan, and understand the physics of the world.</p><p>Several of LeCun's proposals resonated with my intuition for how human and animal minds work:</p><ul><li><p><em>Cognitive architectures</em> incorporate different modules to perceive the world, model dynamics, assess costs, and plan actions like teams of specialists cooperating flexibly.</p></li><li><p><em>Objective-driven systems</em> that optimize hardwired and learned goals and constraints. This prevents uncontrolled or unsafe behavior.</p></li><li><p><em>Hierarchical planning</em> to break down goals and develop solutions at multiple levels of abstraction.</p></li><li><p><em>Multimodal self-supervised learning</em> that models physics and relationships by observing raw sensory data like images, video, and sound - not just text.</p></li></ul><p>The last point echoes how my sons have learned what different animals look like from picture books long before they could read their names or facts about them. The visual experience encoded the concepts in their brains on a deeper level than language alone. LeCun stresses that capturing this implicit knowledge is vital for more human-like learning in AI systems.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!vnGH!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!vnGH!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 424w, https://substackcdn.com/image/fetch/$s_!vnGH!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 848w, https://substackcdn.com/image/fetch/$s_!vnGH!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 1272w, https://substackcdn.com/image/fetch/$s_!vnGH!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!vnGH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png" width="791" height="443" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:443,&quot;width&quot;:791,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!vnGH!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 424w, https://substackcdn.com/image/fetch/$s_!vnGH!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 848w, https://substackcdn.com/image/fetch/$s_!vnGH!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 1272w, https://substackcdn.com/image/fetch/$s_!vnGH!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F7c3673e5-d730-4baf-887a-f6cf1b459dbc_791x443.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">LeCun's vision for a cognitive architecture</figcaption></figure></div><h2>New Training Methods</h2><p>Today's large neural networks are famously data-hungry. For example, Llama-2 trained on massive amounts of online text totaling over two trillion tokens! LeCun says these models cannot learn from experience and incorporate new knowledge over time as humans do, and their knowledge is fixed during training. I wonder if he gives enough credit to prompt engineering techniques such as <a href="https://www.linkedin.com/pulse/demystifying-ai-integration-how-context-stuffing-can-your-roberts%3FtrackingId=K18HwtxTk2XX8E%252FCgDIJUA%253D%253D/?trackingId=K18HwtxTk2XX8E%2FCgDIJUA%3D%3D">retrieval-augmented generation (RAG)</a> and their ability to work around limitations. Still, he has a point about the language models themselves.&nbsp;</p><p>LeCun proposes training advanced AI architectures using very different methods that don't require massive datasets:</p><ul><li><p><em>Self-supervised learning</em> techniques can extract representations from unlabeled data like images, video, and audio. This approach was inspired by how animals learn about the world through observation long before receiving explicit labels or reinforcement.</p></li><li><p><em>Energy-based models</em> aim to learn relationships directly from underlying dependencies in the data rather than relying on huge datasets to cover all possibilities. LeCun says this is closer to how human reasoning works.</p></li><li><p><em>Joint embeddings</em>, where representations are predicted in abstract vector spaces rather than generating pixels or tokens directly, avoid many pitfalls of mainstream generative models.</p></li><li><p><em>Regularization techniques</em> like VICReg and DINO constrain models during training. They help prevent the learned representations from becoming too correlated, collapsed, or meaningless - which is especially useful in self-supervised learning where the model trains on unlabeled data without guiding feedback from labels.</p></li></ul><p>By incorporating these techniques, LeCun and his team at Meta aim to create AI that learns broadly about relationships in different domains in a more efficient, human-like manner rather than brute-force pattern recognition from vast data.</p><h2>The Path Ahead</h2><p>The keynote was technically dense in parts. Nevertheless, the main points that came through for me are:</p><ul><li><p>Today's narrow AI still has fundamental limitations compared to human intelligence.</p></li><li><p>Truly capable, general artificial intelligence will require rethinking system architectures and training.</p></li><li><p>Objective-driven, hierarchical, multimodal learning systems hold promise for achieving more human-like flexible intelligence.</p></li><li><p>New methods like self-supervised learning and energy-based models can reduce dependence on massive datasets.</p></li></ul><p>LeCun's perspective is a refreshing take on the hype. AI has come a long way, but the path to human-level artificial intelligence is still long.</p><p>It is worth finishing that LeCun is no AI doomer. He believes that risks associated with advanced AI can be mitigated through responsible R&amp;D practices. He is optimistic that future human-level AI will greatly benefit humanity. Large language models may be doomed, but we are not.</p><p></p><p><em>Note: </em>Anthropic's<em> Claude-2 and </em>Glasp's<em> YouTube Summary browser extension helped write the first draft of this post. Article image <a href="https://www.midjourney.com/app/jobs/21ca6ac0-7083-4854-b837-50142b477704/">generated</a> by </em>Midjourney</p>]]></content:encoded></item><item><title><![CDATA[The Battle of Large Language Models]]></title><description><![CDATA[Open Source vs Proprietary]]></description><link>https://www.andrewroberts.blog/p/the-battle-of-large-language-models</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/the-battle-of-large-language-models</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Tue, 01 Aug 2023 21:40:00 GMT</pubDate><enclosure url="https://substack-video.s3.amazonaws.com/video_upload/post/136652626/44547567-7c30-4b35-bb13-a80d0c895343/transcoded-00003.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>My views and understanding of the open-source AI landscape have changed enormously over the past few months. On June 24, I shared some of what I've learned at a talk at <a href="https://www.linkedin.com/company/aiify-io/">Aiify.io</a>'s event in San Francisco.<br><br>Thanks to <a href="https://www.linkedin.com/in/ACoAAAAnz1gB3qFVoF43zBlXcQACz0o5WSRXamo">Steven Echtman</a> for putting on such a great event; the entire day was streamed on YouTube and had lots of excellent content.</p>]]></content:encoded></item><item><title><![CDATA[The Rise of AI UX ]]></title><description><![CDATA[How to design AI assistants that users will love]]></description><link>https://www.andrewroberts.blog/p/the-rise-of-ai-ux</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/the-rise-of-ai-ux</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Thu, 20 Jul 2023 20:57:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/dfc4420b-5297-4100-b18a-c2d17ba2b9d0_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Imagine having an assistant who could chat with you to help get work done, answer questions, or brainstorm ideas. We are all familiar with ChatGPT, but what if its smarts could be directly embedded in our workflow? AI "copilots" are making this a reality. But designing great copilot experiences takes way more thoughtfulness and care than I initially thought.</p><p>A few weeks ago, I stumbled on a great video on <a href="https://www.youtube.com/watch?v=WiCVEMH4HTI">AI user experience (UX) design</a> from the Microsoft Build conference. <a href="https://www.linkedin.com/in/starparticle?miniProfileUrn=urn%3Ali%3Afs_miniProfile%3AACoAAAZ-paoBFWSBCGHmiKo2TJlCMiVwG-kTgVw">Rachel Shepard</a>, Director of Design for AI Platform, and <a href="https://www.linkedin.com/in/kurtisbeavers?miniProfileUrn=urn%3Ali%3Afs_miniProfile%3AACoAAAHSzUkBcRqf8rEfO25ELpI-yyeru7-Hp4M">Kurtis Beavers</a>, Fluent AI Design Director, discussed designing for Copilot - Microsoft&#8217;s name for embedded AI assistants surfacing throughout their products. I highly recommend that you check out the whole video.</p><p>As design leaders for Microsoft AI products, they&#8217;ve seen firsthand how to create copilots, or AI assistants, with quality UX. In this post, I&#8217;ll share some of their insider tips on crafting a copilot, from managing expectations to continuous improvement. Deceptively simple, there is so much to be thought through that I believe that AI UX will blossom into a whole specialty unto itself.</p><h2>Defining Copilot</h2><p>To start, what exactly is a copilot? Essentially, an AI agent collaborates with you on tasks through natural conversation. Under the hood, copilots leverage large language models like GPT-4 to generate responses based on their vast training data and augment it with your application's data. But to users, it just feels like chatting with a helpful assistant.&nbsp;</p><p>Compared to traditional apps, copilot interactions are more unpredictable, like a choose-your-own-adventure story. Users drive personalized experiences based on their questions and requests. Designers have to embrace this ambiguity rather than try to control rigid paths. In the video, Rachel calls for a probabilistic mindset attuned to likely outcomes, not prescripted certainties.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!rFvn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!rFvn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 424w, https://substackcdn.com/image/fetch/$s_!rFvn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 848w, https://substackcdn.com/image/fetch/$s_!rFvn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 1272w, https://substackcdn.com/image/fetch/$s_!rFvn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!rFvn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png" width="1456" height="559" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/ec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:559,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!rFvn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 424w, https://substackcdn.com/image/fetch/$s_!rFvn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 848w, https://substackcdn.com/image/fetch/$s_!rFvn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 1272w, https://substackcdn.com/image/fetch/$s_!rFvn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fec5f654d-ec2c-4f49-8409-5f925709dbb9_2074x796.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><h2>Laying the Foundation&nbsp;</h2><p>The <a href="https://fluent2.microsoft.design/">Microsoft Fluent 2 Design System</a> provides the foundation when constructing a copilot, at least for Microsoft products. They have expanded it beyond the typical focus on visual components to include education, conversation norms, trust building, and, crucially, prompt design.&nbsp;Their core AI UX principles include:</p><ul><li><p>Establish <strong>appropriate trust</strong> in the system</p></li><li><p>Help people <strong>spot inaccurate or potentially harmful content</strong></p></li><li><p>Help people form <strong>better inputs</strong></p></li><li><p>Help people <strong>understand &amp; use outputs</strong></p></li></ul><p>Framing the experience properly is critical so users understand the technology&#8217;s current stage. We are coaching users on collaborating effectively rather than setting sky-high expectations. The proper framing shapes how people perceive the system&#8217;s strengths and limitations.</p><h2>Copilot Altitude</h2><p>Copilots can engage with users across different modalities, or "altitudes," as they call them. Full-screen chat interfaces for free-form dialogue are on one end of the spectrum ("above"). This works well for open-ended queries or back-and-forth brainstorming.</p><p>In other cases, a side-panel copilot (&#8220;beside&#8221;) might monitor the core app experience and interactively assist users. Copilots can also be embedded "inside" specific UI elements for focused micro-tasks to provide just-in-time guidance.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!-qDE!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!-qDE!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 424w, https://substackcdn.com/image/fetch/$s_!-qDE!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 848w, https://substackcdn.com/image/fetch/$s_!-qDE!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 1272w, https://substackcdn.com/image/fetch/$s_!-qDE!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!-qDE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png" width="1456" height="694" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/a2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:694,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!-qDE!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 424w, https://substackcdn.com/image/fetch/$s_!-qDE!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 848w, https://substackcdn.com/image/fetch/$s_!-qDE!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 1272w, https://substackcdn.com/image/fetch/$s_!-qDE!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fa2450761-4dfa-4d1d-9535-78b39ea0729a_2210x1054.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Determining the appropriate modality depends on the use case and context. Sidebars suit complex workflows by advising on app functionality. Embedded copilots help with precise steps. Chat excels for discovery and exploration. Mixing modalities allows each interaction to feel fluid and natural.</p><h2>The Deceptive Simplicity of Copilot UI&nbsp;&nbsp;</h2><p>Copilot UIs look straightforward on the surface. Behind the minimalist interfaces, though, designers must craft details thoughtfully&#8212;elements like previews, citations, and notifications.&nbsp;</p><p>The teams at Microsoft spend lots of time working through nuanced functionality around latency, accuracy, grounding in source materials, and more. How might they balance a snappy response time while delivering an in-depth quality result? What sources or data should responses link to so users can verify credibility? How do they indicate when a copilot hallucinates an answer rather than pulling from reliable information?</p><p>Every design choice aims to build users&#8217; trust at the appropriate level. While copilots can generate helpful information, they sometimes fabricate or hallucinate content that requires correction. UI tweaks help flag these issues, allowing people to steer toward truthful insights.</p><h2>The Art of Prompt Engineering&nbsp;&nbsp;</h2><p>Prompt engineering takes experimentation and patience. You could compare it to perfecting a recipe. The ingredients and instructions must precisely guide the AI to serve the knowledge you&#8217;re hungry for. Going too broad or too vague produces mushy meaningless results. Overly narrow or complex prompts also risk failure.&nbsp;</p><p>Start by writing prompts that clearly describe the problem to be solved or the question to be answered. Build in guardrails to bound the scope if needed. Provide examples that illustrate the desired tone and depth. Work iteratively, tweaking prompts based on the outputs generated until you consistently get delicious results.&nbsp;</p><p>Lately, I have been successfully expanding my often simplistic prompts into much more detailed instructions for the language model. A well-crafted prompt makes a massive difference to the output from an LLM, and there are <a href="https://www.promptingguide.ai/techniques">many techniques</a> you can use to improve them.</p><h2>The People Powering Copilots</h2><p>Copilots demand increased collaboration between groups that previously didn&#8217;t intersect much, like design, research, engineering, content strategy, and trust and safety teams.&nbsp;</p><p>This rallying of different experts unlocks creativity and checks potential pitfalls. Designers ground lofty AI ambitions in realistic user needs. Engineers advise on technical constraints that enable seamless experiences. Researchers supply data-driven insights and identify pain points. Prompt engineers finesse conversation flow. Ethicists monitor for potential harm. The combined superpowers drive breakthroughs.</p><p>Developing AI responsibly requires this collaborative spirit. Copilots augment human capabilities rather than replace them. So designing copilots together makes the output more insightful than any individual could produce alone.</p><h2>Launching and Iterating&nbsp;</h2><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!__Ub!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!__Ub!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 424w, https://substackcdn.com/image/fetch/$s_!__Ub!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 848w, https://substackcdn.com/image/fetch/$s_!__Ub!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 1272w, https://substackcdn.com/image/fetch/$s_!__Ub!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!__Ub!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png" width="1456" height="770" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:770,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!__Ub!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 424w, https://substackcdn.com/image/fetch/$s_!__Ub!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 848w, https://substackcdn.com/image/fetch/$s_!__Ub!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 1272w, https://substackcdn.com/image/fetch/$s_!__Ub!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F645d26ae-b6d3-42aa-98d5-9f8ce3964427_2096x1108.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a></figure></div><p>Shipping copilots requires a build-measure-learn approach. Since you can&#8217;t fully predict how users will converse, regularly releasing minimum viable products lets you gather feedback and continuously refine.&nbsp;</p><p>Plan ahead for how improvements will integrate back into the copilot post-launch. Monitor both qualitative signals like user delight as well as usage metrics. Be ready to update content and prompts based on common queries. Expand capabilities over time as the models advance.&nbsp;</p><p>Occasionally, you may decide certain features are no longer needed if the copilot proves adept at handling associated tasks. It takes humility to recognize when technology evolves past previous solutions. But minimizing duplication avoids clutter and complexity.</p><h2>The Future of AI Assistance&nbsp;&nbsp;&nbsp;</h2><p>It&#8217;s an exciting time to think about how AI assistants integrate into our lives. We will continue dreaming up ways to combine human strengths with intelligent machines. AI can help us achieve our goals while avoiding drudgery. Approached thoughtfully, it will enhance how we learn new skills, complete workflows, run our businesses, and more.&nbsp;</p><p>I can't wait to see how "AI UX" designers build out the patterns and frameworks we will need to be successful. We are still in the Geocities era of AI.</p>]]></content:encoded></item><item><title><![CDATA[GPT for the real world]]></title><description><![CDATA[Generative AI in geospatial applications]]></description><link>https://www.andrewroberts.blog/p/gpt-for-the-real-world</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/gpt-for-the-real-world</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Tue, 06 Jun 2023 20:53:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/71f666ae-3635-422c-b7c2-e626747b5bc5_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>We visited Australia Zoo on holiday in December, and my son has gone all-in on the Irwins ever since. So, naturally, I asked ChatGPT to create a text-based role-playing game to indulge his croc obsession.</p><pre><code>Pretend you are a computer game that uses text to describe a scene. 
The player can then interact with the scene using natural language. 
The initial starting point for the game is the entrance to 
Australia Zoo.</code></pre><p>What followed was a realistic interaction about a real place. It knew the shows, attractions, and, most surprisingly, the geography. Crikey! I haven&#8217;t seen much coverage of generative AI in geospatial applications, but the models are wrestling with some emergent location-based understanding.</p><p>Fortunately, some good folks from academia had the same thought, and on May 30, they published the <a href="https://arxiv.org/abs/2306.00020">GPT4GEO paper</a>. The collaboration between researchers at universities in the UK, Germany and China explores the factual geographic knowledge of GPT-4 and its application-centric geographic reasoning. They conclude, &#8220;We find GPT-4 to have a remarkable understanding of the world.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2" target="_blank" href="https://substackcdn.com/image/fetch/$s_!6Dbb!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!6Dbb!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 424w, https://substackcdn.com/image/fetch/$s_!6Dbb!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 848w, https://substackcdn.com/image/fetch/$s_!6Dbb!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 1272w, https://substackcdn.com/image/fetch/$s_!6Dbb!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!6Dbb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png" width="719" height="125" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/cbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:125,&quot;width&quot;:719,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!6Dbb!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 424w, https://substackcdn.com/image/fetch/$s_!6Dbb!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 848w, https://substackcdn.com/image/fetch/$s_!6Dbb!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 1272w, https://substackcdn.com/image/fetch/$s_!6Dbb!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Fcbf0a7e8-471b-464d-bc9f-3d05788b8a52_719x125.png 1456w" sizes="100vw" fetchpriority="high"></picture><div></div></div></a><figcaption class="image-caption">GPT4GEO experiments taxonomy [1]</figcaption></figure></div><p>The factual or descriptive knowledge of GPT-4 is fascinating, but I don&#8217;t think it matters too much. <a href="https://www.linkedin.com/pulse/demystifying-ai-integration-how-context-stuffing-can-your-roberts%3FtrackingId=IKeOBgT0SkmhWfLQmZ%252Bi%252FA%253D%253D/?trackingId=IKeOBgT0SkmhWfLQmZ%2Bi%2FA%3D%3D">Retrieval augmented generation (RAG)</a> could provide accurate geospatial data from Google Maps and other sources. I was surprised that there isn&#8217;t yet a plugin on ChatGPT to do this; it won&#8217;t be long before there is.</p><p>The application-centric geographic reasoning, such as route planning, navigation, itinerary planning, and supply chain analysis, gets interesting. The paper showed that the model&nbsp;can develop &#8220;creative, plausible travel routes&#8221; and &#8220;shows strong capabilities of direction-based navigation.&#8221;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!ik-X!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!ik-X!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 424w, https://substackcdn.com/image/fetch/$s_!ik-X!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 848w, https://substackcdn.com/image/fetch/$s_!ik-X!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 1272w, https://substackcdn.com/image/fetch/$s_!ik-X!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!ik-X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png" width="714" height="251" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:251,&quot;width&quot;:714,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!ik-X!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 424w, https://substackcdn.com/image/fetch/$s_!ik-X!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 848w, https://substackcdn.com/image/fetch/$s_!ik-X!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 1272w, https://substackcdn.com/image/fetch/$s_!ik-X!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F9a3765f1-ecf5-49b7-9c18-4ec9af46e969_714x251.png 1456w" sizes="100vw"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Travel itinerary suggestion for an 8-day trip in Ireland starting from Miami [1]</figcaption></figure></div><h3>Fire Safety GPT</h3><p>I&#8217;ve been advising <a href="https://www.linkedin.com/company/locatrix/">Locatrix</a>, an indoor mapping software company. Locatrix focuses on fire safety, and I was curious whether GPT-4 could generate an evacuation plan based on an office floor plan or recommend where to place fire extinguishers. The short answer is yes, yes, it can.&nbsp;</p><p>I asked GPT-4 to &#8220;Create the floor plan and detailed layout of a 5,000 square foot floor of an office building. Return it as JSON.&#8221; It did a solid job describing the area, including a reception, four meeting rooms, six private offices and an open office space.</p><p>I then followed up with &#8220;Create an evacuation plan for this office&#8221;, which it proceeded to do. At a high level, it looks pretty good and claims, &#8220;Each exit point has a route from every section of the office floor (Reception, OpenOfficeSpace, PrivateOffices, MeetingRooms). The path represents the sequence of positions to be traversed from each section to reach the exit point.&#8221;</p><p>To push the limit, I asked, &#8220;Suggest locations for fire extinguisher locations to meet the fire code in San Francisco.&#8221; Sure enough, it returned suggested plausible locations and details on how it complied with the local code. &#8220;As per the specific codes applicable to San Francisco, each extinguisher should be: within 75 feet of all portions of the premise, within 30 feet of any cooking equipment, and regularly inspected and maintained.&#8220;</p><p>You can read the full transcript of my <a href="https://chat.openai.com/share/1182c71a-bc04-4142-818b-6e64213f1cab">interaction with ChatGPT</a> and try it out for yourself. It could be better. For example, I couldn&#8217;t get it to produce any decent visualizations. Nevertheless, you can see the future from here.</p><p>Coding and data analysis are tasks already well suited to the GPT architecture. Geospatial features can all be represented as text too, and I suspect that a few years from now, every large building or campus will have a copilot to help with space planning, safety, and maintenance.&nbsp;</p><p>&#8220;Robot, we need to evacuate. Help everyone get out now!&#8221; is not far off.&nbsp;Every building will be an AI building.</p><p><em>[1] <a href="https://arxiv.org/abs/2306.00020">GPT4GEO: How a Language Model Sees the World's Geography</a> by Jonathan Roberts, Timo L&#252;ddecke, Sowmen Das, Kai Han, Samuel Albanie. Submitted to Arxiv on May 30, 2023.</em></p>]]></content:encoded></item><item><title><![CDATA[Open source AI: it's here, but is it a good thing?]]></title><description><![CDATA[Not long ago, I believed that the future of large language models (LLMs) would be in the hands of large software companies. Companies like Microsoft (OpenAI), Google, and Anthropic had deep pockets and were in pole position. They could afford to pour hundreds of millions of dollars into crafting and upgrading LLMs. The idea of open source AI being a rival seemed far-fetched.]]></description><link>https://www.andrewroberts.blog/p/open-source-ai-its-here-but-is-it</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/open-source-ai-its-here-but-is-it</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Wed, 24 May 2023 20:50:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/9c18af91-3980-4b6c-9680-44703685c4c7_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>Not long ago, I believed that the future of large language models (LLMs) would be in the hands of large software companies. Companies like Microsoft (OpenAI), Google, and Anthropic had deep pockets and were in pole position. They could afford to pour hundreds of millions of dollars into crafting and upgrading LLMs. The idea of open source AI being a rival seemed far-fetched.</p><p>Recently, I've had a change of heart. Open source is no underdog. In fact, it&#8217;s stepping up, creating models that match and potentially surpass the capabilities of the best foundational models. How did we get here?</p><p>Trust and transparency will motivate larger end-user organisations to work with open source models. Open source solutions are often easier to self-host, and many organisations value keeping data behind a firewall or within their country&#8217;s borders. The models and training methodology will also be open and accessible to anyone and allow for scrutiny. For industries dealing with sensitive information, this control over data is paramount. This preference among large users will give open source a business model. Vendors can and will commercialise open source projects with proprietary add-ons (&#8220;open core&#8221;) and support. At Tiny, we saw the power of this approach in action and signed up over 2,000 paying customers.</p><p>As a result of this now tried-and-true business model, commercial open source AI vendors will almost certainly generate tens of millions in revenue and attract venture capital to support their growth. That&#8217;s enough for many great engineers, but surely more is needed for the compute time to compete with hyperscalers? The enormous costs associated with competing with tech giants still seemed like a deal breaker, but two factors are pushing back.</p><p>First, open source models can tap into a massive community of developers, researchers, and users. The new algorithms and approaches they invent, such as low-rank adaptation (<a href="https://arxiv.org/abs/2106.09685">LoRA</a>), can significantly reduce training costs. Stanford&#8217;s <a href="https://github.com/tatsu-lab/stanford_alpaca">Alpaca</a> model, announced in March, can be trained for less than $600 and <a href="https://crfm.stanford.edu/2023/03/13/alpaca.html">behaves</a> &#8220;qualitatively similarly to OpenAI&#8217;s text-davinci-003 while being surprisingly small and easy/cheap to reproduce.&#8221; Another promising contender, Guanaco, was announced today, and you can play <a href="https://huggingface.co/spaces/uwnlp/guanaco-playground-tgi">with it here</a>.</p><p>Second, big tech companies trailing behind Microsoft and Google, such as Meta, Amazon, Intel, IBM or Alibaba, see the strategic value in supporting open source AI. A prime example is Meta's LLaMa model, which was transformed by academia into an even more powerful solution within weeks. This resulted in the infamous &#8220;we don't have a moat, and neither does OpenAI&#8221; <a href="https://www.semianalysis.com/p/google-we-have-no-moat-and-neither">memo</a> from a Google employee.&nbsp;</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!XcKN!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!XcKN!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 424w, https://substackcdn.com/image/fetch/$s_!XcKN!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 848w, https://substackcdn.com/image/fetch/$s_!XcKN!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 1272w, https://substackcdn.com/image/fetch/$s_!XcKN!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!XcKN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png" width="1366" height="588" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:588,&quot;width&quot;:1366,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!XcKN!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 424w, https://substackcdn.com/image/fetch/$s_!XcKN!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 848w, https://substackcdn.com/image/fetch/$s_!XcKN!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 1272w, https://substackcdn.com/image/fetch/$s_!XcKN!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F4670a1e9-e249-431c-a070-d9dd6557118b_1366x588.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Image from the Google memo. Note that the improvement is in weeks, not months or years</figcaption></figure></div><p>So, the conclusion appears clear. Open source AI isn't just surviving, it's thriving. But is this a good thing? The dangers of AI are real and growing. Powerful models in the wrong hands can aid fraud and disinformation, as demonstrated in this <a href="https://twitter.com/60minutes/status/1660428419438354435?s=61&amp;t=ekeS-B24UOjLqn2xEP9uFQ">60 Minutes clip</a>.</p><p>It will also power the war machines of the future. Russia&#8217;s invasion of Ukraine and the growing tensions across the Taiwan Strait have set us on a new geopolitical path. Powerful open source models enable enemies as much as allies. Should we fail in AI alignment, we could inadvertently give birth to autonomous weapons with their own agendas and no regard for human life. It is easy to argue that open source will make alignment harder, not easier.</p><p>Open source has contributed to tremendous progress over the past twenty years. At Tiny, we were proud of our open source software's positive impact &#8211; from helping fight California wildfires to assisting social workers in Sweden. These stories often infused us with a sense of purpose.</p><p>My own career also owes a lot to open source software, and I want to engage more in open source AI. Yet, I grapple with this question: Are we promoting open source AI to fuel a potential catastrophe? Or is open source AI, despite its risks, a force for good in the world, preferable to a future dominated by a handful of megacorps? I&#8217;m torn, and I'd love to hear your thoughts.</p>]]></content:encoded></item><item><title><![CDATA[Do not go gentle into that code night. ]]></title><description><![CDATA[Rage, rage with AI by your side in the fight.]]></description><link>https://www.andrewroberts.blog/p/do-not-go-gentle-into-that-code-night</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/do-not-go-gentle-into-that-code-night</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Wed, 17 May 2023 20:39:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/69072286-c10c-4b40-9d48-473589963c2e_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>The tech industry is like a perpetual high-speed chase &#8211; insanely competitive, ever-evolving, never a dull moment. We're in a constantly changing landscape that never lets you catch your breath. And let's face it, there's a lot of ageism. Remember when Mark Zuckerberg said, &#8220;Young people are just smarter&#8221;? That must be a fun memory for him now, as he's getting ready to hit the big 4-0.</p><p>Around 20 years ago, I swapped out my coder's beanie for a founder's cap. I launched a company and left behind hands-on software development. It was a wild journey where I learned so much, but my coding skills? They started gathering dust. As I step back into the developer's arena, it feels like I'm the new kid on the tech block.</p><p>Recently, I <a href="https://www.linkedin.com/feed/update/urn:li:activity:7037552753623130112/">sold my company</a> and felt this urge to dive back into product building. I missed the thrill of creating something new from scratch. Plus, AI's current wave has me utterly fascinated. Having had a few encounters with AI throughout my career, this time, it feels different. The shift reminds me of witnessing the birth of the internet. It's kindled tech enthusiasm in me all over again.</p><p>How is my coding comeback going? It's been really fun, all thanks to AI. Using tools like ChatGPT, GitHub CoPilot, and Replit&#8217;s Ghostwriter feels like having a magic wand. It helps me dust off my knowledge of server architecture, Unix commands, JavaScript, and my 90s-era computer science degree, turning it into something practical. Now I can ask questions, fix errors, and even enhance my code, all with a little help from my AI friends.</p><p>Some folks are worried that AI will replace programmers. I find that scenario unlikely, but I do see a massive productivity boost on the horizon. AI enables engineers to stay sharp longer and shift between technologies much more quickly.</p><p>Without AI, plunging back into product creation could've felt like navigating a dense jungle without a guide. Now, it's like taking a guided tour with a wise companion.</p><p>--</p><p>And, for some fun, I've included an adaptation of Dylan Thomas's famous poem written, of course, by ChatGPT.</p><div class="preformatted-block" data-component-name="PreformattedTextBlockToDOM"><label class="hide-text" contenteditable="false">Text within this block will maintain its original spacing when published</label><pre class="text">Do not go gentle into that code night,
Software crafters nearing twilight's sight,
Burn and debug at the end of day;
Rage, rage against the obsolescence of the byte.

Though wise men at their terminals concede time's flight,
With AI assistance, their craft ignites,
They do not go gentle into that code night.

Good coders, the last wave by, sensing the bright
Their frail code might have danced with AI's light,
Rage, rage against the obsolescence of the byte.

Wild coders who caught and sang the sun in flight,
With machine learning, regained their height,
Do not go gentle into that code night.

Grave coders, near dusk, who see with blinding sight
Old algorithms could blaze like meteors with AI's might,
Rage, rage against the obsolescence of the byte.

And you, my coder, there on the sad height,
Curse, bless, me now with your fierce tears, I pray.
Do not go gentle into that code night.
Rage, rage with the AI by your side in the fight.</pre></div>]]></content:encoded></item><item><title><![CDATA[The easy way to integrate data with generative AI]]></title><description><![CDATA[You&#8217;ve probably heard that data will be super important in our brave new world of generative AI.]]></description><link>https://www.andrewroberts.blog/p/the-easy-way-to-integrate-data-with-ai</link><guid isPermaLink="false">https://www.andrewroberts.blog/p/the-easy-way-to-integrate-data-with-ai</guid><dc:creator><![CDATA[Andrew Roberts]]></dc:creator><pubDate>Tue, 16 May 2023 19:45:00 GMT</pubDate><enclosure url="https://substack-post-media.s3.amazonaws.com/public/images/de2b4c70-83bf-4770-8fe1-12e7ff3a1a1b_752x421.png" length="0" type="image/jpeg"/><content:encoded><![CDATA[<p>You&#8217;ve probably heard that data will be super important in our brave new world of generative AI. As an app developer, you&#8217;ve probably heard that your existing reservoir of customer data means you are in a great position to use AI. But what does that mean? How do you actually build it?</p><p>Until recently, I had thought &#8220;fine-tuning&#8221; a large language model (LLM) was necessary. I understand fine-tuning at a high level, and it is complicated to implement and keep up to date. A new category of &#8220;LLM Ops&#8221; tools is emerging to make this easier, but it still feels like a heavy lift.</p><p>Since <a href="https://www.linkedin.com/feed/update/urn:li:activity:7037552753623130112/">handing over the reins</a> at Tiny, I&#8217;ve dusted off my rusty developer skills and tried to get up-to-speed on AI. I want to share a recent &#8220;aha&#8221; on using prompt engineering and <a href="https://twitter.com/gdb/status/1271311766639611906">context stuffing</a> of out-of-the-box LLMs such as ChatGPT to achieve extraordinary integration with your data. I believe this technique is also called &#8220;retrieval-augmented generation&#8221;, or RAG. You have to love tech industry acronyms!</p><p>When you hear &#8220;prompt engineering&#8221;, you probably think of the one-or-two sentences you type into ChatGPT to get a poem about cats in the style of Eminem. Indeed, there is helpful guidance to an LLM that you can give in just a few words, but you can give them a lot more context, significantly more! Anthropic recently announced a 100,000 token model, and the head of OpenAI, Sam Altman, has floated the idea of a 1 million token model in the future. (For the uninitiated, a token can be considered a word.)</p><p>By putting your software between the user and the LLM, you can retrieve data and assemble a big prompt. Or even a colossal prompt. While this might seem obvious, the penny didn&#8217;t drop for me until I listened to this <a href="https://www.youtube.com/watch?v=tW2EA4aZ_YQ">video</a> from Microsoft on how to use ChatGPT with your enterprise data. In their related GitHub <a href="https://github.com/Azure-Samples/azure-search-openai-demo/">repo</a>, they have a helpful diagram that I&#8217;ve adapted here (with added inspiration <a href="https://aws.amazon.com/blogs/machine-learning/question-answering-using-retrieval-augmented-generation-with-foundation-models-in-amazon-sagemaker-jumpstart/">from</a> AWS):</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!QYjn!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!QYjn!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 424w, https://substackcdn.com/image/fetch/$s_!QYjn!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 848w, https://substackcdn.com/image/fetch/$s_!QYjn!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 1272w, https://substackcdn.com/image/fetch/$s_!QYjn!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!QYjn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png" width="1456" height="938" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/f75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:938,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:false,&quot;topImage&quot;:true,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!QYjn!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 424w, https://substackcdn.com/image/fetch/$s_!QYjn!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 848w, https://substackcdn.com/image/fetch/$s_!QYjn!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 1272w, https://substackcdn.com/image/fetch/$s_!QYjn!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2Ff75ceead-7a6c-48bd-a3b0-2e8f5791b0d1_1488x959.png 1456w" sizes="100vw" fetchpriority="high"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">The basic flow of retrieving search results to add to a prompt you send to an LLM.</figcaption></figure></div><p>The example used is an internal HR application that lets users ask questions such as &#8220;Do my benefits cover this expense?&#8221; The app server searches an internal knowledge system to find PDFs that describe the benefits. It then uses the snippets of relevant knowledge from these PDFs and a specially worded prompt to get the large language model to produce a custom answer specific to the organisation.</p><p>The beauty of this approach is that your app can respect user permissions. You integrate only the data your user has permission to see into the prompt you send to the language model.</p><p>Once you have your proprietary data, you can build the prompt with additional system instructions. In Microsoft&#8217;s example, <a href="https://github.com/Azure-Samples/azure-search-openai-demo/blob/ffc416ca2d5f7cc4d668e78d8dbf4972dee5fd8f/notebooks/read-decompose-ask.ipynb?short_path=6c3c642#L169">they</a> lock down the AI with the following instructions added to the prompt:&nbsp;</p><pre><code>Answer questions as shown in the following examples, by 
splitting the question into individual search or lookup actions 
to find facts until you can answer the question. Observations 
are prefixed by their source name in square brackets, source 
names MUST be included with the actions in the answers. All 
questions must be answered from the results from search or 
look up actions, only facts resulting from those can be used 
in an answer. Answer questions as truthfully as possible, and 
ONLY answer the questions using the information from 
observations, do not speculate or your own knowledge.</code></pre><p>With these instructions, the AI will only use facts you have provided and not hallucinate new information for your users. You can see how Microsoft implemented this idea with Bing Chat.</p><div class="captioned-image-container"><figure><a class="image-link image2 is-viewable-img" target="_blank" href="https://substackcdn.com/image/fetch/$s_!MXdv!,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png" data-component-name="Image2ToDOM"><div class="image2-inset"><picture><source type="image/webp" srcset="https://substackcdn.com/image/fetch/$s_!MXdv!,w_424,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 424w, https://substackcdn.com/image/fetch/$s_!MXdv!,w_848,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 848w, https://substackcdn.com/image/fetch/$s_!MXdv!,w_1272,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 1272w, https://substackcdn.com/image/fetch/$s_!MXdv!,w_1456,c_limit,f_webp,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 1456w" sizes="100vw"><img src="https://substackcdn.com/image/fetch/$s_!MXdv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png" width="1456" height="732" data-attrs="{&quot;src&quot;:&quot;https://substack-post-media.s3.amazonaws.com/public/images/21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png&quot;,&quot;srcNoWatermark&quot;:null,&quot;fullscreen&quot;:null,&quot;imageSize&quot;:null,&quot;height&quot;:732,&quot;width&quot;:1456,&quot;resizeWidth&quot;:null,&quot;bytes&quot;:null,&quot;alt&quot;:&quot;No alt text provided for this image&quot;,&quot;title&quot;:null,&quot;type&quot;:null,&quot;href&quot;:null,&quot;belowTheFold&quot;:true,&quot;topImage&quot;:false,&quot;internalRedirect&quot;:null,&quot;isProcessing&quot;:false,&quot;align&quot;:null,&quot;offset&quot;:false}" class="sizing-normal" alt="No alt text provided for this image" title="No alt text provided for this image" srcset="https://substackcdn.com/image/fetch/$s_!MXdv!,w_424,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 424w, https://substackcdn.com/image/fetch/$s_!MXdv!,w_848,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 848w, https://substackcdn.com/image/fetch/$s_!MXdv!,w_1272,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 1272w, https://substackcdn.com/image/fetch/$s_!MXdv!,w_1456,c_limit,f_auto,q_auto:good,fl_progressive:steep/https%3A%2F%2Fsubstack-post-media.s3.amazonaws.com%2Fpublic%2Fimages%2F21f1effe-6e65-43a2-b419-84f452305c28_1654x832.png 1456w" sizes="100vw" loading="lazy"></picture><div class="image-link-expand"><div class="pencraft pc-display-flex pc-gap-8 pc-reset"><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container restack-image"><svg role="img" width="20" height="20" viewBox="0 0 20 20" fill="none" stroke-width="1.5" stroke="var(--color-fg-primary)" stroke-linecap="round" stroke-linejoin="round" xmlns="http://www.w3.org/2000/svg"><g><title></title><path d="M2.53001 7.81595C3.49179 4.73911 6.43281 2.5 9.91173 2.5C13.1684 2.5 15.9537 4.46214 17.0852 7.23684L17.6179 8.67647M17.6179 8.67647L18.5002 4.26471M17.6179 8.67647L13.6473 6.91176M17.4995 12.1841C16.5378 15.2609 13.5967 17.5 10.1178 17.5C6.86118 17.5 4.07589 15.5379 2.94432 12.7632L2.41165 11.3235M2.41165 11.3235L1.5293 15.7353M2.41165 11.3235L6.38224 13.0882"></path></g></svg></button><button tabindex="0" type="button" class="pencraft pc-reset pencraft icon-container view-image"><svg xmlns="http://www.w3.org/2000/svg" width="20" height="20" viewBox="0 0 24 24" fill="none" stroke="currentColor" stroke-width="2" stroke-linecap="round" stroke-linejoin="round" class="lucide lucide-maximize2 lucide-maximize-2"><polyline points="15 3 21 3 21 9"></polyline><polyline points="9 21 3 21 3 15"></polyline><line x1="21" x2="14" y1="3" y2="10"></line><line x1="3" x2="10" y1="21" y2="14"></line></svg></button></div></div></div></a><figcaption class="image-caption">Bing uses retrieval augmented generation to ground its answers in search results.</figcaption></figure></div><p>Perhaps this is old news for you but, for me, hearing about this approach was an &#8220;aha&#8221; that existing data that be used to build a differentiated generative AI experience without training or fine-tuning or a custom model. OpenAI is yet to offer a way to fine-tune GPT-4, so they see the value in how much customisation can be achieved through context stuffing and similar techniques.</p><p>If this is still mysterious and my explanations confuse you, I recommend the &#8220;<a href="https://www.youtube.com/watch?v=tW2EA4aZ_YQ">Can ChatGPT work with your enterprise data</a>?&#8221; YouTube mentioned above.</p>]]></content:encoded></item></channel></rss>