AI Learning YouTube News & VideosMachineBrain

Mastering Deep Seek: Hacks for Agent Integration with Pantic AI

Mastering Deep Seek: Hacks for Agent Integration with Pantic AI
Image copyright Youtube
Authors
    Published on
    Published on

In this episode, the team delves into the intricate world of Deep seek, a powerful model designed for structured responses. They confront the challenges posed by the model's lack of support for function calling and JSON output, crucial components in the realm of agent-building. Through ingenious hacks, they showcase how to maneuver around these obstacles and seamlessly integrate Deep seek into agents using the versatile Pantic AI platform. The team sheds light on the similar hurdles faced by the Gemini 2.0 thinking models, hinting at a shared journey towards enhanced functionality.

Venturing deeper into the intricacies of structured responses, the team unveils Deep seek's own insights on the matter, emphasizing the significance of prompt engineering and API configuration. By demonstrating a practical method to leverage Pantic AI with Deep seek, they provide a roadmap for obtaining structured outputs efficiently. By setting up the Deep seek API within the Pantic AI framework, they demonstrate the flexibility of switching between models to tailor responses to specific tasks, showcasing the adaptability and power of these cutting-edge technologies.

The team's hands-on approach involves utilizing the Deep seek chat model for a search agent, while grappling with the limitations of the Deep seek R1 reasoning model in handling function calling. To overcome this hurdle, they ingeniously employ a simpler model for formatting structured outputs, ensuring a smooth flow of information. Emphasizing the importance of capturing both content and reasoning content from Deep seek R1's responses, they delve into the intricacies of the model's output structure, highlighting the need for a comprehensive understanding of the reasoning chain of thought.

In a captivating twist, the team navigates through the nuances of multi-round conversations, underlining the strategic storage and utilization of the Chain of Thought for optimal results. By showcasing a method to extract both content and reasoning content from Deep seek R1's responses using a standard OpenAI call, they demonstrate a blend of innovation and practicality in harnessing the full potential of this groundbreaking technology.

mastering-deep-seek-hacks-for-agent-integration-with-pantic-ai

Image copyright Youtube

mastering-deep-seek-hacks-for-agent-integration-with-pantic-ai

Image copyright Youtube

mastering-deep-seek-hacks-for-agent-integration-with-pantic-ai

Image copyright Youtube

mastering-deep-seek-hacks-for-agent-integration-with-pantic-ai

Image copyright Youtube

Watch DeepSeek R1 for Structured Agents on Youtube

Viewer Reactions for DeepSeek R1 for Structured Agents

Using a reasoning model as a tool for various processes

Combining with Gemini Flash for cleaning output

Concerns about effort and potential obsolescence of techniques

Mention of potential new models like o3 and Opus

Converting JSON output to XML

Building an MCP server with reasoning tools

Appreciation for providing Hindi track

Mention of trying Kimi 1.5

Using models for cybersecurity and penetration testing

Comparison between DeepSeek and OpenAI search

quens-qwq-32b-model-local-reasoning-powerhouse-outshines-deep-seek-r1
Sam Witteveen

Quen's qwq 32b Model: Local Reasoning Powerhouse Outshines Deep seek R1

Quen introduces the powerful qwq 32b local reasoning model, outperforming the Deep seek R1 in benchmarks. Available on Hugging Face for testing, this model offers top-tier performance and accessibility for users interested in cutting-edge reasoning models.

microsofts-f4-and-54-models-revolutionizing-ai-with-multimodal-capabilities
Sam Witteveen

Microsoft's F4 and 54 Models: Revolutionizing AI with Multimodal Capabilities

Microsoft's latest F4 and 54 models offer groundbreaking features like function calling and multimodal capabilities. With billions of parameters, these models excel in tasks like OCR and translation, setting a new standard in AI technology.

unveiling-openais-gpt-4-5-underwhelming-performance-and-high-costs
Sam Witteveen

Unveiling OpenAI's GPT 4.5: Underwhelming Performance and High Costs

Sam Witteveen critiques OpenAI's GPT 4.5 model, highlighting its underwhelming performance, high cost, and lack of innovation compared to previous versions and industry benchmarks.

unleashing-ln-ais-m-ocr-revolutionizing-pdf-data-extraction
Sam Witteveen

Unleashing Ln AI's M OCR: Revolutionizing PDF Data Extraction

Discover Ln AI's groundbreaking M OCR model, fine-tuned for high-quality data extraction from PDFs. Unleash its power for seamless text conversion, including handwriting and equations. Experience the future of OCR technology with Ln AI's transparent and efficient solution.