Practical AI in SQL Server 2025: A Vector Demo Database For You

Today, I have the honor and pleasure of debuting a new presentation for MSSQLTips: A Practical Introduction to Vector Search in SQL Server 2025 (you can watch the recording here too). To accompany that new presentation, I opted to create a new demo database instead of retrofitting one of my existing demo databases. And I’m sharing it with you so you don’t have to go through the headache of taking an existing database and creating vector embeddings.

RecipesDemoDB

Background about the Database

This new database was built with SQL Server 2025 CTP 2.1, and backed up using ZSTD-high compression, weighs in around 16GB striped across 8 backup files.

The dbo.recipes table contains just under 500k recipes, and weighs in at about 2GB. This data was sourced from kaggle and is a dump of recipes from food.com.

Next, there’s other tables under the vectors schema, that contain vector embeddings. The naming scheme is such that those tables correspond to the same named column in dbo.recipes. ex: dbo.recipes.description -> vectors.recipes_description. There is one table that is called recipes_other_cols, which is a JSON concatenation of some of the shorter columns from dbo.recipes – name, servings, and serving_size. Each of the vectors.* tables also have a vector index. All of the vector data is about 22 or 23GB, bringing the total database to about 24-25GB in full.

And finally, there’s a few example stored procedures with KNN and ANN code examples. I would also suggest checking my Practical Intro to Vector Search repo which has some other demo code.

You’ll still need to have Ollama setup and make a few changes to match your own environment. Make sure you use the same embedding model that I did (nomic-embed-text) so any vector embeddings you subsequently create match.

And finally, there is also a sub-folder on the demo-dbs repo that has all of the different “steps” that I took to create the various tables and generate the vector embeddings.

Why Should I Use this Database? Creating Vector Embeddings

I am running a Lenovo P14S with a Nvidia GeForce 3080 GPU connected via TBT3 to an external GPU housing. For the ~500k recipes, and 5 or 6 embedding tables, the entire process took an entire weekend. I don’t have an exact time, because I’d kick off one table to process, then come back later/the next day, validate the data, then run the next one. So yeah, it took a while, hence why I thought I’d share this database to save time for others.

Wrapping Up

If you decide to start using this demo database for your own learning and testing of vector search, I’d love to hear about it. And if you write any interesting demo code that you’d be willing to share, that’d also be amazing as well! As always, please let me know if you run into any quirks or have any feedback.

Happy learning – thanks for reading!

Announcing Data Saturday Chicago 2026!!

On behalf of the team that brought you SQL Saturday Chicago, I am ecstatic to announce that we are returning as Data Saturday Chicago on March 14, 2026 at Harper College.


Image

Sign up to receive E-mail Updates


How We Got Here

Chicago has a rich history of SQL Saturday events – NINE events since 2010!!!

  1. SQLSaturday #31April 17, 2010
  2. SQLSaturday #67March 26, 2011
  3. SQLSaturday #119May 19, 2012
  4. SQLSaturday #211April 13, 2013
  5. SQLSaturday #291April 26, 2014
  6. SQLSaturday #484March 5, 2016
  7. SQLSaturday #600March 11, 2017
  8. SQLSaturday #719March 17, 2018
  9. SQLSaturday #825 March 23, 2019

And we were ready to celebrate our 10th SQL Saturday event, SQL Saturday #945, on March 21st of 2020. But well, we all know how that went to straight to hell.

My Personal Journey

Writing this blog post took me down memory lane. I first attended SQL Saturday Chicago in 2012, #119. For #211 in 2013, I opted to volunteer and get involved. As chance would have it, I went from helping stuff attendee bags on Friday night, to being in charge of room monitors all day Saturday! And that started my journey as an official co-organizer, from 2014 onward.

After we got sunk in 2020, I moved to Boston in 2021. As such, I stepped down as co-leader of the Chicago Suburban SQL Server User Group and stepped away from SQL Saturday Chicago. The remaining team found it challenging to reorganize and start again. Frankly speaking, we all got our asses kicked by 2020. The landscape for in-person events is a shade of its former self, and rebooting an event is a hell of a lot more complicated than you might think.

Why Am I Doing This Again (Remotely)?

The answer is simple: I love our data community and the people.

By the community… for the community.

This community has given so much to me and I am more than happy to keep giving back. And I believe that I speak for the Data Saturday Chicago team when I say that this is one of our core values. Speaking of whom…

Introducing the 2026 Organizing Team

I am honored to be a part of an amazing team of volunteers, working together to put this event together for you (in alphabetical order):

  • Andy Yun
  • Bill Lescher
  • Bob Pusateri
  • Brandon Leach
  • Dave Bland
  • Frank Gill
  • Jared Karney
  • Lowry Kozlowski
  • Rod Carlson
  • Ross Reed
  • Wendy Pastrick – who has literally been organizing SQL Sat Chicago since the very first!

Why Rebrand as Data Saturday?

Since its inception, SQL Saturday was all about SQL Server. But back then, SQL Server was a much smaller product. It was possible to learn the entirety of what SQL Server had to offer (who remembers Microsoft Certified Masters – MCMs?).

But today, like the sizes of our databases (and my waistline), SQL Server has grown and evolved. We decided, collectively as a team, to also evolve and rebrand under Data Saturdays. The Microsoft Data Platform will continue to be our primary focus, but nowadays that encompasses so much more than simply SQL Server.

What to Expect Next

We’ve accomplished the hardest task, which is to lock down a suitable venue and date. Next, we will be focusing on our website, getting a registration platform setup, and opening a Call for Speakers. Please be patient with us as we are still a team of volunteers working this in our spare time.

I hope you are as excited as we are. See you in March 2026!

P.S. Sign up to receive e-mail updates!

T-SQL Tuesday #188 – Growing the Next Generation

T-SQL Tuesday Logo

Welcome back to another edition of T-SQL Tuesday. This month’s topic is an inspirational one, hosted by John Sterrett. John would like us to blog about what we can do to help grow the data community.

Growing new speakers is a topic that I have done a lot with already, so I won’t beat that drum. Instead, what I will suggest is something that anyone can do:

TL;DR – Recognize, Ask, & Offer

Step 1: Recognize Talent and Potential in Others

If you’re reading this, you’re part of the data community. You may not speak or blog regularly, but nonetheless, you are a participant. As such, you can watch for and recognize others. Maybe you’re reading an interesting thread on the SQL Server sub-Reddit… maybe you’re watching a conversation on the SQL Slack channel… Look for statements by individuals that pique your interest.

Step 2: Ask or Invite Them to Contribute

You’ve noticed someone talking about something really interesting? Ask them to share more of what they know about that topic. Invite them to write something more in-depth about it. Being asked and invited to share more, is validation for that other individual. And I believe that each one of us should always be looking for opportunities to validate others and raise each other up.

Step 3: Offer to Help

Anyone can help anyone else.

Stop – re-read that first statement again.

You can help.

There’s a mental trap that many of us fall into (I used to be one of those), where we believe that we are not in a position to help, because we do not believe we are an expert. You do not have to be an expert to help. You can help, simply by encouraging someone get their ideas out there and become more involved in the data community. Words of encouragement… or saying thank you for something you found useful… these are all ways to help.

One More Twist

This blog has been written, from the angle of asking you, the reader, to recognize potential in someone else, encouraging you to ask them to contribute, and offering your own help to further grow them.

Are YOU, whoever is reading this right now, the one who has that potential?

Look in the mirror… the answer is yes. You can be a part of the next generation… and there are many out there who will help you. Don’t know where to find someone? Ask… ask me… you’re reading my blog right now… contact me.