Victoria Drake on victoria.dev

Why the Best Engineers Will Thrive Alongside AI

2025-07-03T04:04:18-05:00

Every time I see another “AI will replace programmers” headline, I think about the best engineers I’ve worked with. They’re not the ones who write the most code or know the most algorithms. They’re the ones who see problems clearly, design elegant solutions, and build systems that last. AI won’t replace these people. It will make them unstoppable.

The engineers who thrive in an AI-augmented world won’t be fighting against the technology or ignoring it. They’ll understand how to amplify their strengths through intelligent collaboration with AI systems. Instead of asking “Will AI take my job?” they’ll ask “How can AI make me 10x more effective at the work that matters most?”

What’s fascinating is that the skills that make you great at working with AI are remarkably similar to the skills that make you great at working with other engineers. Clear communication, structured thinking, and productive division of labor are fundamentals that remain constant whether you’re pair programming with a colleague or collaborating with an AI model.

Here’s what that collaboration looks like in practice, and how to position yourself to lead in an AI-first world.

AI Amplifies Systems Thinking Through Better Collaboration

The biggest opportunity comes from using AI to think through complex systems more thoroughly. AI excels at analyzing patterns, suggesting edge cases, and helping you reason through architectural decisions. Engineers who learn to collaborate effectively with AI on design and planning create better systems than either could build alone.

This mirrors how the best engineering teams work together. When you’re designing a system with a colleague, you externalize your thinking, challenge each other’s assumptions, and explore alternatives. Working with AI requires the same discipline. You articulate problems clearly, make your constraints explicit, and iterate on solutions collaboratively.

The practical skill here involves having productive conversations with AI about system design in the same way you would with a colleague.

Start by clearly defining the problem space: What are the constraints? What are the non-obvious requirements? What could go wrong? AI can help you explore these questions more comprehensively before you commit to solutions.

This resembles how senior engineers mentor junior team members—by asking good questions and helping them think through problems systematically. The difference is that AI can process vast amounts of information quickly and suggest patterns you might not have considered.

Start practicing this now by using AI to review your design documents, challenge your assumptions, and suggest alternatives. The goal is ensuring you consider angles you might have missed, just like getting a thorough code review from a thoughtful colleague.

Human Skills Become Your Competitive Advantage

As AI handles more routine implementation work, the uniquely human aspects of engineering become increasingly important. Understanding and communicating business context, navigating organizational complexity, and making judgment calls under uncertainty—these skills differentiate great engineers from good ones.

Product intuition becomes especially critical. AI can generate code, but it can’t determine whether you’re building the right thing to solve your customers’ problems. Engineers who understand user needs, translate business requirements into technical solutions, and make trade-offs based on strategic priorities remain indispensable.

These are the same skills that make you valuable on any engineering team. The ability to see the bigger picture, understand stakeholder needs, and make technical decisions that serve business objectives has always been what separates senior engineers from code writers.

The ability to work across disciplines becomes more valuable as well. The best AI implementations often require understanding domain expertise, user experience implications, and business impact. Engineers who can bridge these contexts design better AI integrations, just as they design better systems when working with product managers, designers, and other stakeholders.

Communication skills get amplified too. Evaluating options, explaining trade-offs, and building consensus around technical decisions become crucial when AI can generate multiple potential solutions quickly. You’re curating and contextualizing solutions rather than just implementing them, much like how lead engineers guide technical discussions and help teams make good decisions collectively.

Building AI-Native Systems From the Ground Up

The most significant opportunities lie in designing systems built around AI capabilities from the beginning, rather than retrofitting AI into existing architectures. This requires thinking differently about how software systems work and collaborating effectively during the design process.

AI-native systems often need different patterns for data flow, error handling, and user interaction. They might handle probabilistic outcomes rather than deterministic ones, incorporate continuous learning loops, and provide transparency into decision-making processes. Engineers who understand these patterns early will have a significant advantage.

This resembles the transition any engineering team makes when adopting new paradigms. The teams that succeed are those that collaborate well during the learning process, share knowledge effectively, and iterate toward better patterns together.

Working with AI also means getting comfortable with a different development workflow. Instead of writing every function from scratch, you might orchestrate AI services, design feedback loops for model improvement, and build systems that get smarter about your application over time. The engineering challenge shifts from pure implementation toward integration and optimization.

Starting small with AI integrations in your current projects is a practical approach for seeing how AI systems can help. Add intelligent features to existing applications. Experiment with AI APIs and services. Build systems that can incorporate AI capabilities without requiring complete rewrites. Each project teaches you more about AI-native patterns, similar to how you’d gradually adopt any new technology stack.

Developing AI Collaboration Skills

Learning to work effectively with AI as a thinking partner goes beyond using AI tools. You’re developing a collaborative workflow where AI augments your problem-solving process rather than just automating tasks.

This means getting good at prompt engineering, but more importantly, learning to structure problems and code in ways that AI can help with effectively. Some problems benefit from AI’s pattern recognition capabilities. Others need AI’s ability to generate and evaluate multiple approaches quickly. Understanding when and how to use these capabilities makes you more effective.

Good engineers know when to ask colleagues for help, how to frame problems clearly, and which team members bring the right expertise to different challenges. Working with AI requires similar social and communication skills.

It’s also critical to develop good judgment about AI outputs. AI can generate impressive solutions that miss important constraints or edge cases. Engineers who can quickly evaluate AI suggestions, identify potential issues, and iterate toward better solutions will consistently outperform those who either avoid AI entirely or accept its outputs uncritically.

This mirrors how you’d work with any collaborator—trusting their expertise while applying your own judgment, asking clarifying questions, and building on their contributions with your own insights and context.

Positioning for Long-Term Success

Engineers who thrive long-term will view AI as a force multiplier for their existing strengths rather than a replacement for their role. If you’re great at system design, AI can help you explore more architectural options. If you excel at debugging, AI can help you identify patterns across larger codebases. If you’re skilled at optimization, AI can help you analyze performance bottlenecks more comprehensively.

The strategic approach involves doubling down on your strengths while developing AI collaboration skills that amplify them. You don’t need to become an AI researcher unless that’s your passion. Instead, become expert at applying AI to the problems you already enjoy solving.

This also means staying close to the business impact of your work. Engineers who understand how their technical decisions affect user experience, business metrics, and organizational goals will always be valuable, regardless of how AI capabilities evolve. The technology might change, but the need for good judgment about what to build and how to build it remains constant.

The Compound Advantage of Early Adoption

Engineers who start developing AI collaboration skills now will have years of experience when these capabilities become standard across the industry. That experience includes technical knowledge, intuition for when AI helps and when it doesn’t, an understanding of failure modes, and robust workflows built around AI capabilities.

Start with AI tools that augment your current workflow—code completion, documentation generation, test writing. Gradually expand to more complex collaborations like architectural design, system optimization, and problem analysis. Each interaction teaches you more about effective AI collaboration.

Just like learning to work well with any new team member, the key is consistent practice and honest feedback. Try different approaches, see what works, and gradually build more sophisticated collaborative patterns.

The goal is becoming highly effective at leveraging AI rather than becoming dependent on it. Engineers with this skill set will consistently deliver better results faster than those working without AI augmentation. As AI capabilities improve, this advantage compounds.

The future belongs to engineers who see AI as an opportunity to tackle harder problems, build better systems, and have greater impact. Instead of competing with AI, they’re collaborating with it to push the boundaries of what’s possible in software engineering.

The best engineers have always been force multipliers—they make everyone around them more effective. AI gives these engineers a new kind of leverage. Instead of just amplifying the capabilities of their teams, they can amplify their own problem-solving abilities and tackle challenges that were previously beyond reach.

The practices that make you great at working with AI—clear communication, structured thinking, productive collaboration, and sound judgment—are the same practices that make you great at working with people. Master these fundamentals, and you’ll thrive regardless of how the technology landscape evolves.

From Problem Solver to Problem Solver Creator

2025-06-24T13:06:44+00:00

The question that changed everything for me was simple: “What if instead of being the person who solves problems, I became the person who creates problem solvers?” It sounds obvious in retrospect, but the shift from solving to enabling requires completely rewiring how you think about getting work done.

For years, my value came from being able to debug the trickiest issues, architect complex systems, and untangle the technical problems that had everyone else stuck. I was fast, thorough, and reliable. But in a leadership role, continuing to be the primary problem solver wasn’t scaling—it was becoming a bottleneck.

I realized that every problem I solved myself, though satisfying, was a missed opportunity to develop someone else’s problem-solving capabilities. Instead of asking “How can I fix this?” I started asking “Who could learn the most from figuring this out, and how can I set them up for success?”

This mindset shift transforms everything about how you approach engineering leadership. Instead of optimizing for immediate solutions, you optimize for building a team that can tackle increasingly complex challenges independently. Here’s what that looks like in practice.

Teaching Through Ownership, Not Tasks

The difference between assigning tasks and developing problem solvers is the difference between “implement this API endpoint” and “figure out how we should handle user authentication for this new feature.” One teaches someone to follow specifications; the other teaches them to think through trade-offs and business impact, research solutions, and make technical decisions.

Strategic delegation becomes about identifying problems that are slightly beyond someone’s current comfort zone—complex enough to require real thinking, but not so complex that they’ll get stuck without making progress. When we needed to optimize our database performance, instead of diving in myself, I paired our most eager junior backend engineer with our database expert and said, “Figure out why our queries are getting slower and what we should do about it.” (They did an excellent job and both learned new things in the process.)

The key is providing enough context for good decision-making while resisting the urge to prescribe the solution.

This means sharing the business constraints, the technical requirements, and the success criteria, then stepping back and letting them work through the problem-solving process. When they hit roadblocks, you guide them toward resources and approaches rather than answers.

What you’re really doing is teaching people to ask the right questions: What are we optimizing for? What are the constraints? What could go wrong? How will we know if it’s working? These thinking patterns transfer to every future problem they encounter.

Building Problem-Solving Muscle Through Learning

The best problem solvers aren’t necessarily the ones who know the most—they’re the ones who are best at learning what they need to know. Creating a culture where continuous learning is expected and supported turns every project into an opportunity to develop new problem-solving capabilities.

This means structuring work so that people regularly encounter unfamiliar challenges with appropriate support systems. When someone expresses curiosity about machine learning, performance optimization, or distributed systems, find ways to connect that interest to real problems your team needs to solve. The developer who wants to understand ML can take point on improving your recommendation algorithm. The engineer curious about performance can lead the investigation into why your app feels sluggish.

Internal knowledge sharing amplifies this effect. Regular deep-dive sessions where team members present problems they’ve solved create a library of problem-solving approaches that everyone can learn from. But more importantly, the act of sharing forces people to articulate their thinking process, which helps them develop more systematic approaches to future problems.

The compound effect is remarkable. Teams that prioritize learning consistently punch above their weight because they’re better at recognizing patterns, adapting to new situations, and breaking down complex problems into manageable pieces.

Communication That Enables Independent Thinking

The goal of communication in leadership isn’t just clarity—it’s creating the conditions where people can make good decisions without constantly checking in with you. This means providing not just what was decided, but the reasoning behind decisions, the factors that were considered, and the principles that guide similar situations.

When you share context richly, you’re teaching people to think through problems the way you would, but with their own insights and perspectives.

Instead of saying “use Redis for caching,” explain why caching is needed, what alternatives were considered, what trade-offs matter, and how to evaluate whether it’s working. Now when similar performance problems arise, they have a framework for thinking through solutions.

One-on-ones become especially valuable for developing problem-solving skills. These conversations are where you can understand how someone approaches challenges, what assumptions they’re making, and where their thinking might benefit from different perspectives. Often, the most helpful thing you can do is ask questions that help them think through problems more systematically.

The ultimate goal is asynchronous problem-solving—people having enough context and judgment to tackle new challenges without waiting for direction. When that happens, your team’s problem-solving capacity isn’t limited by your bandwidth.

Identifying and Developing Natural Problem-Solving Styles

Every engineer has a natural approach to problem-solving, but not everyone has had the opportunity to develop and refine that approach. Part of creating problem solvers is recognizing these natural inclinations and providing opportunities to strengthen them.

Some people are naturally systematic—they break down complex problems into smaller pieces and work through them methodically. Others are more intuitive—they see patterns and connections that aren’t immediately obvious. Some are great at asking the right questions to clarify requirements. Others excel at considering edge cases and potential failures.

The key is matching people with problems that play to their strengths while gradually expanding their toolkit. Let the systematic thinker lead the database migration planning. Give the pattern-recognizer the tricky debugging challenge. Ask the question-asker to work with product managers on requirement gathering.

But also create opportunities for people to develop complementary skills. Pair the intuitive problem solver with someone more methodical. Have the detail-oriented engineer work on a project that requires big-picture thinking. These collaborations teach people new approaches while solving real problems.

Leadership development happens naturally when people get comfortable with their own problem-solving style and learn to facilitate problem-solving in others.

Removing Obstacles to Problem-Solving Growth

The biggest barriers to developing problem solvers are often systemic rather than individual. People can’t develop good judgment if they don’t have access to the information they need to make decisions. They can’t learn from mistakes if the environment punishes experimentation. They can’t tackle complex problems if they’re constantly interrupted by urgent but low-value work.

Your role becomes creating the conditions where problem-solving skills can develop naturally.

This often means advocating upward for better tools, more reasonable deadlines, or clearer priorities. It means protecting your team’s focus time and ensuring they have access to the resources they need to dive deep into problems.

Sometimes it’s about facilitating conversations between teams so your engineers can get the context they need to make good technical decisions. Sometimes it’s about negotiating for technical debt time so people can practice the long-term thinking that prevents problems rather than just solving them reactively.

The most important obstacle to remove is the fear of making mistakes. Problem-solving skills develop through experimentation, and experimentation requires an environment where intelligent failures are treated as learning opportunities rather than performance problems.

The Multiplier Effect

What makes this approach so rewarding is that the impact compounds. A team of capable problem solvers doesn’t just solve more problems. It solves harder problems, prevents problems through better design, and creates solutions that other teams can build on.

When you develop someone’s problem-solving abilities, you’re not just helping them with their current role. You’re giving them tools they’ll use throughout their career, whether they stay individual contributors or move into leadership themselves. The engineer who learns to think systematically about performance problems becomes someone who designs performant systems from the start.

The ripple effects extend beyond your immediate team. Problem solvers become mentors. They raise the bar in technical discussions. They ask better questions in design reviews. They contribute to a culture where good technical decision-making is normal rather than exceptional.

This is the ultimate lever in engineering leadership: instead of solving problems yourself, you create the conditions where great solutions emerge naturally from your team.

Instead of being the bottleneck, you become the catalyst that makes everything else work better.

The transition from solving problems to creating problem solvers is challenging because it requires patience and faith in other people’s potential. But when you see someone tackle a problem that would have stumped them six months ago, or when your team consistently delivers solutions that surprise you with their thoughtfulness, you realize you’ve built something much more valuable than any individual technical contribution: a system that continuously generates great technical work.

I Spent $78 Learning Why Bash Still Matters in the AI Age

2025-06-15T14:09:42+00:00

Here’s how a little laziness cost me $78.

While working on a personal project recently, I wanted Cline to process about a hundred files spread across a project’s subdirectories. I fired up Cline, picked Gemini 2.5 Pro (context window FTW), and asked it to recurse through the subdirectories, process the files, and put the results in a new file.

Cline got to work… slowly. I watched as the “API Request…” spinner appeared for each file read and each time it saved the results. About twenty minutes and $26 later, it finished.

Okay, I thought, that’s not great, but not untenable. The cost of convenience, right? I opened up the results file to take a look and… sigh. Not great work. It was obvious that some files had been skipped despite my very careful instructions to process each and every one.

So, like a glutton for punishment, I made a list of the files Cline had skipped and asked it to try again. Tired of babysitting, I raised the “Maximum Request Auto Approval” limit to more than I thought would be needed to finish processing the files that were left, and went to take a coffee break.

When I came back, Cline was done. The results? Still not great. Files had still been skipped, some files that were processed were missing results, and, oh, my task bill had risen to $78.

Okay, this was untenable. Reading all this data into context was costly and slow.

Then the coffee started to kick in, I guess, because it dawned on me: why in the world was I using expensive API calls to do something a Bash one-liner could do?

“Cline, write a Bash command that will recurse through the data/ directory and obtain the content of all the files and copy it into a single new file.”

Which produced:

find data/ -type f -exec cat {} + > all_data.txt

This command:

find data/ - searches recursively in the data directory.
-type f - specifies that we’re looking for files only (not directories, links, etc.).
-exec cat {} + - for all files found, execute the cat command. The {} is a placeholder for the filename, and the + is a crucial optimization that passes as many filenames as will fit to each cat invocation, avoiding the overhead of launching a new process for every single file.
> all_data.txt - redirects the standard output of the cat command (which is the concatenated content of all the files) into a new file named all_data.txt.

Then I asked Cline to read the resulting all_data.txt file, process it, and output the results.

It took about two minutes.

And it cost me $0.78.
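One limitation of the one-liner is that the combined file no longer shows where one source file ends and the next begins. Here is a minimal sketch of a variant that writes a header line before each file. The combine_files name and the ===== header format are my own illustration, not something Cline produced, and it assumes the output file lives outside the directory being searched:

```bash
# Concatenate every regular file under a directory into one output file,
# writing a header line with the source path before each file's content.
# Usage: combine_files <source_dir> <output_file>
combine_files() {
  find "$1" -type f -exec sh -c '
    for f in "$@"; do
      printf "===== %s =====\n" "$f"   # header naming the source file
      cat "$f"
      printf "\n"                      # blank line between files
    done
  ' sh {} + > "$2"
}
```

Calling combine_files data/ all_data.txt produces the same kind of file as the one-liner. Provided no source line itself begins with =====, running grep -c '^=====' all_data.txt gives a count you can compare against find data/ -type f | wc -l, which is one way to confirm nothing was skipped.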

What just happened?

My initial naive approach had accidentally created a perfect storm of computational inefficiency.

When Cline processed each file individually, it was making separate API calls for every single operation - reads, writes, the works. With about 100 files, that meant roughly 200+ API calls, each one spinning up its own network round-trip with all the latency that entails. Every time I saw that “API Request…” spinner, I was watching money float away into the ether.

But here’s the kicker: large language models like Gemini charge based on token consumption.

It’s not just the file content they’re charging for; every single API call also included the entire conversation history, system prompts, and my instructions.

With a stateless API, that context has to be re-transmitted with every single request. If my average context was around 10,000 tokens and I made 200 calls, I burned through 2 million tokens (10,000 * 200) on overhead alone, before even counting the actual data.
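To make that arithmetic concrete, here is a small sketch using the same estimates (10,000 tokens of context, 200 calls versus one). The overhead_tokens helper is a name I made up for illustration, and it treats the context as a fixed size per request. In practice the conversation history grows as the task proceeds, so this is a simplification:

```bash
# Tokens re-sent as overhead: context size multiplied by number of requests.
overhead_tokens() {
  echo $(( $1 * $2 ))
}

context_tokens=10000   # average context per request (the estimate above)
per_file_calls=200     # roughly one read and one write for each of ~100 files
batched_calls=1        # a single request over the combined file

per_file=$(overhead_tokens "$context_tokens" "$per_file_calls")
batched=$(overhead_tokens "$context_tokens" "$batched_calls")

echo "per-file overhead: $per_file tokens"    # 2000000
echo "batched overhead:  $batched tokens"     # 10000
echo "ratio:             $(( per_file / batched ))x"   # 200x
```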

Combining all the files with Bash flipped this whole equation on its head. Instead of 200 API calls, I made exactly one. Instead of bearing the network latency for every file operation, combining the files locally on my machine meant the filesystem could actually optimize that work. What had taken almost an hour of network round-trips for Gemini to access all the data was reduced to a couple hundred milliseconds of local file operations.

The expensive lesson in algorithmic thinking

This whole debacle reminded me why understanding the cost model of your tools matters just as much as understanding their capabilities. API pricing is designed around per-request and per-token charges, which naturally punishes fine-grained operations. It’s similar to how databases are optimized for bulk operations rather than processing individual rows - the overhead of each transaction quickly becomes the bottleneck.
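One cheap habit that follows from this is sizing up the data locally before sending anything. The sketch below estimates a file’s token count from its byte size. The 4-bytes-per-token ratio is only a rough rule of thumb for English text and an assumption on my part; real tokenizers vary by model and content, so treat the result as an order-of-magnitude check. The estimate_tokens name is hypothetical:

```bash
# Rough token estimate for a file, ASSUMING about 4 bytes per token.
# Usage: estimate_tokens <file>
estimate_tokens() {
  echo $(( $(wc -c < "$1") / 4 ))
}
```

Running estimate_tokens all_data.txt before handing the file to a model gives a quick sense of whether it will fit in the context window and roughly what the request will cost at your provider’s per-token rate.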

My first approach had O(n) complexity for API calls, where n equals the number of files. The Bash solution reduced that to O(1) by batching everything locally first. That’s the difference between linear scaling and constant cost, and at $78, I felt every bit of that mathematical distinction.

There’s also something to be said about data locality here. My original method couldn’t take advantage of any local caching or filesystem optimizations. Every operation had to go over the network to an API server, get processed, and come back. The Bash approach kept everything local until the very end, letting my machine’s filesystem cache work its magic.

The real cost of convenience

I’d fallen into the trap of thinking that because I could use an AI tool for everything, I should use it for everything. But there’s a difference between leveraging AI for tasks that require intelligence and using it as an expensive replacement for basic system utilities.

The irony is that I probably spent more mental energy managing and troubleshooting the AI approach than I would have just thinking through the problem for five minutes and reaching for the right tool from the start. Sometimes the most sophisticated solution is knowing when to employ a basic tool.

My little bit of laziness bought me a $78 lesson that boils down to this: always understand the economic model of your tools, especially when they’re priced per operation. The most elegant and cost-effective solution isn’t always the newest and most technically exciting one.

Create Better Code Documentation 10x Faster with AI

2024-08-27T13:55:47+00:00

Documentation has always been one of those “we should do this” tasks that somehow never makes it to the top of the sprint. But what if creating comprehensive, useful documentation could be as straightforward as explaining your code to a colleague?

Conversational AI has changed the game entirely. Instead of starting with a blank page and trying to remember every detail a new team member might need, you can have AI help you think through the process systematically. The result isn’t just better docs—it’s documentation that actually serves your team’s needs as you grow and evolve.

Here’s how to use AI to build documentation that scales with your team and genuinely improves how you work together.

Documentation That Welcomes New Team Members

The best part about using AI for documentation is that it naturally thinks from an outsider’s perspective. While you and your team already understand your system’s quirks and design decisions, AI starts fresh every time—much like a new hire would.

Most conversational AI tools allow you to upload code files or paste code snippets. You can then use prompts that help surface the knowledge your team takes for granted:

Write documentation for a new software engineer joining our team. Assume they’re experienced but know nothing about our specific domain, architecture decisions, or business logic. Include the “why” behind non-obvious technical choices and flag anything that might seem strange or unexpected to an outside developer.

This approach reveals the implicit knowledge that experienced team members forget to document—why certain patterns exist, what alternatives were considered, and where the potential gotchas are. It transforms documentation from a chore into a useful onboarding tool that actually reduces the time senior developers spend answering questions.

To create comprehensive documentation you can use immediately, provide the AI with additional context such as:

What the application does and who uses it
Key architectural decisions and their reasoning
Setup and deployment processes
Integration points with other systems
Common troubleshooting scenarios

Your role becomes reviewing and refining rather than writing from scratch—which is often the difference between documentation that gets done and documentation that gets skipped.

Operational Documentation That Actually Helps

One of the most valuable types of documentation is also the most overlooked: information organized for when things go wrong. During incidents, you need answers fast, not comprehensive explanations.

AI excels at creating focused, actionable documentation because you can specify exactly what situation you’re optimizing for:

Create incident response documentation for this codebase. Focus on: 1) How to quickly identify what component is failing, 2) Common failure modes and their symptoms, 3) Step-by-step debugging workflows, 4) Who to contact for different types of issues. Write this as if the person reading it is stressed, tired, and needs answers in under 5 minutes.

This type of documentation serves a completely different purpose than your standard README or API docs. It’s designed for when your most knowledgeable developers aren’t available and someone needs to resolve an issue quickly.

The beauty of AI-generated operational docs is that they’re naturally structured for scannability rather than linear reading, which is exactly what you need during high-pressure situations.

Capturing Institutional Knowledge

Here’s where AI really shines: helping you identify and document the knowledge that exists only in people’s heads. This institutional knowledge is often the difference between a change that takes 30 minutes and one that takes 3 hours of debugging.

You can surface these knowledge gaps by asking AI to analyze your code from a risk perspective:

Analyze this code and identify areas where domain knowledge or business context would be critical for modification. What would a developer need to know about our business, users, or regulatory requirements to safely change this code? What assumptions about data, timing, or external systems are embedded here?

For inline documentation, you can focus on the business logic and integration points that aren’t obvious from the code itself:

Add inline documentation to this code file without changing any of the code. Focus on documenting business logic, data assumptions, and integration points that wouldn’t be obvious to someone unfamiliar with our domain.

This process often improves the code itself—explaining your logic to AI sometimes reveals opportunities for clearer naming, better structure, or simplified approaches.
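Because the inline-documentation prompt asks the AI not to change any code, it is worth checking that it complied. The sketch below compares two versions of a file after dropping blank lines and comment-only lines. It assumes a language whose comments start with # (Python, Ruby, shell), and the function names and file names are hypothetical. Trailing comments on code lines and docstrings will show up as differences, so it errs on the side of flagging a change for you to review:

```bash
# Print a file without blank lines and without lines holding only a "#" comment.
strip_comment_lines() {
  grep -v -e '^[[:space:]]*#' -e '^[[:space:]]*$' "$1"
}

# Succeed (exit 0) when both files contain the same non-comment lines.
# Usage: code_unchanged <original_file> <documented_file>
code_unchanged() {
  [ "$(strip_comment_lines "$1")" = "$(strip_comment_lines "$2")" ]
}
```

For example, code_unchanged original.py documented.py && echo "only comments changed" prints the message only when the code lines match. When the check fails, a regular diff of the two files shows what else was touched.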

Making Documentation a Team Superpower

The real opportunity here isn’t just better individual documentation—it’s democratizing the ability to create good documentation across your entire team. Developers who previously avoided writing docs because they didn’t know where to start now have a collaborative partner to help structure their thoughts.

Start with high-impact documentation: Focus on onboarding guides and operational runbooks first. These provide immediate value and create positive momentum around documentation practices.
Use AI to improve existing docs: You can ask AI to review and improve documentation you already have, suggesting missing information or better organization.
Make it iterative: Documentation doesn’t need to be perfect on the first pass. Use AI to create initial drafts that you can refine based on team feedback and real usage patterns.
Leverage different formats: AI can help create everything from README files to inline comments to architectural decision records, adapting the style and depth based on the audience and purpose.

Practical Tips for Better Results

When working with AI to create documentation, providing context about the intended audience and use case dramatically improves the output. Explain not just what the code does, but who will be using the documentation and in what situations.

For complex codebases, you might get better results by working with smaller sections and then asking AI to help you organize everything into a coherent structure. Many AI tools can also provide downloadable files if you specify that in your prompt, which saves time on longer documents.

The goal isn’t to replace human judgment in documentation—it’s to remove the barriers that prevent good documentation from getting written in the first place. AI handles the initial structure and comprehensive coverage, while you focus on accuracy, team-specific context, and ensuring the documentation actually serves your workflows.

Good documentation transforms how teams work together. It reduces interruptions, accelerates onboarding, and creates resilience when key team members aren’t available. With AI handling the heavy lifting of initial creation, maintaining comprehensive documentation becomes achievable rather than aspirational.

Your future team members (and your future self during the next production incident) will definitely appreciate the investment.

Post to your static website from your iPhone

2024-05-05T00:00:00+00:00

I love websites. I love static sites in particular. But I know that sometimes it’s just not practical to write and post only from your computer. With my hands full raising a family, I do a lot more development in stops and starts from my phone these days than I thought I ever would.

So I brought together everything that’s great about Hugo plus everything that’s great about sharing your 3AM thoughts with the world from your phone, thanks to Collected Notes. I put it in a new Hugo site template with a fancy new theme I call Quint.

You can deploy the Quint site template with one button (this button):

The Quint template can use the Collected Notes app as a CMS and also saves your posts to the site repository, for redundancy. It fetches new posts each time you build, and if you’re deploying via Netlify or GitHub Actions, you can use a webhook to deploy the site whenever you make a new post with Collected Notes.

To set up your own site:

Deploy the Quint template to Netlify with the button above, or clone the repo if you plan to use another deployment solution.
Sign up for Collected Notes if you haven’t already (there’s a free plan) and download the Collected Notes app on your iPhone.
Update the utils/fetch-posts.js file to use your Collected Notes site name.
Allow the GitHub Action to push changes back to your repository to save your posts. Under Settings > Actions > General > Workflow permissions, choose Read and write permissions.

Netlify will trigger a new build each time you push to your site repo, or, if you have a Collected Notes Premium subscription, you can set a Netlify Build Hook URL in your Collected Notes site settings to automatically redeploy the site when you make a post or update an existing post.

I hope you found this post helpful! If you have any suggestions or improvements to add, feel free to contribute to the repository.

How to send long text input to ChatGPT using the OpenAI API

2023-09-26T04:46:36-05:00

In a previous post, I showed how you can apply text preprocessing techniques to shorten your input length for ChatGPT. Today in the web interface (chat.openai.com), ChatGPT allows you to send a message with a maximum token length of 4,096.

There are bound to be situations in which this isn’t enough, such as when you want to read in a large amount of text from a file. Using the OpenAI API allows you to send many more tokens in a messages array, with the maximum number depending on your chosen model. This lets you provide large amounts of text to ChatGPT using chunking. Here’s how.

Chunking your input

The gpt-4 model currently has a maximum context length of 8,192 tokens. (Here are the docs containing current limits for all the models.) Remember that you can first apply text preprocessing techniques to reduce your input size – in my previous post I achieved a 28% size reduction without losing meaning with just a little tokenization and pruning.

When this isn’t enough to fit your message within the maximum message token limit, you can take a general programmatic approach that sends your input in message chunks. The goal is to divide your text into sections that each fit within the model’s token limit. The general idea is to:

Tokenize and split text into chunks based on the model’s token limit. It’s better to keep message chunks slightly below the token limit since the token limit is shared between your message and ChatGPT’s response.
Maintain context between chunks, e.g. avoid splitting a sentence in the middle.

Each chunk is sent as a separate message in the conversation thread.

Handling responses

You send your chunks to ChatGPT using the OpenAI library’s ChatCompletion. ChatGPT returns individual responses for each message, so you may want to process these by:

Concatenating responses in the order you sent them to get a coherent answer.
Managing conversation flow by keeping track of which response refers to which chunk.
Formatting the response to suit your desired output, e.g. replacing \n with line breaks.
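
The post-processing steps above could be sketched roughly like this. It is a minimal illustration that assumes you stored each chunk's position alongside its response; the `combine_responses` helper and the `(index, text)` pair format are hypothetical and not part of the OpenAI library.

```python
def combine_responses(indexed_responses):
    """Join (chunk_index, response_text) pairs into one string, in send order.

    Hypothetical helper for illustration only.
    """
    # Sort by chunk index so responses line up with the order the chunks were sent
    ordered = sorted(indexed_responses, key=lambda pair: pair[0])

    cleaned = []
    for _, text in ordered:
        # Turn literal "\n" sequences (backslash + n) into real line breaks
        text = text.replace("\\n", "\n").strip()
        if text:
            cleaned.append(text)

    # Separate each chunk's response with a blank line
    return "\n\n".join(cleaned)


if __name__ == "__main__":
    # Responses may be stored out of order, e.g. after a retry
    responses = [(1, "Second part."), (0, "First part.\\nStill first."), (2, "  ")]
    print(combine_responses(responses))
```

Keeping the index with each response is what lets you trace an answer back to the chunk that produced it.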

Putting it all together

Using the OpenAI API, you can send multiple messages to ChatGPT and ask it to wait for you to provide all of the data before answering your prompt. Because ChatGPT is a language model, you can provide these instructions in plain language. Here’s a suggested script:

Prompt: Summarize the following text for me

To provide the context for the above prompt, I will send you text in parts. When I am finished, I will tell you “ALL PARTS SENT”. Do not answer until you have received all the parts.

I created a Python module, chatgptmax, that puts all this together. It breaks up a large amount of text by a given maximum token length and sends it in chunks to ChatGPT.

You can install it with pip install chatgptmax, but here’s the juicy part:

import os
import openai
import tiktoken

# Set up your OpenAI API key
# Load your API key from an environment variable or secret management service
openai.api_key = os.getenv("OPENAI_API_KEY")

def send(
    prompt=None,
    text_data=None,
    chat_model="gpt-3.5-turbo",
    model_token_limit=8192,
    max_tokens=2500,
):
    """
    Send the prompt at the start of the conversation and then send chunks of text_data to ChatGPT via the OpenAI API.
    If the text_data is too long, it splits it into chunks and sends each chunk separately.

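    Args:
    - prompt (str, optional): The prompt to guide the model's response.
    - text_data (str, optional): Additional text data to be included.
    - chat_model (str, optional): Name of the chat model to use. Default is "gpt-3.5-turbo".
    - model_token_limit (int, optional): Maximum total tokens kept in the message history. Default is 8192.
    - max_tokens (int, optional): Maximum tokens for each API call. Default is 2500.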

    Returns:
    - list or str: A list of model's responses for each chunk or an error message.
    """

    # Check if the necessary arguments are provided
    if not prompt:
        return "Error: Prompt is missing. Please provide a prompt."
    if not text_data:
        return "Error: Text data is missing. Please provide some text data."

    # Initialize the tokenizer
    tokenizer = tiktoken.encoding_for_model(chat_model)

    # Encode the text_data into token integers
    token_integers = tokenizer.encode(text_data)

    # Split the token integers into chunks based on max_tokens
    chunk_size = max_tokens - len(tokenizer.encode(prompt))
    chunks = [
        token_integers[i : i + chunk_size]
        for i in range(0, len(token_integers), chunk_size)
    ]

    # Decode token chunks back to strings
    chunks = [tokenizer.decode(chunk) for chunk in chunks]

    responses = []
    messages = [
        {"role": "user", "content": prompt},
        {
            "role": "user",
            "content": "To provide the context for the above prompt, I will send you text in parts. When I am finished, I will tell you 'ALL PARTS SENT'. Do not answer until you have received all the parts.",
        },
    ]

    for chunk in chunks:
        messages.append({"role": "user", "content": chunk})

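        # Check if total tokens exceed the model's limit and remove oldest chunks if necessary.
        # messages[0] is the prompt and messages[1] the instructions, so the oldest chunk is at index 2.
        while (
            len(messages) > 3
            and sum(len(tokenizer.encode(msg["content"])) for msg in messages)
            > model_token_limit
        ):
            messages.pop(2)  # Remove the oldest chunk, keeping the prompt and instructions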

        response = openai.ChatCompletion.create(model=chat_model, messages=messages)
        chatgpt_response = response.choices[0].message["content"].strip()
        responses.append(chatgpt_response)

    # Add the final "ALL PARTS SENT" message
    messages.append({"role": "user", "content": "ALL PARTS SENT"})
    response = openai.ChatCompletion.create(model=chat_model, messages=messages)
    final_response = response.choices[0].message["content"].strip()
    responses.append(final_response)

    return responses

Here’s an example of how you can use this module with text data read from a file. (chatgptmax also provides a convenience method for getting text from a file.)

# First, import the send function
from chatgptmax import send

# Define a function to read the content of a file
def read_file_content(file_path):
    with open(file_path, 'r', encoding='utf-8') as file:
        return file.read()

# Use the function
if __name__ == "__main__":
    # Specify the path to your file
    file_path = "path_to_your_file.txt"
    
    # Read the content of the file
    file_content = read_file_content(file_path)
    
    # Define your prompt
    prompt_text = "Summarize the following text for me:"
    
    # Send the file content to ChatGPT
    responses = send(prompt=prompt_text, text_data=file_content)
    
    # Print the responses
    for response in responses:
        print(response)

Error handling

While the module is designed to handle most standard use cases, there are potential pitfalls to be aware of:

Incomplete sentences: If a chunk ends in the middle of a sentence, it might alter the meaning or context. To mitigate this, consider ensuring that chunks end at full stops or natural breaks in the text. You could do this by moving the text-chunking task into its own function that:
1. Splits the text into sentences.
2. Iterates over the sentences and adds them to a chunk until the chunk reaches the maximum size.
3. Starts a new chunk when the current chunk reaches the maximum size or when adding another sentence would exceed the maximum size.
API connectivity issues: There’s always a possibility of timeouts or connectivity problems during API calls. If this is a significant issue for your application, you can include retry logic in your code. If an API call fails, the script could wait for a few seconds and then try again, ensuring that all chunks are processed.
Rate limits: Be mindful of OpenAI API’s rate limits. If you’re sending many chunks in rapid succession, you might hit these limits. Introducing a slight delay between calls or spreading out requests can help avoid this.
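
Here is a rough sketch of two of the mitigations above: sentence-aware chunking that follows the three steps listed, and a simple retry wrapper. Both `chunk_by_sentence` and `call_with_retry` are hypothetical helpers, not part of chatgptmax. The sentence split is a naive regular expression, and size is measured in characters by default; to measure in tokens you could pass something like `lambda s: len(tokenizer.encode(s))` as `count`.

```python
import re
import time


def chunk_by_sentence(text, max_size, count=len):
    """Group whole sentences into chunks no larger than max_size.

    `count` measures a string's size. It defaults to character length so the
    sketch has no dependencies; pass a token counter to measure in tokens.
    """
    # Naive split after ., !, or ? followed by whitespace (an assumption;
    # abbreviations such as "e.g." will be split too)
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

    chunks, current = [], ""
    for sentence in sentences:
        candidate = f"{current} {sentence}" if current else sentence
        if current and count(candidate) > max_size:
            # Adding this sentence would exceed the limit: start a new chunk
            chunks.append(current)
            current = sentence
        else:
            current = candidate
    if current:
        chunks.append(current)
    # Note: a single sentence longer than max_size still becomes its own chunk
    return chunks


def call_with_retry(func, attempts=3, delay_seconds=2.0):
    """Call func(); on an exception, wait and try again, up to `attempts` times."""
    for attempt in range(1, attempts + 1):
        try:
            return func()
        except Exception:  # In real code, narrow this to the errors your client raises
            if attempt == attempts:
                raise  # Out of attempts: surface the last error
            # Wait longer after each failure (simple linear backoff)
            time.sleep(delay_seconds * attempt)
```

You might wrap each API call from the module as `call_with_retry(lambda: openai.ChatCompletion.create(model=chat_model, messages=messages))`. The growing delay between attempts also helps a little with rate limits.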

Optimization

As with any process, there’s always room for improvement. Here are a couple of ways you might optimize the module’s chunking and sending process further:

Parallelizing API calls: If OpenAI API’s rate limits and your infrastructure allow, you could send multiple chunks simultaneously. This parallel processing can speed up the overall time it takes to get responses for all chunks. Unless you have access to OpenAI’s 32k models or need to use small chunk sizes, however, parallelism gains are likely to be minimal.
Caching mechanisms: If you find yourself sending the same or similar chunks frequently, consider implementing a caching system. By storing ChatGPT’s responses for specific chunks, you can retrieve them instantly from the cache the next time, saving both time and API calls.
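
As a minimal sketch of the caching idea, assuming each chunk is sent independently (so the response depends only on the model, prompt, and chunk), you could key an in-memory dictionary on a hash of those three values. The `ResponseCache` class and the `fetch` callback are hypothetical names for illustration. Only exact repeats hit the cache; "similar" chunks would need a fuzzier lookup.

```python
import hashlib


class ResponseCache:
    """In-memory cache keyed by a hash of (model, prompt, chunk)."""

    def __init__(self):
        self._store = {}
        self.hits = 0

    @staticmethod
    def _key(model, prompt, chunk):
        # Join the fields with a separator unlikely to appear in text,
        # then hash so keys stay small even for long chunks
        raw = "\x1f".join([model, prompt, chunk]).encode("utf-8")
        return hashlib.sha256(raw).hexdigest()

    def get_or_call(self, model, prompt, chunk, fetch):
        """Return the cached response, or call fetch(chunk) and store the result."""
        key = self._key(model, prompt, chunk)
        if key in self._store:
            self.hits += 1
            return self._store[key]
        response = fetch(chunk)
        self._store[key] = response
        return response
```

Here `fetch` stands in for whatever function actually sends the chunk to the API. Note that the `send` function shown earlier includes previous chunks in each request, so a per-chunk cache like this fits best when chunks are processed on their own.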

Now what

If you found your way here via search, you probably already have a use case in mind. Here are some other (startup) ideas:

You’re a researcher who wants to save time by getting short summaries of many lengthy articles.
You’re a legal professional who wants to analyze long contracts by extracting key points or clauses.
You’re a financial analyst who wants to pull a quick overview of trends from a long report.
You’re a writer who wants feedback on a new article or chapter… without having to actually show it to anyone yet.

Do you have a use case I didn’t list? Let me know about it! In the meantime, have fun sending lots of text to ChatGPT.

Optimizing text for ChatGPT: NLP and text pre-processing techniques

2023-09-19T04:46:36-05:00

In order for chatbots and voice assistants to be helpful, they need to be able to take in and understand our instructions in plain language using Natural Language Processing (NLP). ChatGPT relies on a blend of advanced algorithms and text preprocessing methods to make sense of our words. But just throwing a wall of text at it can be inefficient – you might be dumping in a lot of noise with that signal and hitting the text input limit.

Text preprocessing can help shorten and refine your input, ensuring that ChatGPT can grasp the essence without getting overwhelmed. In this article, we’ll explore these techniques, understand their importance, and see how they make your interactions with tools like ChatGPT more reliable and productive.

Text preprocessing

Text preprocessing prepares raw text data for analysis by NLP models. Generally, it distills everyday text (like full sentences) into something more concise and manageable while keeping its meaning. Techniques include:

Tokenization: splitting up text by sentences or paragraphs. For example, you could break down a lengthy legal document into individual clauses or sentences.
Extractive summarization: selecting key sentences from the text and discarding the rest. Instead of reading an entire 10-page document, extractive summarization could pinpoint the most crucial sentences and give you a concise overview without delving into the details.
Abstractive summarization: generating a concise representation of the text content, for example, turning a 10-page document into a brief paragraph that captures the document’s essence in new wording.
Pruning: removing redundant or less relevant parts. For example, in a verbose email thread, pruning can help remove all the greetings, sign-offs, and other repetitive elements, leaving only the core content for analysis.

While all these techniques can help reduce the size of raw text data, some are easier to apply to general use cases than others. Let’s examine how text preprocessing can help us send a large amount of text to ChatGPT.

Tokenization and ChatGPT input limits

In the realm of Natural Language Processing (NLP), a token is the basic unit of text that a system reads. At its simplest, you can think of a token as a word, but depending on the language and the specific tokenization method used, a token can represent a word, part of a word, or even multiple words.

While in English we often equate tokens with words, in NLP, the concept is broader. A token can be as short as a single character or as long as a word. For example, with word tokenization, the sentence “Unicode characters such as emojis are not indivisible. ✂️” can be broken down into tokens like this: [“Unicode”, “characters”, “such”, “as”, “emojis”, “are”, “not”, “indivisible”, “.”, “✂️”]

In another form called Byte-Pair Encoding (BPE), the same sentence is tokenized as: [“Un”, “ic”, “ode”, " characters", " such", " as", " em", “oj”, “is”, " are", " not", " ind", “iv”, “isible”, “.”, " �", “�️”]. The emoji itself is split into tokens containing its underlying bytes, which display as � because they aren’t complete characters on their own.

Depending on the ChatGPT model chosen, your text input size is restricted by tokens. Here are the docs containing current limits. BPE is used by ChatGPT to determine token count, and we’ll discuss it more thoroughly later. First, we can programmatically apply some preprocessing techniques to reduce our text input size and use fewer tokens.

A general programmatic approach

For a general approach that can be applied programmatically, pruning is a suitable preprocessing technique. One form is stop word removal, or removing common words that might not add significant meaning in certain contexts. For example, consider the sentence:

“I always enjoy having pizza with my friends on weekends.”

Stop words are often words that don’t carry significant meaning on their own in a given context. In this sentence, words like “I”, “always”, “enjoy”, “having”, “with”, “my”, “on” are considered stop words.

After removing the stop words, the sentence becomes:

“pizza friends weekends.”

Now, the sentence is distilled to its key components, highlighting the main subject (pizza) and the associated context (friends and weekends). If you find yourself wishing you could convince people to do this in real life (cough, meetings, cough)… you aren’t alone.

Stop word removal is straightforward to apply programmatically: given a list of stop words, examine some text input to see if it contains any of the stop words on your list. If it does, remove them, then return the altered text.

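def clean_stopwords(text: str) -> str:
    # A set gives fast membership checks; matching is case-sensitive
    stopwords = {"a", "an", "and", "at", "but", "how", "in", "is", "on", "or", "the", "to", "what", "will"}
    # Split on whitespace and keep only the words that aren't stop words
    tokens = text.split()
    clean_tokens = [t for t in tokens if t not in stopwords]
    return " ".join(clean_tokens)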

To see how effective stop word removal can be, I took the entire text of my Tech Leader Docs newsletter (17,230 words consisting of 104,892 characters) and processed it using the above function. How effective was it? The resulting text contained 89,337 characters, which is about a 15% reduction in size.

Other pruning techniques can also be applied programmatically. Removing punctuation, numbers, HTML tags, URLs and email addresses, or non-alphabetical characters are all valid pruning techniques that can be straightforward to apply. Here is a function that does just that:

import re

def clean_text(text):
    # Remove URLs
    text = re.sub(r'http\S+', '', text)
    
    # Remove email addresses
    text = re.sub(r'\S+@\S+', '', text)
    
    # Remove everything that's not a letter (a-z, A-Z)
    text = re.sub(r'[^a-zA-Z\s]', '', text)
    
    # Remove whitespace, tabs, and new lines
    text = ''.join(text.split())

    return text

How much more length reduction can we get from this additional processing? Applying these techniques to the remaining characters of Tech Leader Docs results in just 75,217 characters, an overall reduction of about 28% from the original text.

More opinionated pruning, such as removing short words or specific words or phrases, can be tailored to a specific use case. These don’t lend themselves well to general functions, however.

Now that you have some text processing techniques in your toolkit, let’s look at how a reduction in characters translates to fewer tokens used when it comes to ChatGPT. To understand this, we’ll examine Byte-Pair Encoding.

Byte-Pair Encoding (BPE)

Byte-Pair Encoding (BPE) is a subword tokenization method. It was originally introduced for data compression but has since been adapted for tokenization in NLP tasks. It represents common words as single tokens and splits rarer words into subword units. This enables a balance between character-level and word-level tokenization.

Let’s make that more concrete. Imagine you have a big box of LEGO bricks, and each brick represents a single letter or character. You’re tasked with building words using these LEGO bricks. At first, you might start by connecting individual bricks to form words. But over time, you notice that certain combinations of bricks (or characters) keep appearing together frequently, like “th” in “the” or “ing” in “running.”

BPE is like a smart LEGO-building buddy who suggests, “Hey, since ‘th’ and ‘ing’ keep appearing together a lot, why don’t we glue them together and treat them as a single piece?” This way, the next time you want to build a word with “the” or “running,” you can use these glued-together pieces, making the process faster and more efficient.

Colloquially, the BPE algorithm looks like this:

Start with single characters.
Observe which pairs of characters frequently appear together.
Merge those frequent pairs together to treat them as one unit.
Repeat this process until you have a mix of single characters and frequently occurring character combinations.
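
The four steps above can be sketched as a toy implementation. This is a simplified illustration, not the tokenizer ChatGPT uses: production BPE tokenizers work on bytes rather than characters and are trained on far larger data. The `learn_bpe_merges` function name is hypothetical, and ties between equally frequent pairs are simply broken by first-seen order.

```python
from collections import Counter


def learn_bpe_merges(words, num_merges):
    """Learn up to num_merges merge rules from a list of words (toy BPE)."""
    # Step 1: start with each word as a tuple of single characters
    corpus = Counter(tuple(word) for word in words)
    merges = []

    for _ in range(num_merges):
        # Step 2: count how often each adjacent pair of symbols appears
        pair_counts = Counter()
        for symbols, freq in corpus.items():
            for pair in zip(symbols, symbols[1:]):
                pair_counts[pair] += freq
        if not pair_counts:
            break  # Every word is already a single symbol

        # Step 3: merge the most frequent pair into one symbol
        best = max(pair_counts, key=pair_counts.get)
        merges.append(best)

        new_corpus = Counter()
        for symbols, freq in corpus.items():
            merged, i = [], 0
            while i < len(symbols):
                if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == best:
                    merged.append(symbols[i] + symbols[i + 1])
                    i += 2
                else:
                    merged.append(symbols[i])
                    i += 1
            new_corpus[tuple(merged)] += freq
        corpus = new_corpus  # Step 4: repeat with the updated symbols

    return merges, corpus


if __name__ == "__main__":
    merges, corpus = learn_bpe_merges(["the", "the", "then", "this"], num_merges=2)
    print(merges)        # the merge rules, in the order they were learned
    print(dict(corpus))  # each word as a tuple of the resulting symbols
```

With this small word list, “t” and “h” are glued together first, then “th” and “e”, so “the” ends up as a single piece while “this” stays split into “th”, “i”, “s”.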

BPE is a particularly powerful tokenization method, especially when dealing with diverse and extensive vocabularies. Here’s why:

Handling rare words: Traditional tokenization methods might stumble over rare or out-of-vocabulary words. BPE, with its ability to break words down into frequent subword units, can represent these words without needing to have seen them before.
Efficiency: By representing frequent word parts as single tokens, BPE can compress text more effectively. This is especially useful for models like ChatGPT, where token limits apply.
Adaptability: BPE is language-agnostic. It doesn’t rely on predefined dictionaries or vocabularies. Instead, it learns from the data, making it adaptable to various languages and contexts.

In essence, BPE strikes a balance, offering the granularity of character-level tokenization and the context-awareness of word-level tokenization. This hybrid approach ensures that NLP models like ChatGPT can understand a wide range of texts while maintaining computational efficiency.

Sending lots of text to ChatGPT

At time of writing, a message to ChatGPT via its web interface has a maximum token length of 4,096 tokens. If we assume the previously mentioned reduction of about 28% as an average, this means you could reduce text of up to 5,712 tokens down to the appropriate size with just text preprocessing.
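
For anyone who wants to check the arithmetic, here is a small worked calculation using the character counts reported earlier. It assumes, as the estimate does, that token counts shrink at the same rate as character counts; the variable names are just for illustration.

```python
original_chars = 104_892   # full newsletter text
processed_chars = 75_217   # after stop word removal and pruning
message_limit = 4_096      # web interface limit, in tokens

# Fraction of the text that survives preprocessing
remaining_ratio = processed_chars / original_chars
reduction_percent = (1 - remaining_ratio) * 100

# Largest input that shrinks down to the limit, assuming tokens
# shrink at the same rate as characters
max_original_tokens = round(message_limit / remaining_ratio)

print(f"Reduction: {reduction_percent:.1f}%")
print(f"Largest input that fits: {max_original_tokens} tokens")
```

In practice the token reduction may differ from the character reduction, so treat the result as a rough estimate.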

What about when this isn’t enough? Beyond text preprocessing, larger input can be sent in chunks using the OpenAI API. In my next post, I’ll show you how to build a Python module that does exactly that.

Mastering Git for Small Teams

2022-02-28T06:37:48-06:00

I’ve watched too many talented engineers spend their Friday afternoons untangling Git messes that could have been avoided with a simpler workflow. You know the scene: someone’s trying to merge a three-week-old feature branch, there are conflicts in files that haven’t been touched in months, and suddenly what should have been a five-minute deployment turns into a two-hour debugging session (with the whole team).

The solution isn’t mastering Git’s most obscure commands or memorizing every branching strategy ever invented. It’s adopting a workflow that prevents the chaos in the first place. Here’s the approach I use personally and recommend for small teams that want to ship code without the drama.

A Protected Main Branch (No Exceptions)

First rule: no human should have direct push permissions to your master branch. Ever. I don’t care if you’re the CTO, the person who started the repository, or the only one who “really understands the codebase.” The moment you start making exceptions is the moment you start breaking things in production.

Your main branch should be your source of truth for what’s currently deployed. When you create a release from the latest tag, that code should work. Period. If you’re not deploying frequently and automatically, you’re missing out on one of the biggest advantages of this approach.

One Issue, One Branch, One PR (Keep It Simple)

Here’s where most teams overcomplicate things. You’ve got your issues tracked somewhere (and if you don’t, we need to have a different conversation). Each issue represents a well-defined piece of work that can be merged and deployed without breaking anything. Maybe it’s a new feature, a component update, or a bug fix. Doesn’t matter—the process stays the same.

Author's illustration of issue branches and releases from master.

The key is keeping branches short-lived. For a small commercial team, we’re talking days, not weeks. Open source projects with volunteer contributors might stretch this to a few weeks or months, but the principle remains: finish the work, get it reviewed, merge it, and move on.

Here’s what this looks like in practice. Say you’re working on (#28) Add user settings page:

# Get all the latest work locally
git checkout master
git pull
# Start your new branch from master
git checkout -b 28/add-settings-page

Work on the issue, and periodically merge master to stay current:

# Commit to your issue branch
git commit ...
# Get the latest work on master
git checkout master
git pull
# Return to your issue branch and merge in master
git checkout 28/add-settings-page
git merge master

I know some of you are thinking “but what about rebasing?” Look, I like rebasing too. A clean, linear history is beautiful. But I’ve seen too many developers get tangled up in interactive rebasing purgatory while accidentally dropping commits or creating conflicts that didn’t need to exist. Merging might create a slightly messier history, but it’s predictable and reversible. When you’re optimizing for team productivity, predictable wins over pretty.

When your work is ready, open a PR against master. Tests run automatically. Your teammates review the code and leave helpful feedback (hopefully). Maybe you deploy a preview version to staging. Once everything looks good, merge it, close the issue, and delete the branch. (Yes. Delete it. It will be okay.)

Avoiding the Common Disasters

Here are the patterns I see that turn this simple workflow into a nightmare:

Branching off feature branches: This is how you end up with dependency chains that make merging feel like getting the Christmas lights out of storage. Someone starts working on feature B before feature A is merged, then feature C depends on both, and suddenly you need a whiteboard and a computer science degree to figure out the merge order. Just branch from the latest master. Always.

Scope creep on branches: You’re implementing the user settings page, but then you notice the button component could use some updates, and hey, while we’re at it, let’s refactor this entire authentication flow. Stop. That’s how a three-day task becomes a three-week (or three-month) PR that nobody wants to review. Stick to the issue at hand.

Keeping dead branches around: Your branch got merged last month, but it’s still sitting there in the repository like a ghost haunting your Git history. Delete merged branches immediately. Future you will thank present you for not having to scroll through fifty old feature branches trying to find the one you’re actually working on.

Why This Actually Works

This workflow works because it aligns with how small teams actually operate. You don’t need the complexity of GitFlow when you’ve got eight developers. You don’t need long-lived release branches when you’re deploying multiple times per week. You need a system that gets out of your way and lets you focus on building software.

The protection on master means your deployable code stays deployable. The one-issue-per-branch rule keeps PRs reviewable and prevents feature creep. The short-lived branches mean conflicts are small and manageable. The regular merging from master means you catch integration issues early when they’re easy to fix.

Most importantly, this workflow is boring in the best possible way. Once your team gets the hang of it, Git becomes background infrastructure instead of a daily source of stress. Developers stop losing work to merge conflicts. Code reviews become focused discussions about functionality rather than archaeology expeditions through weeks of accumulated changes.

The best development workflows are the ones you don’t have to think about. They handle the routine stuff automatically so you can focus on the interesting problems. This Git strategy does exactly that—it gets out of your way and lets you ship code with confidence.

Introducing The Tech Leader Docs

2021-12-21T06:28:06-06:00

I’m launching a brand new paid newsletter on Substack focused on building, growing, and leading your technology teams to success. It’s a short, no-time-wasted bi-weekly newsletter that will give you immediately applicable skills and strategies you can take to work that day.

Here are a few things past colleagues have said about my work in software engineering leadership:

"…the level of organization and process we had was amazing and I personally want to duplicate as much of it as possible!"

"…remember how excited I was in my interview with you Victoria about how well-organized [your previous company] was and how well-thought-out your processes were? I dare say 99.99% of startups, and many larger companies aren’t very organized and it becomes a pain point as they grow. Between the docs and the other processes you had in place, I think you could write a really good book that would be a ‘Blueprint’ for forming a technical team."

"@Victoria Drake you publish a book and I will buy at least a dozen copies to hand out."

Instead of waiting to collect all this information in book form, I’m sending out the first post the first week of January. Here’s a preview of some of the skills you’ll learn in future editions:

How to set up remote and asynchronous work to be super-efficient
How to remove the right restrictions to make your processes more productive
Setting up safeguards to ensure progress doesn’t slow down when you take time off
Hiring for the culture you want
Creating the number one secret weapon that makes everyone on your team more productive

These insights are for senior engineers and engineering managers, as well as anyone who wants to start establishing themselves as a leader on their team right away.

You can subscribe monthly, or lock in an early-bird New Year Special subscription for 30% off before Jan 1.

My website Victoria.dev isn’t going anywhere; I’ve just decided to put more effort into this new resource. The paid newsletter format strikes a good balance between the immediate delivery of information, skin-in-the-game for you to implement these ideas, and motivation for me to keep sharing how I acquired the skills and knowledge that live in my head, just as I’ve done for years on my blog.

I’ve held all kinds of roles on engineering teams, from contract developer to Director of Engineering. Over the years I’ve heard from past colleagues and peers about confusing processes, time-wasting meetings, and poor leadership in companies of all sizes. With The Tech Leader Docs, I hope to help those in positions to create positive change in their organization (including you!) to turn that feedback around.

Subscribe today and lock in 30% off. You’ll start 2022 with the practical skills it takes to build a successful engineering team: smarter strategies, less toil, and happier and more productive developers.

My paper to-do strategy

2021-10-25T12:17:32+00:00

Coding up a to-do app may be the Hello, World of every framework, but when it comes to actually tracking tasks effectively (knock ’em out not stack ’em up) there’s no app that keeps things front of mind better than an open notebook on your desk.

Here’s my stupid-simple strategy for tracking and checking off my to-do list.

One page at a time

Plenty of methodologies recommend using sections or different pages of your book for monthly, weekly, and daily views; others advocate for creating sections for each category, such as “Home Tasks” and “Work Tasks” and other such time-wasters. All of this is unnecessary.

A to-do list works because it’s in your face and hard to miss. When you write things down on different pages, they become easy to miss. Don’t do that.

Use one page at a time. Write down one task under another. Don’t sort them, prioritize them (yet), or categorize anything. Just write them down on the current page, where you’re guaranteed to look when you lay eyes on your notebook next.

Intuitive notation

I use my notebook for two things: short notes (just a bit of information – nothing to do) and tasks (something to do). This translates to a notation system of three possible states:

It’s a note, indicated with a bullet point
It’s a new task, indicated with a checkbox
It’s a completed task, with the checkbox checked and the line struck out (because strike-throughs are satisfying)

I use a checkbox to distinguish tasks from notes because I’m an old-school HTML fan, but you do you.

You may like to add your own embellishments to this: I sometimes denote an urgent item with an asterisk. You might like to use a color pen or highlighter (avoid the bullet journal rabbit hole – another time-waster). Just keep it simple, repeatable, and intuitive.

When it’s time to turn the page

When life gets busy, you might fill up a page pretty quickly. If one or two tasks haven’t yet been crossed off, they’re liable to be forgotten. You can avoid this by carrying tasks over to the next page.

It’s straightforward: cross out the task on the page that’s filled up. Turn the page and write it down there again.

That’s silly, you might say, that’s a waste of energy! By the time I write it down all over again, I could’ve done half of it already.

…

I’ll wait.

…

The clever bit about carrying a task over is taking the opportunity to evaluate it. If the task is really a five-minute thing, more often than not, I go ahead and take care of it right there and then. If it’s a longer endeavor, the friction of writing it down again gives me the chance to answer the question of whether it’s something I feel strongly about doing (and hence whether it’s really important that I do it at all). It might not be, and that’s fine. I cross it out and don’t do it. If it is an important task, carrying it over means it remains front of mind until I can make the time to get it done.

Time well spent doing

I’ve explored a myriad of task list apps, pre-printed to-do lists and journals, and all kinds of digital notes for tracking work. I consistently keep returning to the feel of pen on paper and an open notebook on my desk. Why? Minimal cognitive load.

No time spent categorizing and labeling tasks in a complicated system. No time spent remembering how to open that app, where you stored that todo.txt file, or deciding whether to write something down under your weekly or daily plan. No tasks lost in an invisible backlog that grows over the years, becoming more and more infeasible.

Just pen and paper, one page at a time, and the satisfaction of getting things done.

Set up a Pi-hole VPN on an AWS Lightsail instance

2021-10-07T11:01:13+00:00

I’ve written a fair bit in the past about the whys of online privacy, and a lot about staying safe online. Chances are, if a search brought you here, you’re well-past why. Let’s go straight on to how.

This guide will walk you through setting up Pi-hole on an AWS Lightsail instance that acts as your VPN thanks to OpenVPN. It’s a more succinct version of the official Pi-hole docs for OpenVPN, made specifically for Lightsail with a few tips and tricks added in, because you deserve it.

Create and connect to a Lightsail instance

Log in or sign up to AWS and create a Lightsail Instance.
Under Select a platform, choose Linux/Unix.
Under Select a blueprint, choose the OS Only button.
Select the latest officially supported Ubuntu server.
You can save a tidbit of effort by putting the following into the Launch script box:
```
# Update installed packages
sudo apt-get update
sudo apt-get upgrade -y
```
Create a new SSH key for this server and ensure you download the .pem.
Choose your plan. The $3.50 USD instance is sufficient.
Give it a name then click Create instance.
Stare eagerly at the page until the instance status is Running, then go to the Networking tab.
Create a Static IP and attach it to your new instance. Remember that static IP addresses are free only while attached to an instance.
Click on your instance name to return to its dashboard. Go back to the Networking tab. It’ll look a bit different now.
Under IPv6 networking, click the toggle to turn it off (unless you know what you are doing and you want IPv6 for some reason. Most of y’all don’t need it).
Under IPv4 Firewall, delete the rule for HTTP.
Click Add rule. In the Application dropdown, choose Custom.
- For Protocol, choose UDP.
- In the Port or range input, enter a UDP port for the OpenVPN server to run on. (It’s typically 1194, which you can choose to use, but you might like a different number for security purposes. Port range is 0-65535.)
Connect using SSH and your new key pair, either in your terminal or on the Connect tab with the browser-based client.

Install OpenVPN on your server

After connecting to your server using SSH, install OpenVPN on your server.

# Download OpenVPN
wget https://git.io/vpn -O openvpn-install.sh
chmod 755 openvpn-install.sh
sudo ./openvpn-install.sh

You’ll see:

Welcome to this OpenVPN road warrior installer!

This server is behind NAT. What is the public IPv4 address or hostname?
Public IPv4 address / hostname [x.xx.xxx.xxx]:

…where the default option is your static IP that you set up earlier. Hit return to accept this. Then:

Which protocol should OpenVPN use?
    1) UDP (recommended)
    2) TCP
Protocol [1]: 1

Choose 1 or hit return. Then:

What port should OpenVPN listen to?
Port [1194]: #####

Enter the UDP port number you chose earlier. Then:

Select a DNS server for the clients:
    1) Current system resolvers
    2) Google
    3) 1.1.1.1
    4) OpenDNS
    5) Quad9
    6) AdGuard
DNS server [1]: 1

Choose 1 or hit return. Then:

Enter a name for the first client:
Name [client]: pihole

The Pi-hole will be the client. Name it as you like then Press any key to continue...

OpenVPN will set itself up. Confirm that tun0 has the interface address 10.8.0.1/24 with the following command:

ip addr show tun0

This ensures that the Pi-hole will be set up properly. Now, about that:

Install and configure Pi-hole

On your Lightsail instance, install Pi-hole.

# Download and install Pi-hole
curl -sSL https://install.pi-hole.net | bash

This runs the Pi-hole automated installer. You’ll see some prompts which you can answer using the enter key, arrow keys, tab, and space bar for selecting an option.

The important things:

When you see Choose An Interface, ensure you pick tun0. It isn’t the default selection.
You’ll need to set the IPv4 address to the interface address you viewed previously using the ip addr command: 10.8.0.1/24. This ensures the Pi-hole uses the VPN.

At time of writing, the second item above wasn’t presented as an option in the automated installer. After the Pi-hole installer finishes, manually change the IP address by editing the configuration file:

sudo vim /etc/pihole/setupVars.conf

Change the IPV4_ADDRESS to 10.8.0.1/24 and save the file. Restart the Pi-hole with: pihole restartdns.

If you mess up, you can redo the configuration with pihole reconfigure.

Finally, you’ll configure the VPN to use the Pi-hole.

Configure OpenVPN

Confirm the address of the tun0 interface with:

ip a | grep -C 1 'tun0'

You should see: inet 10.8.0.1/24 in there.
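
If you’d rather not eyeball the output, here’s a minimal Python sketch of the same check. The sample text is illustrative only (real output will differ in detail), and `interface_addresses` is a hypothetical helper name, not part of any tool used in this guide.

```python
import ipaddress
import re

# Illustrative output only; what `ip a` prints on your instance will differ.
SAMPLE = """\
4: tun0: <POINTOPOINT,MULTICAST,NOARP,UP,LOWER_UP> mtu 1500 state UNKNOWN
    link/none
    inet 10.8.0.1/24 scope global tun0
       valid_lft forever preferred_lft forever
"""


def interface_addresses(ip_output):
    """Return every IPv4 'inet' address found in `ip addr` style output."""
    found = re.findall(r"inet (\d+\.\d+\.\d+\.\d+/\d+)", ip_output)
    return [ipaddress.ip_interface(text) for text in found]


expected = ipaddress.ip_interface("10.8.0.1/24")
print(expected in interface_addresses(SAMPLE))
```

To run it against the live interface, you could feed it the text returned by `subprocess.run(["ip", "addr", "show", "tun0"], capture_output=True, text=True).stdout` instead of the sample.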

Edit the OpenVPN config file with:

sudo vim /etc/openvpn/server/server.conf

Change the line that starts with push "dhcp-option… to use the Pi-hole’s IP address that you confirmed above:

push "dhcp-option DNS 10.8.0.1"

If any other lines start with push "dhcp-option…, comment those out.
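
If you’d like to script this edit instead of doing it by hand, here’s a rough Python sketch of the same change applied to the config text. The function name and the sample config are assumptions for illustration; your actual server.conf will contain more than this, so review the result before writing it back.

```python
def point_dns_at_pihole(config_text, pihole_ip="10.8.0.1"):
    """Comment out existing dhcp-option pushes and push the Pi-hole as DNS."""
    wanted = f'push "dhcp-option DNS {pihole_ip}"'
    result = []
    for line in config_text.splitlines():
        stripped = line.strip()
        if stripped == wanted:
            continue  # already present; it is re-added once at the end
        if stripped.startswith('push "dhcp-option'):
            result.append("# " + line)  # keep the old line for reference
        else:
            result.append(line)
    result.append(wanted)
    return "\n".join(result) + "\n"


# Hypothetical excerpt of a server.conf, for illustration only.
SAMPLE = '''port 1194
proto udp
push "dhcp-option DNS 8.8.8.8"
push "dhcp-option DNS 8.8.4.4"
keepalive 10 120
'''

print(point_dns_at_pihole(SAMPLE))
```

The function is written so that running it twice gives the same result, which makes it safer to re-run if you aren’t sure whether the change was already applied.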

If you want to log OpenVPN traffic, add these lines to the end of the file:

log /var/log/openvpn.log
verb 3

Save the config. If you forgot to open Vim with sudo, use the tee trick: :w !sudo tee %, then O, then :q!.

Restart OpenVPN with sudo systemctl restart openvpn-server@server.

Configure firewall

Run the following to control traffic to the server as described here.

sudo iptables -I INPUT -i tun0 -j ACCEPT
sudo iptables -A INPUT -i tun0 -p tcp --destination-port 53 -j ACCEPT
sudo iptables -A INPUT -i tun0 -p udp --destination-port 53 -j ACCEPT
sudo iptables -A INPUT -i tun0 -p tcp --destination-port 80 -j ACCEPT
sudo iptables -A INPUT -p tcp --destination-port 22 -j ACCEPT
# If you chose a different OpenVPN port earlier, use it here in place of 1194
sudo iptables -A INPUT -p tcp --destination-port 1194 -j ACCEPT
sudo iptables -A INPUT -p udp --destination-port 1194 -j ACCEPT
sudo iptables -I INPUT -m state --state RELATED,ESTABLISHED -j ACCEPT
sudo iptables -I INPUT -i lo -j ACCEPT
sudo iptables -P INPUT DROP

# Optionally, also block HTTPS advertisements while you're here.
sudo iptables -A INPUT -p udp --dport 80 -j REJECT --reject-with icmp-port-unreachable
sudo iptables -A INPUT -p tcp --dport 443 -j REJECT --reject-with tcp-reset
sudo iptables -A INPUT -p udp --dport 443 -j REJECT --reject-with icmp-port-unreachable

You can review the results with sudo iptables -L --line-numbers.

These rules are only stored in memory until you save them, so test out your setup on your client now to see if it all works as expected.

Test your client connection

To test your configuration, try adding a client (the phone or computer that will connect to the VPN).

Run the OpenVPN script again: sudo ./openvpn-install.sh and choose 1) Add a new client. Give it a name; you may find it helps to name it by the device, e.g. “phone”. This creates a file that ends in .ovpn. You need to place this file on your client to use it.
Install the appropriate OpenVPN app for your device.
Transfer the .ovpn file you just obtained to the device if you haven’t already. (See future tasks for a way to copy the file to your host machine.) Follow instructions in your app (try under FAQ) for importing the .ovpn file and activating the VPN.
Ensure it seems to connect properly. If you go to DuckDuckGo.com and search for “What’s my IP”, you should see the location of your Lightsail instance. For a more in-depth test, check for DNS leaks at BrowserLeaks.com.

Try browsing for a while. You can also view the Pi-hole dashboard by visiting http://pi.hole/admin/ on this device.

If everything seems all right, go on to saving the configuration on your instance.

Save `iptables`

Save the iptables rules you created earlier. Both steps need root: sudo runs iptables-save, and piping into sudo tee lets the file be written with root permission as well.

sudo iptables-save | sudo tee /etc/pihole/rules.v4

You’re finished with configuration on your Lightsail instance. If you wish to disconnect now, you can just type exit.

Future tasks

You’re done with the set up! You now have your very own personal VPN with a Pi-hole keeping you safe from nasty trackers. Here are some references for operations you might like to come back to in the future:

Reconnect to your Lightsail instance with SSH:
- ssh -i /path/to/private-key.pem ubuntu@public-ip-address
Set a password for the web interface dashboard:
- pihole -a -p
Access the web interface dashboard:
- Connect to the VPN, then visit http://pi.hole/admin/
Update the Pi-hole:
- pihole -up
Add a new client (for iOS, Linux, or Windows, or for Android)
Copy the .ovpn file for a client to your host machine (run on the host machine):
- ssh -i /path/to/private-key.pem ubuntu@public-ip-address 'sudo cat /path/on/lightsail/client.ovpn' > /path/on/host/client.ovpn
Beef up that block list! Here’s my favorite resource for updating your Pi-hole adlist table: The Big Blocklist Collection

Enjoy your new, more secure and peaceful Internet! If you found this guide helpful, please share it with someone else.

Beyond Gut Feelings: How I Use Issue Metrics to Boost Engineering Velocity

2021-08-30T05:35:02+00:00

How long does it take for a bug to get squashed, or for a pull request to be merged? What kind of issues take the longest to close?

Most organizations want to improve productivity and output, but few technical teams seem to take a data-driven approach to discovering productivity bottlenecks. If you’re looking to improve development velocity, a couple key metrics could help your team get unblocked. Here’s how you can apply a smidge of data science to visualize how your repository is doing, and where improvements can be made.

Getting quality data

The first and most difficult part, as any data scientist would likely tell you, is ensuring the quality of your data. It’s especially important to consider consistency: are dates throughout the dataset presented in a consistent format? Have tags or labels been applied under consistent rules? Does the dataset contain repeated values, empty values, or unmatched types?
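
As a quick illustration of those questions, here’s a minimal Pandas sketch that counts repeated rows, empty values, and dates that fail to parse. The records are made up for the example, and the column names simply mirror the kind of fields issue data tends to have.

```python
import pandas as pd

# Hypothetical records shaped like a tiny slice of issue data.
records = [
    {"number": 1, "created_at": "2021-08-01T10:00:00Z", "closed_at": "2021-08-03T10:00:00Z"},
    {"number": 2, "created_at": "2021-08-02T09:30:00Z", "closed_at": None},
    {"number": 2, "created_at": "2021-08-02T09:30:00Z", "closed_at": None},  # repeated row
    {"number": 3, "created_at": "not a date", "closed_at": "2021-08-05T12:00:00Z"},
]
frame = pd.DataFrame(records)

# Repeated values: rows whose issue number has already appeared.
duplicate_rows = int(frame.duplicated(subset="number").sum())

# Empty values, counted per column.
empty_per_column = frame.isna().sum()

# Inconsistent dates: errors="coerce" turns unparseable values into NaT.
parsed = pd.to_datetime(frame["created_at"], errors="coerce", utc=True)
unparseable_dates = int(parsed.isna().sum())

print(duplicate_rows, int(empty_per_column["closed_at"]), unparseable_dates)
```

An empty closed_at isn’t necessarily a data problem; for issue data it may just mean the item is still open.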

If your repository has previously changed up processes or standards, consider the timeframe of the data you collect. If issues have been labeled arbitrarily, the labels may not be a useful feature. While cleaning data is outside the scope of this article, I can, at least, help you painlessly collect it.

I wrote a straightforward Python utility that uses the GitHub API to pull data for any repository. You can use this on the command line and output the data to a file. It uses the list repository issues endpoint (docs), which, perhaps confusingly, includes both issues and pull requests (PRs) for the repository. I get my data like this:

$ python fetch.py -h
usage: fetch.py [-h] [--token TOKEN] repository months
$ python fetch.py OWASP/wstg 24 > data.json
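
The utility itself isn’t reproduced here. As a rough sketch of the two ideas involved (not the actual fetch.py), the snippet below builds a request URL for the list repository issues endpoint and separates issues from PRs, relying on the fact that PR entries in that response carry a `pull_request` key. The function names are hypothetical, and a month is approximated as 30 days.

```python
from datetime import datetime, timedelta, timezone
from urllib.parse import urlencode

API_ROOT = "https://api.github.com"


def issues_url(repository, months, page=1, now=None):
    """Build a list-repository-issues URL covering roughly the last `months` months."""
    now = now or datetime.now(timezone.utc)
    # Assumption: 30 days per month is close enough for a rough cutoff.
    since = now - timedelta(days=30 * months)
    query = urlencode({
        "state": "all",  # include closed items, not only open ones
        # `since` filters on last-updated time, expressed as ISO 8601 in UTC.
        "since": since.strftime("%Y-%m-%dT%H:%M:%SZ"),
        "per_page": 100,
        "page": page,
    })
    return f"{API_ROOT}/repos/{repository}/issues?{query}"


def split_issues_and_prs(items):
    """Separate plain issues from PRs; PR entries include a 'pull_request' key."""
    issues = [item for item in items if "pull_request" not in item]
    prs = [item for item in items if "pull_request" in item]
    return issues, prs


print(issues_url("OWASP/wstg", 24))
```

Actually requesting the URL (and paging through results, optionally with a token) is left out to keep the sketch short.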

Using the GitHub API means less worry about standardization; for example, all the dates are expressed as ISO 8601. Now that you have some data to process, it’s time to play with Pandas.

Plotting with Pandas

You can use a Jupyter Notebook to do some simple calculations and data visualization.

First, create the Notebook file:

touch stats.ipynb

Open the file in your favorite IDE, or in your browser by running jupyter notebook.

In the first code cell, import Pandas and load your data:

import pandas as pd

data = pd.read_json("data.json")
data

You can then run that cell to see a preview of the data you collected.

Pandas is a well-documented data analysis library. With a little imagination and a few keyword searches, you can begin to measure all kinds of repository metrics. For this walk-through, here’s how you can calculate and create a graph that shows the number of days an issue or PR remains open in your repository.

Create a new code cell and, for each row in your DataFrame, subtract the date it was created from the date it was closed:

duration = pd.Series(data.closed_at - data.created_at)
duration.describe()

Series.describe() will give you some summary statistics that look something like these (from mypy on GitHub):

count                           514
mean      5 days 08:04:17.239299610
std      14 days 12:04:22.979308668
min                 0 days 00:00:09
25%          0 days 00:47:46.250000
50%                 0 days 06:18:47
75%          2 days 20:22:49.250000
max               102 days 20:56:30
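
One detail worth knowing when reading these numbers: items that are still open have no closed date, so their duration comes out as a missing value (NaT) and isn’t counted. Here’s a small self-contained sketch with made-up dates that shows the effect.

```python
import pandas as pd

# Made-up data: two closed items and one that is still open.
frame = pd.DataFrame({
    "created_at": pd.to_datetime(
        ["2021-08-01T00:00:00Z", "2021-08-02T00:00:00Z", "2021-08-03T00:00:00Z"], utc=True
    ),
    "closed_at": pd.to_datetime(
        ["2021-08-01T06:00:00Z", "2021-08-12T00:00:00Z", None], utc=True
    ),
})

duration = frame["closed_at"] - frame["created_at"]

# The open item yields NaT; drop it to work with closed items only.
closed_only = duration.dropna()

# Whole days each closed item stayed open.
days_open = closed_only.dt.days

print(duration.isna().sum(), closed_only.median(), days_open.tolist())
```

If open items matter for your question, you could measure them against the current time instead of dropping them.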

Series.plot() uses a specified plotting backend (matplotlib by default) to visualize your data. A histogram can be a helpful way to examine issue duration:

duration.apply(lambda x: x.days).plot(kind="hist")

This will plot a histogram that represents the frequency distribution of issues over days, which is one way you can tell how long most issues take to close. For example, mypy seems to handle the majority of issues and PRs within 10 days, with some outliers taking more than three months.

It would be interesting to visualize other repository data, such as its most frequent contributors, or most often used labels. Does a relationship exist between the author or reviewers of an issue and how quickly it is resolved? Does the presence of particular labels predict anything about the duration of the issue?
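
As one possible starting point for the label question, here’s a minimal sketch that groups duration by label. It assumes each item has a `labels` list of objects with a `name` field (which is how the GitHub API returns them) and a precomputed, hypothetical `days_open` column; the data is invented for the example.

```python
import pandas as pd

# Invented data: labels shaped like the API's list of objects.
frame = pd.DataFrame({
    "labels": [[{"name": "bug"}], [{"name": "bug"}, {"name": "docs"}], []],
    "days_open": [2, 10, 1],
})

# Reduce each list of label objects to a list of label names.
frame["label_names"] = frame["labels"].apply(
    lambda labels: [label["name"] for label in labels]
)

# One row per (item, label); unlabeled items become NaN and are dropped.
per_label = frame.explode("label_names").dropna(subset=["label_names"])

label_counts = per_label["label_names"].value_counts()
median_days = per_label.groupby("label_names")["days_open"].median()

print(label_counts.to_dict(), median_days.to_dict())
```

With only a handful of items per label, a median like this says very little, so treat it as a prompt for further questions rather than a conclusion.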

You aim for what you measure

Now that you have some data-driven superpowers, remember that they come with great responsibility. Deciding what to measure is just as important as measuring it, if not more so.

Consider how to translate the numbers you gather into productivity improvements. For example, if your metric is closing issues and PRs faster, what actions can you take to encourage the right behavior in your teams? I’d suggest encouraging issues to be clearly defined, and pull requests to be small and have a well-contained scope, making them easier to understand and review.

To prepare to accurately take measurements for your repository, establish consistent standards for labels, tags, milestones, and other features you might want to examine. Remember that meaningful results are more easily gleaned from higher quality data.

Finally, have fun exercising your data science skills. Who knows what you can discover and improve upon next!

There are better options for a privacy-respecting phone

2021-08-11T11:37:35+00:00

Whether you think the news of Apple scanning your private devices was a big deal, run-of-the-mill, or something we all should have seen coming, you might be wondering, “What now?” We know full well that Google is looking at the stuff on your phone too (and Gmail, and… well, everywhere else) so it’s not like there are other options after Apple… right?

If a move towards privacy is what we’re after, we know a new off-the-shelf Google phone isn’t a better answer – but there are more options.

If you don’t want the details, jump straight to The TL;DR at the end.

Linux phones (sort of)

Unless you’re a rather tolerant tech-savvy tinkerer, a Linux phone isn’t one of these options… yet. I’ve personally been very excited about the bevy of emerging options in this space, from freedom-oriented hardware to fully open source, crowd-developed operating systems.

The current state of these efforts is that this magical mashup just isn’t ready yet. Most Linux phone operating systems, such as Ubuntu Touch, Mobian, and PureOS, are in a “mostly working” state, with the missing features ranging from “lack of reliable push notifications” to “intermittent Bluetooth connectivity” to “camera.”

If all you need is text messaging and a web browser, yes, you can probably go this route. For most users however, this isn’t going to make daily-driver status.

If a Linux phone would suit you, I recommend getting your hands on a PinePhone and running Arch Linux ARM (releases on GitHub) with Plasma Mobile.

De-googled Android

For a daily-driver, “de-googled” Android is your best bet. Android itself (specifically, the Android Open Source Project source code) is based on a modified Linux kernel and is free and open source software. When we typically think of “Android phones,” we refer to Android devices with Google’s proprietary software added to the mix, including Google Play Services. A “de-googled” Android phone is essentially the Android OS without Google’s ~~spyware~~ services included by default.

Keep in mind that this route still involves some DIY. You’ll need to install an OS on a device yourself. Don’t worry, there are step-by-step guides available – the most technical thing you’ll likely have to do is copy and paste some commands into your terminal.

Free and open source Android OS comes in multiple flavors, and the choice isn’t arbitrary. Your selection of a “de-googled” phone is going to be determined by a couple factors: the hardware device you have or that you want to use, and the apps (software) you want to run on it.

Hardware

The phone you may already have (or the one you’re willing to purchase) will influence your choice of operating system (OS).

LineageOS

At the time I’m writing this, if you have an older Pixel or another model of Android phone, your best bet for a hassle-free OS with A-class support will be Lineage. Here’s a link to the LineageOS list of supported devices. Clicking on your device here will get you to some installation instructions for your phone.

GrapheneOS

If you have a newer Pixel (generation 3 up to the newer 5) then GrapheneOS could be the way to go. Here are the devices officially supported by GrapheneOS. They also have easy-to-follow installation instructions and help via chat. It is possible to run GrapheneOS on other phones, but not without substantial DIY for which technical knowledge would help.

Generally speaking, GrapheneOS is intended to be a security-hardened operating system targeted at individuals who won’t be miffed if there are tradeoffs for mitigating vulnerabilities. If you don’t have those requirements or intend to use Google Apps on your phone (see Software), then LineageOS will likely suit you better.

New phone, who dis?

If you’re looking to purchase a new phone, you have some flexibility. My general recommendation is to pick up last-season’s version of the model you want. Not only will this likely be cheaper (and often a great deal if you buy refurbished) but the open source community that develops these operating systems will have had more time to work with the device itself, which could help ensure better compatibility and a smoother set up.

Consider buying a refurbished phone (sometimes called “renewed”) locally when you can. This can help fund the small businesses that offer them.

Software

What do you need to do on your phone? Privacy and convenience are typically at odds (a far larger topic I won’t dig into right now) so it can help to narrow down the functionality you need. If your needs look something like:

Calls and texts
Web browser
Web-based email via browser

Then you’re good to go, right out of the box, with either LineageOS or GrapheneOS. They’ll both include free and open source apps that let you do all these things.

If you want a particular application that doesn’t come pre-installed, here’s where we get into some nuance. Your choices depend on the level of privacy you’d like to maintain. Here are your avenues for installing apps, listed in order of preference.

1. Official APKs

Some particularly privacy-focused applications offer an Android Package Kit (APK) that you can download directly in order to install the app. You should only download these when you’ve navigated directly to a domain that the organization owns. Here are my favorites:

You can download and install APKs whether you choose LineageOS or GrapheneOS.

2. Use F-Droid

If you can’t find an APK for something you want, search for it on F-Droid.

The F-Droid software repository allows you to download and install apps in much the same way that the Google Play store does, with a couple notable differences. All the apps here are free and open source, and no account or profile is required to download them. The F-Droid APK itself can be downloaded and installed from f-droid.org directly on either LineageOS or GrapheneOS.

Just like any open source software, it’s up to the user (you) to ensure that you’re downloading and installing software you trust. If you want help or advice, F-Droid has a healthy community that you can interact with in lots of ways, including via IRC, Matrix, and the Fediverse.

You can find an app for pretty much anything here: from your general-store type functions such as to-do lists, music players, and maps; to specific niche security applications, and even a tea timer. Here are some well-known choices I can easily recommend:

3. Aurora Store

If you need an app that isn’t available on F-Droid, your next stop is the Aurora Store. This is an unofficial client for the Google Play Store that lets you download free applications anonymously, without signing into a Google account. Most applications found in the larger stores can be downloaded this way, without requiring Google’s proprietary stuff on your phone.

When loading Aurora Store for the first time, be sure to choose the “Anonymous” option instead of signing in.

The Aurora Store itself can be installed via F-Droid or auroraoss.com. It works on either LineageOS or GrapheneOS – however, apps that require less private permissions or access will probably work better on LineageOS.

Keep in mind that your phone OS in no way supports these apps directly, or knows what’s in them, or what sort of tracking and information exchange they may be up to. It’s a slight privacy downgrade, but still better than a fully Google-ified OS.

4. If you need Google Apps

If this will be your only phone and you simply must have Google Apps on it (think Google Play Store, Gmail, Calendar, Photos, etc) then go with LineageOS. You can choose to try emulating Google Play Services using LineageOS for microG, or install the Google Apps add-on when you install LineageOS.

The TL;DR

Here’s the “Internet personality quiz” version of everything above. You are…

Knowledgeable about Linux; mostly use a phone for text, calls, and web browser; and potentially want to help develop Linux phone software.
- Try a Linux phone such as the PinePhone, but consider one of the other options as a back up for when you just need stuff to work.
Security or privacy inclined, happy to use FOSS apps, or do most things via web browser anyway.
- Get your hands on a Pixel 3{XL, a, a XL}, Pixel 4{XL, a, a 5G}, or Pixel 5, and use GrapheneOS. Installation instructions here.
- Optionally, download the F-Droid or Aurora Store APKs for apps.
Someone who needs Google Apps to work, or you want a phone that isn’t a Pixel, or you’re setting up a device for someone who’s fine using Android but needs it to look familiar.
- Use LineageOS with any of its supported devices. Click on the device name for installation instructions.
- If you must have Google Apps and need Google Play Services to work, install the add-on at the same time you install LineageOS.
- Optionally, download the F-Droid or Aurora Store for installing apps.

Whichever route you choose, my advice is to treat this like a learning experiment. You’re sort of building your own phone, after all, and gaining all the technological independence that comes with that knowledge. If possible, don’t ditch your current phone until you try out one (two?) of these paths. The one you end up liking most could surprise you! It’s great to have options.

The Doorway Problem: Why Building in Isolation Fails

2021-08-09T03:17:49+00:00

It’s a comedy classic—you’ve got a grand idea. Maybe you want to build a beautiful new dining room table. You spend hours researching woodcraft, learn about types of wood and varnish, explore different styles of construction, and now you have a solid plan. You buy the wood and other materials. You set up in the garage. For months you measure and saw, sand, hammer and paint. Finally, the effort pays off. The table is finished, and it’s fantastic.

In a frenzy of accomplishment you drag it into the house—only to discover that your dining room doorway is several inches too small. It doesn’t fit.

You might say this comedic example is unrealistic. Of course an experienced DIY-er would have measured the doorway first. But in real life, unforeseen problems rarely come solo. Once you finally get the table through the door (after removing the legs and reassembling it inside), you discover the floor’s uneven. The chairs you chose are a few inches too short. The ceiling light hangs too low. Each solution creates new problems you never anticipated.

I’ve seen this exact pattern play out dozens of times in software development, just with different furniture. Teams spend months building features in isolation, only to discover they don’t fit through the “doorways” of real user workflows, existing infrastructure, or business constraints. The solution isn’t better planning—it’s building in context from the start.

The Planning Fallacy (Or: Why We’re All Terrible at This)

Few software developers are accurate when it comes to time and cost estimates. This isn’t a failing of engineers specifically—it’s a deeply human tendency toward optimism when predicting our own future. First proposed by Daniel Kahneman and Amos Tversky in 1979, the planning fallacy explains why our estimates are consistently wrong.

In one study, students were asked to estimate how long they’d take to finish their senior theses. The estimates averaged 27.4 days at the optimistic end and 48.6 days at the pessimistic end. The actual completion time? 55.5 days. Even the pessimistic estimates were too optimistic.

The researchers proposed two main reasons: first, people focus on their future plans rather than their past experiences; second, people don’t think past experiences matter much to the future anyway.

You can probably find examples of this in your own recent project history. Sure, that last “two-day feature” turned into a two-week affair, but that was only because the API documentation was wrong. Or maybe you didn’t finish that database migration when planned, but that was only because you discovered the staging environment was configured differently than production. You’re absolutely, positively, definitely certain that next time will be different.

The reality is that we’re terrible at factoring in the unexpected daily demands of building software.

Legacy code behaves mysteriously. Third-party services have undocumented quirks. Staging environments don’t match production. Users do things we never anticipated. Some measure of ignorance about these complications probably keeps us sane enough to start new projects.

But some measure of accurate planning is also necessary for success. The solution is working in context as much as possible, rather than trying to plan for every contingency.

Context Is Your Reality Check

Let’s reconsider the dining room table story. Instead of spending months out in the garage, what would you do differently to build in context?

You might say, “Build it in the dining room!” While that would be ideal for context, it’s rarely possible in homes or software development. Instead, you do the next best thing: start building, and make frequent visits to context.

Having decided you want to build a table, one of the first questions is “How big will it be?” You’ll have requirements to fulfill (must seat six, must match other furniture, must hold the weight of your annual twenty-eight-course Christmas feast) that lead you to a rough decision.

With a size in mind, you build a mock-up. At this point, the specific materials, style, and color don’t matter—only the three dimensions. Once you have your mock table, you can make your first trip to the context where it will ultimately live. Attempting to carry your foam/wood/cardboard/balloon animal mock-up into the dining room will reveal issues you never considered, and possibly new opportunities as well. Perhaps, though you’d never have thought it, a modern abstractly-shaped dining table would better complement the space. You can take this into account in your next higher-fidelity iteration.

This translates directly to software development, minus the Christmas feast. You may recognize this as the MVP approach, but even here, putting the MVP in context is a step that’s frequently omitted.

I’ve seen teams spend months building a “simple” user authentication system, only to discover that their company’s SSO provider doesn’t support the OAuth flow they built around. Or teams that create beautiful interfaces that completely break when real user data (with its inconsistent formats and edge cases) gets loaded. Where will your product ultimately live? How will it be accessed? What does real data look like?

Building your MVP and attempting to deploy it with realistic constraints will uncover these issues when they’re still manageable.

Even when teams have prior experience with technologies, remember the planning fallacy. People naturally discount past evidence to the point of forgetting. It’s also unlikely that the same exact team is building the same exact product as last time. The language, technology, framework, and infrastructure have likely changed—as have the capabilities and bandwidth of the engineers. Frequent visits to context help you run into issues early, adapt to them, and create short feedback loops.

Go for Good Enough (Then Iterate)

The specific meaning of putting something in context varies from project to project. It might mean deploying to cloud infrastructure, running on a new server, or testing whether your remote office can access the same resources you use. In all cases, keep those short iterations going. Don’t wait to get a version to 100% before finding out if it works in context. Ship it at 80%, see how close you got, then iterate.

This approach feels risky if you’re used to planning everything upfront. But the alternative—discovering fundamental incompatibilities after months of work—is much riskier. Better to learn that your table won’t fit through the door when it’s still made of cardboard than when it’s solid oak.

The best software gets built by teams that understand the difference between the theoretical problem they’re solving and the real environment where their solution needs to work. Context is messy, unpredictable, and full of constraints you never anticipated. That’s exactly why you need to visit it early and often.

Your garage is perfect for focused work, but your dining room is where people actually eat dinner. Build for where your software will really live, not where it’s convenient to develop it.

How to Think Like a Hacker (And Why Your Team Should Too)

2021-07-27T04:26:26-04:00

The most effective security-minded developers I know share one trait: they’re professionally suspicious of their own assumptions. They look at a form field and wonder what happens if someone tries to enter something unexpected. They design an API endpoint and ask how someone might misuse it. They have a systematic curiosity about how systems behave versus how they’re supposed to behave.

I saw this firsthand while working with a team where questioning assumptions became a regular part of our code review process. We’d look at every new feature and ask “How might someone abuse this?” I developed a particular talent for finding injection attacks on forms—apparently I have a knack for thinking of creative ways to sneak SQL queries into text fields. After the third or fourth time I caught these vulnerabilities during review, we added validation middleware to eliminate that entire class of problems.
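For illustration, here is a minimal sketch of the kind of input that sneaks a query into a text field, alongside one common defense. It uses parameterized queries with Python's built-in sqlite3 module, which is a different technique from the validation middleware our team added. The table, function names, and sample data are all hypothetical.

```python
import sqlite3

def find_user_unsafe(conn, username):
    # Vulnerable: the input is pasted straight into the SQL string
    query = f"SELECT id, username FROM users WHERE username = '{username}'"
    return conn.execute(query).fetchall()

def find_user_safe(conn, username):
    # Parameterized: the driver treats the input strictly as data
    return conn.execute(
        "SELECT id, username FROM users WHERE username = ?", (username,)
    ).fetchall()

def make_demo_db():
    # In-memory database with two hypothetical users
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE users (id INTEGER PRIMARY KEY, username TEXT)")
    conn.executemany(
        "INSERT INTO users (username) VALUES (?)", [("alice",), ("bob",)]
    )
    return conn

conn = make_demo_db()
malicious = "nobody' OR '1'='1"
leaked = find_user_unsafe(conn, malicious)  # the OR clause matches every row
blocked = find_user_safe(conn, malicious)   # no user has this literal name
```

The unsafe version lets the text field rewrite the query itself, while the safe version can only ever compare against the literal string.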

But the real breakthrough was watching how the team’s thinking evolved. Once developers got used to questioning their assumptions about user behavior, they started writing more robust solutions from the start. Security thinking became a starting point rather than something bolted on afterward.

Designing for Reality, Not Just Intent

One of the most effective practices we developed was specifying both the “happy path” and the “unhappy path” during our design process. The happy path was straightforward—everything happens in the way and sequence we intended. But the unhappy paths were where we learned the most: what happens when steps occur out of order? When data is missing or provided in an unexpected format? When external systems fail at exactly the wrong moment?

This dual-path thinking transformed how we approached every feature. Instead of just asking “How should this work?” we started asking “How will this actually be used?” and “What should happen when reality doesn’t match our expectations?” It sounds pessimistic, but it actually made development more fun. It caused us to think about our application from all angles rather than just implementing obvious functionality.

The unhappy path exercise revealed assumptions we didn’t even know we were making. We’d design a user registration flow assuming people would fill out forms completely and submit them once. Then we’d consider reality: What if someone submits the form multiple times? What if they navigate away and come back? What if they fill out the form, wait an hour, then submit it after their session expires?

Each unhappy path scenario led to better design decisions. Race condition handling. Idempotent endpoints. Graceful degradation when external services are unavailable. The code that protected against malicious users also handled legitimate users experiencing network glitches or browser crashes.
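As one example of where that thinking can lead, here is a minimal sketch of an idempotent registration handler. It assumes the client sends a unique key with each form submission; the class name, the key format, and the in-memory dictionary (standing in for a real datastore) are all hypothetical.

```python
import threading

class RegistrationService:
    def __init__(self):
        self._results = {}             # idempotency key -> stored response
        self._lock = threading.Lock()  # guards against concurrent duplicate submits
        self.accounts_created = 0

    def register(self, idempotency_key, email):
        with self._lock:
            # A repeated key means a retry or double-click: replay the old response
            if idempotency_key in self._results:
                return self._results[idempotency_key]
            self.accounts_created += 1
            response = {
                "status": "created",
                "email": email,
                "account_id": self.accounts_created,
            }
            self._results[idempotency_key] = response
            return response

service = RegistrationService()
first = service.register("form-abc123", "pat@example.com")
second = service.register("form-abc123", "pat@example.com")  # duplicate submit
```

The duplicate submission gets the same response back and no second account is created, whether it came from an impatient user or a flaky network retry.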

Systematic Questioning as a Superpower

There’s a particular mindset that effective security thinking requires—call it systematic skepticism. It’s the ability to look at any system and ask “What assumptions is this making?” and “What happens when those assumptions are wrong?” This kind of thinking makes your software more robust.

Sometimes this means channeling your inner four-year-old—pushing every button, ignoring all instructions, using things in ways their makers never intended. But rather than random exploration, you develop structured ways of challenging system boundaries, finding edge cases, and being creative about the ways that software can be used beyond its intended purpose.

This systematic questioning makes you better at every aspect of development. When you’re used to thinking about edge cases and unexpected inputs, you write more defensive code naturally. When you habitually consider what could go wrong, you build better (more useful) error handling. When you assume users will do unexpected things, you design more intuitive interfaces.

I’ve noticed that developers who adopt this questioning mindset become significantly better at debugging production issues too. Instead of being surprised when something breaks, they’re already thinking “What unexpected condition triggered this?” They approach problems with methodical curiosity rather than frustrated confusion.

Building a Culture of Constructive Skepticism

The key to building security-conscious teams isn’t teaching people to be afraid of attackers—it’s helping them develop genuine curiosity about system behavior under stress. When questioning assumptions becomes intellectually interesting rather than anxiety-inducing, your team will start doing it automatically.

Code reviews become more engaging when everyone is looking for unspoken assumptions about user behavior. Feature planning gets more thorough when “What are the unhappy paths?” is a standard question alongside “What should it do?” Architecture discussions become more robust when you’re considering not just how systems should work together, but how they should behave when dependencies are slow, unavailable, or returning unexpected data.

The practical implementation is surprisingly straightforward. During development, encourage your team to spend time being deliberately unreasonable with whatever they’re building. During design reviews, spend equal time on happy and unhappy paths. During testing, encourage your team to think like someone who’s never seen your application before and doesn’t understand the rules.

What emerges is a team that builds more resilient systems without extra effort. When you’re accustomed to thinking about failure modes, you naturally design systems that handle them gracefully. When you expect users to ignore instructions, you build interfaces that guide them toward success even when they’re not following the intended flow.

Security as Engineering Excellence

What I’ve learned is that security thinking is really just rigorous engineering thinking with a creative twist. It’s the same mental process you use when debugging complex issues or designing APIs that won’t confuse future developers. You’re considering multiple perspectives, anticipating edge cases, and designing for resilience rather than just functionality.

The most successful security-conscious teams I’ve worked with don’t have dedicated security experts who review everything after the fact—they have developers who think about security implications as naturally as they think about performance or usability. This happens through cultural reinforcement and consistent practice, not through mandates or compliance checklists.

The payoff extends far beyond security. Teams that think about unhappy paths build more reliable software. Developers who consider malicious inputs write better input validation for legitimate users. Engineers who design for system failures create more robust integrations. The skills reinforce each other in ways that make everyone more effective.

Most importantly, this approach makes engineering work more intellectually satisfying. There’s something deeply rewarding about anticipating problems and solving them before they happen. When your team develops the habit of systematically questioning their assumptions, they’ll approach every problem with the kind of methodical curiosity that leads to truly robust solutions.

You can help your team become professionally curious about system boundaries, failure modes, and the gap between how software is supposed to work and how it actually gets used. Once they develop that mindset, they’ll write more secure code naturally, because they’ll view software the same way attackers do—as systems that can fail when someone does something unexpected.

A GitHub guide for non-technical leaders

2021-05-24T00:00:00+00:00

As I write this, the front page of GitHub.com declares in big bold letters that this is “Where the world builds software.” This is true. In technology companies today, the creation of your product is largely happening where your developers spend time. It’s where big and small product decisions are made every day – the kind of decisions that, wittingly or not, will decide the future of your company.

I’m writing this guide for a very specific person – possibly you, or someone you know. I’ll explain how a non-technical business leader can find information and take part in the decisions and questions that happen only on GitHub. You don’t need to know how to use Git. You just need a few minutes to follow along, and a desire to be a resource and servant leader for your teams. Let’s do it!

If you haven’t signed up yet, click below to read the very first steps. Once you’re logged in, read on to join in!

The very first steps

Digital resilience: redundancy for websites and communications

2021-02-22T04:00:43-05:00

When what seems like half the planet noped out of WhatsApp after its terms of service update, applications like Signal (which I highly recommend) saw an unprecedented increase in user traffic. Signal had so many new users sign up that it overwhelmed their existing infrastructure and led to a 24-hour-ish outage.

Signal is experiencing technical difficulties. We are working hard to restore service as quickly as possible.
— Signal (@signalapp) January 15, 2021

The small team responded impressively quickly, especially given that a 4,200% spike in new users was utterly implausible before it occurred.

The downside of so many people moving onto this fantastic application is that it caused a brief outage. If you rely solely on a certain application for your communications, brief outages can be debilitating. Even when it seems implausible that your favorite chat, email, or website service could just – poof – vanish overnight, recent events have proved it isn’t impossible.

Have a backup plan. Have several. Here’s how you can improve your digital resiliency for things like websites, messaging, and email.

Messaging

I recommend Signal because it is open source, end-to-end encrypted, cross-platform, and offers text, voice, video, and group chat. It’s usually very reliable; however, strange things can happen.

It’s important to set up a backup plan ahead of any service outages with the people you communicate with the most. Have an agreement for a secondary method of messaging – ideally another end-to-end encrypted service. Avoid falling back on insecure communications like SMS and social media messaging. Here’s a short list for you to explore:

If you’re particularly technically inclined, you can set up your own self-hosted chat service with Matrix.

Having a go-to plan B can help bring peace of mind and ensure you’re still able to communicate when strange things happen.

Cloud contacts

Do you know the phone numbers of your closest contacts? While memorizing them might not be practical, storing them solely online is an unnecessary risk. Most services allow you to export your contacts to vCard or CSV format.

I recommend keeping your contacts locally on your device whenever you can. This ensures you still know how to contact people if your cloud provider is unavailable, or if you don’t have Internet access.

Full analog redundancy is also possible here. Remember that paper stuff? Write down the phone numbers of your most important contacts so you can access them if your devices run out of battery or otherwise can’t turn on (drop your phone much?).

Local email synchronization

If your email service exists solely online, there’s a big email-shaped hole in your life. If you can’t log in to your email for any reason – an outage on their end, a billing error, or your Internet is down – you’ll have no way to access your messages for however long your exile lasts. If you think about all the things you do via email in a day, I think the appropriate reaction to not having local copies is 🤦.

Download an open source email client like Thunderbird. Follow instructions to install Thunderbird and set it up with your existing online email service. Your online service provider may have a help document that shows you how to set up Thunderbird.

You can maximize your privacy by turning off Thunderbird’s telemetry.

To ensure that Thunderbird downloads your email messages and stores them locally on your machine:

Click the “hamburger” overflow menu and go to Account Settings
Choose Synchronization & Storage in the sidebar
Ensure that under Message Synchronizing, the checkbox for Keep messages in all folders for this account on this computer is checked.

You may need to visit each of your folders in order to trigger the initial download.

Some other settings you may want to update:

Choose Composition & Addressing and uncheck the box next to Compose messages in HTML format to send plaintext emails instead.
Under Return Receipts choose Global Preferences. Select the radio button for Never send a return receipt.

You don’t need to start using Thunderbird for all your email tasks. Just make sure you open it up regularly so that your messages sync and download to your machine.

Websites

I strongly believe you should have your own independent website for reasons that go beyond redundancy. To truly make your site resilient, it’s important to have your own domain.

If you know that my website is at the address victoria.dev, for example, it doesn’t matter whether I’m hosting it on GitHub Pages, AWS, WordPress, or from a server in my basement. If my hosting provider becomes unavailable, my website won’t go down with it. Getting back up and running would be as simple as updating my DNS configuration to point to a new host.

Price is hardly an excuse, either. You can buy a domain for less than a cup of coffee with my Namecheap affiliate link (thanks!). Namecheap also handles your DNS settings, so it’s a one-stop shop.

With your own domain, you can build resiliency for your email address as well. Learn how to set up your custom domain with your email provider. If you need to switch providers in the future, your email address ports to the new service with you. Here are a few quick links for providers I’d recommend:

Build your digital resiliency

I hope you’ve found this article useful on your path to building digital resiliency. If you’re interested in more privacy topics, you might like to learn about great apps for outsourcing security.

If your threat model includes anonymity or censorship, building digital resiliency is just a first step. The rest is outside the scope of my blog, but here are a few great resources I’ve come across:

Create a self-hosted chat service with your own Matrix server

2021-02-15T01:38:07-05:00

Matrix is an open standard for decentralized real-time communication. The specification is production-ready and bridges to tons of silo products like Slack, Gitter, Telegram, Discord, and even Facebook Messenger. This lets you use Matrix to link together disjoint communities in one place, or create an alternative communication method that works with, but is independent of, communication silos.

You can create your own self-hosted Matrix chat for as little as $3.50 USD per month on an AWS Lightsail instance. Your homeserver can federate with other Matrix servers, giving you a reliable and fault-tolerant means of communication.

Matrix is most widely installed via its Synapse homeserver implementation written in Python 3. Dendrite, its second-generation homeserver implementation written in Go, is currently released in beta. Dendrite will provide more memory efficiency and reliability out-of-the-box, making it an excellent choice for running on a virtual instance.

Here’s how to set up your own homeserver on AWS Lightsail with Dendrite. You can also contribute to Dendrite today.

Create a Lightsail instance

Spin up a new Lightsail instance on AWS with Debian as your operating system. It’s a good idea to create a new per-instance key for use with SSH. You can do this with the SSH key pair manager on the instance creation page. Don’t forget to download your private key and .gitignore your secrets.

Click Create Instance. Wait for the status of your instance to change from Pending to Running, then click its name to see further information. You’ll need the Public IP address.

To enable people, including yourself, to connect to the instance, go to the Networking tab and add a firewall rule for HTTPS. This will open port 443 so you can connect over IPv4. You can also do this for IPv6.

Connect DNS

Give your instance a catchier address by buying a domain at Namecheap and setting up DNS records.

On your domain management page in the Nameservers section, choose Namecheap BasicDNS.
On the Advanced DNS tab, click Add New Record.

Add an A Record that points to your Lightsail Public IP. You can use a subdomain if you want one, for example:

Type: A Record
Host: matrix
Value: 13.59.251.229

This points matrix.example.org to your Lightsail instance.

Set up your Matrix homeserver

Change permissions on the private key you downloaded:

chmod 600

Then SSH to your Public IP:

ssh -i  admin@

Welcome to your instance! You can make it more interesting by downloading some packages you’ll need for Dendrite. It’s a good idea to use apt for this, but first you’ll want to make sure you’re getting the latest stuff.

Dec 2021 update: As the good people of Mastodon point out, you might like to ensure you’re choosing the stable version for Debian. For instance, replace buster below with what’s “stable” at the moment.

Change your sources list in order to get the newest version of Go:

sudo vim /etc/apt/sources.list

Delete everything except these two lines:

deb http://cdn-aws.deb.debian.org/debian buster main
deb-src http://cdn-aws.deb.debian.org/debian buster main

Then replace the distributions:

:%s/buster main/testing main contrib non-free/g

Run sudo apt dist-upgrade. If you’re asked about modified configuration files, choose the option to “keep the local version currently installed.”

Once the upgrade is finished, restart your instance with sudo shutdown -r now.

Go make some coffee, then SSH back in. Get the packages you’ll need with:

sudo apt update
sudo apt upgrade
sudo apt install -y git golang nginx python3-certbot-nginx

You’re ready to get Dendrite.

Get Dendrite

Clone Dendrite and follow the README instructions to get started. You’ll need to choose whether you want your Matrix instance to be federating. For simplicity, here’s how to set up a non-federating deployment to start:

git clone https://github.com/matrix-org/dendrite
cd dendrite
./build.sh

# Generate a Matrix signing key for federation (required)
./bin/generate-keys --private-key matrix_key.pem

# Generate a self-signed certificate (optional, but a valid TLS certificate is normally
# needed for Matrix federation/clients to work properly!)
./bin/generate-keys --tls-cert server.crt --tls-key server.key

# Copy and modify the config file - you'll need to set a server name and paths to the keys
# at the very least, along with setting up the database connection strings.
cp dendrite-config.yaml dendrite.yaml

Configure Dendrite

Modify the configuration file you just copied:

vim dendrite.yaml

At minimum, set:

server_name to your shiny new domain name, e.g. matrix.example.org
disable_federation to true or false
registration_disabled to true or false

You might like to read the Dendrite FAQ.

Configure nginx

Get the required packages if you didn’t already install them above:

sudo apt install nginx python3-certbot-nginx

Create your site’s configuration file under sites-available with:

cd /etc/nginx/sites-available
ln -s /etc/nginx/sites-available/ /etc/nginx/sites-enabled/
sudo cp default

Edit your site configuration. Delete the root and index lines if you don’t need them, and input your server name.

Your location block should look like:

location / {
    proxy_pass https://localhost:8448;
}

Remove the default with: sudo rm /etc/nginx/sites-enabled/default.

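Create TLS certificates

You can use Certbot to obtain certificates from Let’s Encrypt. Unlike the self-signed certificate generated earlier, these are issued by a certificate authority that clients trust by default.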

sudo certbot --nginx -d

If you don’t want to give an email, add the --register-unsafely-without-email flag.

Test your configuration and restart nginx with:

sudo nginx -t
sudo systemctl restart nginx

Then start up your Matrix server.

# Build and run the server:
./bin/dendrite-monolith-server --tls-cert server.crt --tls-key server.key --config dendrite.yaml

Your Matrix server is up and running at your web address! If you disabled registration in your configuration, you may need to create a user. You can do this by running the included dendrite/bin/createuser.
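If you'd like a quick sanity check before opening a client, here is a small sketch that asks the homeserver which spec versions it supports. The /_matrix/client/versions endpoint is part of the Matrix Client-Server API; the helper names are hypothetical, matrix.example.org is a placeholder for your own domain, and the fake opener exists only so the function can be tried without a live server.

```python
import io
import json
import urllib.request

def versions_url(server_name):
    # /_matrix/client/versions is defined by the Matrix Client-Server API
    return f"https://{server_name}/_matrix/client/versions"

def supported_versions(server_name, opener=urllib.request.urlopen):
    # `opener` is injectable so this can be exercised without a network
    with opener(versions_url(server_name)) as response:
        return json.load(response).get("versions", [])

def fake_opener(url):
    # Stand-in for a live homeserver; the version string is made up
    return io.BytesIO(b'{"versions": ["v1.1"]}')

offline = supported_versions("matrix.example.org", opener=fake_opener)

# Against a real server, drop the opener argument:
# supported_versions("matrix.example.org")
```

A JSON response with a list of versions suggests nginx is proxying to Dendrite as intended. A connection or certificate error points back at the nginx and Certbot steps.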

You can log on to your new homeserver with any Matrix client, or Matrix-capable applications like Pidgin with the Matrix plugin.

Other troubleshooting

Log files

If you get an error such as:

... [github.com/matrix-org/dendrite/internal/log.go:155] setupFileHook
  Couldn't create directory /var/log/dendrite: "mkdir /var/log/dendrite: permission denied"

You’ll need to create a spot for your log files. Avoid the bad practice of running stuff with sudo whenever you can. Instead, create the necessary directory with the right permissions:

sudo mkdir /var/log/dendrite
sudo chown admin:admin /var/log/dendrite

# Build and run the server:
./bin/dendrite-monolith-server --tls-cert server.crt --tls-key server.key --config dendrite.yaml

Unable to decrypt

If you see: Unable to decrypt: The sender's device has not sent us the keys for this message. you may need to verify a user (sometimes yourself).

In your client, open the user’s profile. Click the lock icon if there is one, or otherwise look for a way to verify them.
You may be asked to see if some emojis presented to both users match if you’re using certain clients like Element.
You can then re-request encryption keys for any sent messages.

Set up your own Matrix server today

I hope you found this introduction to setting up your own Matrix homeserver to be helpful!

Do I Raise or Return Errors in Python?

2021-02-09T05:34:48-05:00

I’ve been writing Python for nearly a decade, and this question still comes up in code reviews more often than you’d think. Should I raise an exception or return an error value? It seems simple on the surface, but the choice ripples through your entire codebase in ways that can make or break your team’s productivity six months down the line.

The Real Question Behind the Question

When your function discovers something’s wrong, you’re not only choosing between raise and return. You’re making a decision about how your entire application will handle failure, how readable your code will be for the next person, and how many 3 AM prod debugging sessions you’re setting up for your future self and team.

Here’s how I think about this choice, because the right one for your application affects everything from your error logs to your team’s velocity.

When I Reach for Exceptions

I raise exceptions when something genuinely unexpected happens—when the assumptions my function was built on just got violated. If I’m writing a function to parse a config file and the file doesn’t exist, that’s exceptional. The caller expected a valid config, and I can’t deliver on that contract.

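import json
import os

class ConfigurationError(Exception):
    """Raised when the config file exists but can't be parsed."""

def load_config(filepath):
    if not os.path.exists(filepath):
        raise FileNotFoundError(f"Config file not found: {filepath}")

    try:
        with open(filepath, 'r') as f:
            return json.load(f)
    except json.JSONDecodeError as e:
        # Chain the original error so the traceback keeps the parse details
        raise ConfigurationError(f"Invalid JSON in config file: {e}") from e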

Here’s why exceptions work well here: the calling code doesn’t need to check every single operation. If any step fails, the exception bubbles up to whoever can actually handle it. Your main application logic stays clean, and error handling happens at the right level.

The business impact here is huge. When your core logic isn’t cluttered with error checking, you can focus on the actual problem you’re solving. Your functions do one thing well, and your error handling is centralized where it belongs.
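To make the bubbling concrete, here is a small sketch with three layers. Only the outermost one knows what a bad config should mean for the application. The function names and the shape of the config are hypothetical.

```python
import json

class ConfigurationError(Exception):
    """Hypothetical error type for unusable configuration."""

def parse_config(text):
    try:
        return json.loads(text)
    except json.JSONDecodeError as e:
        raise ConfigurationError(f"Invalid JSON in config: {e}") from e

def build_app(config_text):
    # No error checking here: a ConfigurationError passes straight through
    config = parse_config(config_text)
    return {"name": config["name"], "debug": config.get("debug", False)}

def main(config_text):
    # The one place that decides what a config failure means
    try:
        app = build_app(config_text)
    except ConfigurationError as e:
        return f"startup failed: {e}"
    return f"started {app['name']}"
```

The middle layer, build_app, stays focused on building the app. It never has to inspect a return code or pass one along.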

When I Return Error Values

But sometimes the “error” isn’t really an error—it’s just one of several possible outcomes. When I’m building a user search function, finding zero results isn’t exceptional. It’s totally normal behavior that the caller needs to handle anyway.

def search_users(query):
    results = database.search(query)
    if not results:
        return []  # Empty list, not an exception
    return results

# Calling code feels natural
users = search_users("john")
if users:
    display_users(users)
else:
    show_no_results_message()

This approach shines when you have multiple valid outcomes and the caller needs to make decisions based on which one occurred. It also works well for performance-critical code where exception handling overhead matters.

The Type Safety Angle

Here’s something I’ve started caring about more as codebases grow: how well does your choice play with static type checking? Modern Python with type hints changes the game significantly.

With exceptions, your function signature stays clean:

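class InvalidUserIdError(ValueError):
    """Raised when user input can't be parsed as a user ID."""

def parse_user_id(user_input: str) -> int:
    try:
        return int(user_input)
    except ValueError as e:
        raise InvalidUserIdError("User ID must be a number") from e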

But with return values, you’re often dealing with unions:

def parse_user_id(user_input: str) -> int | None:
    try:
        return int(user_input)
    except ValueError:
        return None

That | None propagates through your entire codebase. Every function that calls this one now has to handle the None case, and mypy will remind you of that fact. Sometimes that’s exactly what you want—explicit error handling at every level. Other times, it creates unnecessary complexity.

The Performance Reality Check

What about performance? Yes, exceptions are slower than returning values, but context matters enormously here.

In tight loops processing thousands of items per second, that overhead can add up. Profiling code where you’ve switched from exceptions to return values might show improved performance of 20-30%. But in typical web application code where you’re dealing with database calls and network requests, exception overhead is noise compared to everything else.
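If you want to measure this for your own workload rather than take a number on faith, a sketch along these lines is a reasonable starting point. The parsing functions are deliberately simplified (they only accept non-negative integers) and hypothetical. Results vary by machine, Python version, and how often the failure path is actually taken.

```python
import timeit

def parse_raising(value):
    try:
        return int(value)
    except ValueError:
        raise ValueError(f"not a number: {value!r}") from None

def parse_returning(value):
    # Simplified: check up front instead of raising (non-negative integers only)
    return int(value) if value.isdecimal() else None

def count_bad_with_exceptions(items):
    bad = 0
    for item in items:
        try:
            parse_raising(item)
        except ValueError:
            bad += 1
    return bad

def count_bad_with_returns(items):
    return sum(1 for item in items if parse_returning(item) is None)

items = ["123", "oops"] * 5_000  # half the inputs are invalid

exception_time = timeit.timeit(lambda: count_bad_with_exceptions(items), number=20)
return_time = timeit.timeit(lambda: count_bad_with_returns(items), number=20)
print(f"exceptions: {exception_time:.3f}s, return values: {return_time:.3f}s")
```

Try changing the ratio of valid to invalid inputs. When failures are rare, the gap tends to shrink, because a try block that never raises costs very little.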

The more important performance consideration is often developer performance. How quickly can someone understand your code? How easily can they modify it without introducing bugs? I’ve seen teams spend weeks debugging subtle issues that wouldn’t have existed with clearer (documented!) error handling patterns.

Patterns That Actually Work in Production

After working on systems that handle millions of requests, here are the patterns I keep coming back to:

For library code: Raise exceptions. Libraries don’t know how their callers want to handle errors, so push that decision up the stack. Custom exception types help callers decide what to catch and what to let bubble up.

For user input validation: Usually return structured error information. Users make mistakes constantly, and that’s normal behavior, not exceptional.

def validate_email(email: str) -> ValidationResult:
    if not email:
        return ValidationResult(valid=False, error="Email is required")
    if "@" not in email:
        return ValidationResult(valid=False, error="Invalid email format")
    return ValidationResult(valid=True)

For external service calls: This is tricky. Network timeouts and service errors happen, but they’re not exactly exceptional in a distributed system. I often use exceptions for the truly unexpected (DNS resolution failures) and return values for the predictable failures (rate limiting, temporary service unavailability).

The 3AM System Down Test

Here’s my ultimate thought experiment: if something broke in production and you had to debug it at 3 AM, bleary-eyed and chugging coffee, which approach helps you understand what went wrong faster?

Good exceptions with detailed error messages and proper stack traces are incredible for this. You can see exactly where things went wrong and why. But exceptions that get swallowed or re-raised without context are debugging nightmares.
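The difference is easy to see side by side. In this sketch the function names and the PaymentError type are hypothetical. The `raise ... from e` form is standard Python and keeps the original exception attached as `__cause__`, so both appear in the traceback.

```python
class PaymentError(Exception):
    """Hypothetical domain error."""

def parse_amount_swallowing(amount_text):
    try:
        return float(amount_text)
    except ValueError:
        return None  # the reason for the failure is lost here

def parse_amount_with_context(order_id, amount_text):
    try:
        return float(amount_text)
    except ValueError as e:
        # Say which order and which input, and keep the original error chained
        raise PaymentError(
            f"order {order_id}: could not parse amount {amount_text!r}"
        ) from e
```

At 3 AM, the second version tells you which order failed, what the input was, and where the underlying error came from. The first leaves you with a None to chase through the logs.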

Return values with proper logging can also be great for debugging, especially when you need to understand the sequence of events that led to a problem. But they require more discipline—you need to actually check and log those return values.

Making the Choice

When I’m looking at a specific function, I ask myself:

Is this condition truly unexpected given the function’s contract?
Do callers need to make immediate decisions based on this failure?
How will this pattern scale across my team and codebase?
What will debugging look like when this inevitably breaks?

There’s no universal right answer, but there are patterns that work well for different situations. The key is being intentional about your choice and consistent within your codebase.

Your error handling strategy affects how quickly new team members can contribute, how easy it is to track down production issues, and how confident you can be when making changes. Choose patterns that serve your team’s long-term productivity, not just today’s immediate problem.

The best choice for error handling is the one that helps you sleep better at night, knowing that when something goes wrong, you’ll be able to figure out what happened and fix it quickly.


What Tech Leaders Do Before Going on Vacation

2021-02-01T04:02:54-06:00

Early in my career, I worked on a team where the CEO decided to take two weeks off without much preparation. By the middle of the first week, people had “run out” of things to do. Not because there wasn’t work—there was plenty—but because no one knew what they were supposed to prioritize, who could make decisions, or how to move forward on anything that required input from leadership.

We spent those two weeks in a weird organizational limbo, working on whatever seemed important while bigger decisions piled up. Upon returning, the CEO was frustrated that so little had been accomplished, and the team was frustrated that they’d been left without clear direction. It was a perfect example of how taking time off as a leader requires completely different preparation than taking time off as an individual contributor.

The reality is that leadership vacation planning isn’t about finishing your own work—it’s about ensuring your team can function effectively without you. Done well, it’s actually a powerful way to develop your team’s autonomy and decision-making capabilities. Done poorly, it creates exactly the kind of organizational dysfunction I witnessed firsthand.

The Information Bottleneck Problem

Here’s what most leaders don’t realize: you’re probably a bigger bottleneck than you think. Not because you’re micromanaging, but because critical context lives in your head that your team needs access to in order to make good decisions. The challenge isn’t documenting everything you know—that’s impossible. The challenge is identifying what your team will actually need while you’re gone.

I’ve learned to approach this systematically. Instead of trying to dump all my knowledge, I focus on the specific work my team will be doing during my absence. What decisions might come up that I can provide context for? What blockers could they encounter and who could help in my absence? Who will take the lead on making decisions to help keep projects moving forward?

This exercise often reveals gaps in team communication that extend beyond vacation planning.

If people don’t know how to prioritize work when you’re gone for a week, they probably struggle with prioritization day-to-day more than you realize.

Vacation prep becomes a forcing function for better ongoing delegation.

The practical approach is straightforward: review your priority list and write down the context and contacts that your team will need to get work done while you’re away. But the deeper value is discovering where your team needs more autonomy and decision-making authority in general.

Decision-Making Without You

The most common mistake I see leaders make is trying to pre-decide everything that might come up while they’re away. This is both impossible and counterproductive. Instead, the goal should be empowering your team to make good decisions using the same framework you would use.

Before any significant time off, I have explicit conversations with my team about what kinds of decisions they can make independently and what should wait for my return. More importantly, I explain the reasoning behind those boundaries so they understand when to escalate and when to proceed.

More than just being on the same page, these boundaries help to build your team’s confidence in their own judgment.

When people understand your decision-making criteria and feel trusted to apply them, they’ll make better choices whether you’re away on vacation or away in a meeting.

The key is being specific about decision authority rather than vague about “checking with me first.” Instead of saying “let me know if anything important comes up,” try “you can approve any engineering changes that don’t affect the database schema, but flag anything that requires downtime for discussion when I’m back.”

Creating Clarity, Not Chaos

The difference between teams that thrive when their leader is away and teams that stagnate comes down to clarity of expectations. Your team needs to know not just what to work on, but how to make trade-offs when priorities conflict, who to go to for different types of help, and what success looks like in your absence.

I’ve found that internal communication about your time off is just as important as external auto-responders.

A quick message to your team explaining where to find information, who’s covering what responsibilities, and how to handle common scenarios prevents a lot of confusion and hesitation.

But the real test is whether your team feels empowered to act or feels like they’re in caretaker mode until you return. The goal is maintaining momentum, not just maintaining the status quo. This requires trusting your team with meaningful work and giving them the context they need to handle unexpected situations.

The Leadership Development Opportunity

Your vacation is actually a development opportunity for your team if you set it up intentionally.

When you step back temporarily, you create space for other people to step up, make decisions, and take on leadership responsibilities.

Instead of just hoping things will be fine while you’re gone, use your absence as a chance to test and develop your team’s capabilities. Give someone the opportunity to run meetings, handle stakeholder communication, or make technical decisions that they’re ready for but haven’t had the chance to practice.

The preparation for this kind of delegation is more involved than just finishing your own work, but the payoff is enormous. You return to a team that’s more capable and confident, and you’ve identified who’s ready for additional responsibilities. Plus, you’ve stress-tested your team’s ability to function without you, which is valuable information for organizational resilience.

Making Time Off Actually Restful

The irony of leadership is that taking time off can be stressful if you’re worried about what’s happening while you’re away. The best vacation preparation eliminates that anxiety by ensuring your team has everything they need to succeed without you.

This means being honest about your availability expectations and sticking to them. If you tell your team you’ll be completely offline, don’t check Slack “just once” and end up getting pulled into work discussions. If you’re going to check in periodically, be specific about when and how, so people know what to expect.

The teams that handle leadership time off best are the ones where this kind of preparation is routine, not exceptional. When delegation, clear communication, and decision-making authority are part of your regular management practice, preparing for vacation becomes straightforward rather than stressful.

Your time off should leave your team more capable, not less. When you return from vacation to find that your team tackled challenges, made good decisions, and maintained momentum without you, you’ll know you’ve built something sustainable. That’s not just good vacation planning—it’s good leadership.

Add search to Hugo static sites with Lunr

2021-01-26T09:25:17-05:00

Yes, you can have an interactive search feature on your static site! No need for servers or paid subscriptions here. Thanks to the open source Lunr and the power of the Hugo static site generator, you can create a client-side search index with just a template and some JavaScript.

A number of my readers have been kind enough to tell me that you find my blog useful, but there’s something that you don’t know. Up until I recently implemented a search feature on victoria.dev, I had been my own unhappiest user.

My blog exists for all to read, but it’s also my own personal Internet brain. I frequently pull up a post I’ve written when trying to re-discover some bit of knowledge that I may have had the foresight to record. Without a search, finding it again took a few clicks and more than a few guesses. Now, all my previous discoveries are conveniently at my fingertips, ready to be rolled into even more future work.

If you’d like to make your own personal Internet brain more useful, here’s how you can implement your own search feature on your static Hugo site.

Get Lunr

While you can install lunr.js via npm or include it from a CDN, I chose to vendorize it to minimize network impact. This means I host it from my own site files by placing the library in Hugo’s static directory.

You can save your visitors some bandwidth by minifying lunr.js, which I did just by downloading lunr.js from source and using the JS & CSS Minifier Visual Studio Code extension on the file. That brought the size down roughly 60% from 97.5 KB to 39.35 KB.

Save this as static/js/lunr.min.js.

To easily place your search form wherever you like on your site, create the form as a partial template at layouts/partials/search-form.html

<form id="search"
    action='{{ with .GetPage "/search" }}{{.Permalink}}{{end}}' method="get">
    <label hidden for="search-input">Search site</label>
    <input type="text" id="search-input" name="query"
    placeholder="Type here to search">
    <input type="submit" value="search">
</form>

Include your search form in other templates with:

{{ partial "search-form.html" . }}

Create a search page

For your search to be useful, you’ll need a way to trigger one. You can create a (static!) /search page that responds to a GET request, runs your search, and displays results.

Here’s how to create a Hugo template file for a search page and get it to render.

Create layouts/search/list.html with the following minimum markup, assuming you’re inheriting from a base template:

{{ define "main" }}
{{ partial "search-form.html" . }}

<ul id="results">
    <li>
        Enter a keyword above to search this site.
    </li>
</ul>
{{ end }}

In order to get Hugo to render the template, a matching content file must be available. Create content/search/_index.md to satisfy this requirement. The file just needs minimal front matter to render:

---
title: Search me!
---

You can run hugo serve and navigate to /search to see if everything builds as expected.

A few libraries exist to help you build a search index and implement Lunr. You can find them here on the Hugo site. If you want to fully understand the process, however, you’ll find it’s not complicated to do this without additional dependencies, thanks to the power of Hugo’s static site processing.

Build your search index

Here’s how to build an index for Lunr to search using Hugo’s template rendering power. Use range to loop over the pages you want to make searchable, and capture your desired parameters in an array of documents. One way to do this is to create layouts/partials/search-index.html with:

<script>
window.store = {
    // You can specify your blog section only:
    {{ range where .Site.Pages "Section" "posts" }}
    // For all pages in your site, use "range .Site.Pages"
    // You can use any unique identifier here
    "{{ .Permalink }}": {
        // You can customize your searchable fields using any .Page parameters
        "title": "{{ .Title  }}",
        "tags": [{{ range .Params.Tags }}"{{ . }}",{{ end }}],
        "content": {{ .Content | plainify }}, // Strip out HTML tags
        "url": "{{ .Permalink }}"
    },
    {{ end }}
}
</script>

<script src="/js/lunr.min.js"></script>
<script src="/js/search.js"></script>

When Hugo renders your site, it will build your search index in much the same way as a List page is built, creating a document for each page with its parameters.
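To make that concrete, here is roughly what the rendered index could look like for a site with two posts. The URLs, titles, tags, and content are hypothetical placeholders, and the object is assigned to a plain store constant rather than window.store so the snippet also runs outside a browser.

```javascript
// Hypothetical rendered output of the search-index partial for two posts.
// On a real page this object would be assigned to window.store.
const store = {
    "https://example.com/posts/first-post/": {
        "title": "First post",
        // The template leaves a trailing comma in the array, which is valid JavaScript
        "tags": ["hugo", "search",],
        "content": "Plain text of the first post with HTML tags stripped.",
        "url": "https://example.com/posts/first-post/"
    },
    "https://example.com/posts/second-post/": {
        "title": "Second post",
        "tags": [],
        "content": "Plain text of the second post.",
        "url": "https://example.com/posts/second-post/"
    },
}

// Each key is a page's permalink, which later serves as the Lunr ref
console.log(Object.keys(store))
```

Keying the object by permalink means a search result's ref can be used directly to look up the page's title, URL, and content.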

The last piece of the puzzle is the code to handle the search process: taking the search query, getting Lunr to perform the search, and displaying the results.

Perform the search and show results

Create static/js/search.js to hold the JavaScript that ties it all together. This file has three main tasks: get the search query, perform the search with Lunr, and display the results.

Get query parameters with JavaScript

This part’s straightforward thanks to URLSearchParams:

const params = new URLSearchParams(window.location.search)
// 'query' matches the name attribute of the search input
const query = params.get('query')
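Submitting the form sends the browser to the search page with the input’s value in the query string, under the key given by the input’s name attribute. A small sketch of that round trip, using a hard-coded hypothetical query string in place of window.location.search:

```javascript
// Hypothetical query string, as produced by submitting the form with "hugo lunr"
const exampleSearch = '?query=hugo+lunr'
const exampleParams = new URLSearchParams(exampleSearch)

// The key must match the name attribute of the search input;
// "+" in a form-encoded query string decodes to a space
const exampleQuery = exampleParams.get('query')

// A key that is absent from the URL yields null, so guard before searching
const missing = exampleParams.get('nope')

console.log(exampleQuery, missing)
```

If the search page is opened directly without a query string, get returns null, so it is worth checking for a value before handing it to Lunr.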

Search for the query with Lunr

Define and configure an index for Lunr. This tells Lunr what you’d like to search with, and you can optionally boost elements that are more important.

const idx = lunr(function () {
    // Search these fields
    this.ref('id')
    this.field('title', {
        boost: 15
    })
    this.field('tags')
    this.field('content', {
        boost: 10
    })

    // Add the documents from your search index to
    // provide the data to idx
    for (const key in window.store) {
        this.add({
            id: key,
            title: window.store[key].title,
            tags: window.store[key].tags,
            content: window.store[key].content
        })
    }
})

You can then execute the search and store results with:

const results = idx.search(query)
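Lunr returns an array of result objects ordered by relevance, each carrying the ref of a matching document and a score. The sample below is hand-written for illustration (the refs, scores, and the tiny stand-in for the store are all hypothetical) and shows how a ref maps back to the indexed page:

```javascript
// Hand-written sample in the shape Lunr returns; values are hypothetical
const sampleResults = [
    { ref: '/posts/first-post/', score: 1.42, matchData: { metadata: {} } },
    { ref: '/posts/second-post/', score: 0.37, matchData: { metadata: {} } }
]

// Stand-in for window.store, keyed by the same identifiers used as refs
const sampleStore = {
    '/posts/first-post/': { title: 'First post' },
    '/posts/second-post/': { title: 'Second post' }
}

// Look up each result's full document by its ref
const titles = sampleResults.map(result => sampleStore[result.ref].title)
console.log(titles)
```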

Display results

You’ll need a function that builds a list of results and displays them on your search page. Recall the id you gave your ul element in layouts/search/list.html and store it as a variable:

const searchResults = document.getElementById('results')

If a search results in some results (🥁), you can iterate over them and build a <li> element for each one.

if (results.length) { // Length greater than 0 is truthy
    let resultList = ''
    for (const n in results) {
      // Use the unique ref from the results list to get the full item
      // so you can build its <li>
      const item = store[results[n].ref]
      resultList += '<li><p><a href="' + item.url + '">' + item.title + '</a></p>'
      // Add a short clip of the content
      resultList += '<p>' + item.content.substring(0, 150) + '...</p></li>'
    }
    searchResults.innerHTML = resultList
}

For each of your results, this produces a list item similar to:

<li>
    <p>
        <a href=".../blog/add-search-to-hugo-with-lunr/">
        Add search to Hugo static sites with Lunr
        </a>
    </p>
    <p>Yes, you can have an interactive search feature on your static site!...</p>
</li>

If there are no results, ham-handedly insert a message instead.

else {
    searchResults.innerHTML = 'No results found.'
}

Full code for search.js

Here’s what static/js/search.js could look like in full.

search.js full code
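The author’s complete file is not reproduced here, so the following is only one possible way to assemble the snippets above into a single static/js/search.js. It makes a few assumptions: the renderResults helper is a hypothetical name introduced to keep the HTML building separate from the page wiring, the query key is 'query' to match the form input’s name, and the browser-only code is wrapped in a guard so the file can also be loaded where window and lunr are not defined.

```javascript
// Build the HTML for a list of Lunr results.
// Hypothetical helper: it touches no DOM, so it can run outside the browser.
function renderResults(results, store) {
    if (!results.length) {
        return 'No results found.'
    }
    let resultList = ''
    for (const result of results) {
        // Use the unique ref from the result to get the full item
        const item = store[result.ref]
        resultList += '<li><p><a href="' + item.url + '">' + item.title + '</a></p>'
        // Add a short clip of the content
        resultList += '<p>' + item.content.substring(0, 150) + '...</p></li>'
    }
    return resultList
}

// Only wire up the page in a browser where lunr.min.js has been loaded
if (typeof window !== 'undefined' && typeof lunr !== 'undefined') {
    const params = new URLSearchParams(window.location.search)
    const query = params.get('query')

    // With no query, leave the default message in the results list
    if (query) {
        const idx = lunr(function () {
            this.ref('id')
            this.field('title', { boost: 15 })
            this.field('tags')
            this.field('content', { boost: 10 })

            for (const key in window.store) {
                this.add({
                    id: key,
                    title: window.store[key].title,
                    tags: window.store[key].tags,
                    content: window.store[key].content
                })
            }
        })

        const results = idx.search(query)
        const searchResults = document.getElementById('results')
        searchResults.innerHTML = renderResults(results, window.store)
    }
}
```

Keeping the rendering in its own function is a design choice rather than a requirement; it simply makes the output easy to check without a browser.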

Make your own independent website

2021-01-16T08:41:27-05:00

The web that raised me was a digital playground in the truest sense. It was made up of HTML experiments Frankensteined together by people still figuring it all out.

The beauty of not completely knowing what you’re doing is a lack of premature judgement. Without a standard to rise to, you’re free to go sideways. Explore. Try things that don’t work, without any expectation they will work. An open world with a beginner’s mindset.

The web that raised me was a little broken. Things didn’t always display the way they were supposed to. That too is part of the beauty. It was just broken enough to make you think for yourself.

1991 was the year of the individual on the web, the first year any layperson could open a web browser and access the new hypermedia dimension. There were no go-to, search-suggested, centralized websites. There were newsgroups. You had what you made and what your meatspace contacts sent you. In 2021, I think we need a return to that level of individualism. We need to make 2021 the year of the independent web.

That’s not to say I think the massive monopolistic platforms are going anywhere. Twitter, Facebook, mainstream “news” media sites – they’re all a kind of utility now, like plumbing and electricity. They’ll find their place in regulation and history. But they are not your website.

Your website is the one you create. Where the content, top-to-bottom, is yours alone to shape and present as you please. Your website is your place of self-expression, without follower counts or statistics to game. Your website is for creation, not reaction.

It’s all yours, but it doesn’t have to seem lonely. Your site can interact with the entire online world through syndication and protocols made possible by this thing we call the Internet. See:

IndieWeb for POSSE, an abbreviation for Publish (on your) Own Site, Syndicate Elsewhere
Webmention and an easy way to implement them
twtxt instances for a decentralized timeline experience
Neofeed, my personal timeline project made for Neocities. (It’s open source and you can help me extend it!)

Your website is your beginning point. The one source of truth for your identity online, from which you can generate and distribute disposable copies to any platform you please. This is what it means to truly own your content. And on the Internet, your content is you.

This is my website. When I first created it, I did so for myself. I had no expectation of visitors. I just knew I’d rather have these thoughts and things I’ve learned here, out here, made indelible in the folds of the public Internet, instead of on some dark corner of my machine, to be lost forever once I am.

Make your own website. You’ll grow your own sense of well-deserved accomplishment and contribute to your independence on the web. You’ll learn by doing, by scratching your own itch.

Learn about web technologies. Use them as you would if you were a child holding a pencil or paintbrush for the first time. Experiment, with no expectations other than discovering what you can do to make it delight you.

These sites and articles inspired this post and helped me implement webmentions!

How to Choose a Great Tech Hire

2021-01-12T05:50:53-05:00

I’ve seen too many hiring processes that focus on the wrong things. Teams spend hours on algorithm puzzles and whiteboard exercises, then hire someone who can’t write readable code or collaborate effectively with colleagues. Six months later, they’re dealing with either a performance issue or an unexpected resignation from someone who never felt like they fit the team.

These candidates don’t lack technical ability. The problem is that traditional hiring processes don’t predict who will actually succeed and stay on your team. After years of hiring engineers and watching some thrive while others struggle, I’ve learned that the best predictors of long-term success are often the things most interviews completely miss.

Here’s what I actually look for when hiring engineers, and why these signals matter more than most technical assessments.

Look for Builders, Not Just Coders

The question that matters most isn’t “Can they solve algorithm problems?” It’s “Can they build things that solve problems?” There’s a fundamental difference between someone who can write code and someone who can deliver working software that serves a purpose.

When I review candidates, I’m looking for evidence that they’ve built complete projects from start to finish. Not just coding exercises or tutorial follow-alongs, but actual working software that solves real problems. This could be command-line utilities, web applications, automation tools, or contributions to open source projects—the complexity matters less than the completeness.

What I’m really evaluating is their ability to navigate the full software development lifecycle. Can they scope a problem, make technical decisions, handle edge cases, write documentation, and ship something that actually works? These are the skills that translate directly to success on your team, regardless of whether they learned them in a computer science program or taught themselves on weekends.

The best candidates can walk you through their projects and explain not just how they built something, but why they made specific technical choices. They understand the trade-offs they made and can articulate what they learned from the experience. This kind of thinking is what distinguishes engineers who will contribute meaningfully to your team from those who will struggle to move beyond assigned tasks.

Evaluate Systems Thinking Over Syntax Knowledge

Most technical interviews focus on whether someone knows specific syntax or can solve isolated problems. But the engineers who succeed on teams are the ones who understand how their code fits into larger systems and affects other people’s work.

I look for candidates who demonstrate awareness of follow-on effects. When they describe a project, do they consider performance implications? Do they think about maintainability? Can they explain how their technical decisions might impact other developers or users?

Understanding concepts like mutability, thread safety, and code reusability shows technical competence as well as an ability to think systematically about software as something that exists in a larger context. Engineers who grasp these concepts naturally write code that’s easier to debug, extend, and maintain. They consider the total cost of ownership, not just the immediate implementation.

During interviews, I ask candidates to explain technical trade-offs they’ve made in their projects. The specific technologies matter less than their ability to reason about complexity, performance, and maintainability. Engineers who think this way will continue learning and adapting as your company’s tech stack evolves.

Assess Communication Skills Through Real Examples

Communication skills aren’t just a “nice to have” for engineers—they’re essential for team effectiveness. But most hiring processes assess communication through artificial interview scenarios rather than looking at how candidates actually communicate about technical topics.

I spend significant time reviewing candidates’ written communication. How do they explain their projects in README files? How do they participate in open source discussions? Can they write clear, helpful documentation? These examples reveal how they’ll communicate with your team when explaining technical decisions, documenting systems, or participating in code reviews.

Pay attention to how candidates describe complex technical concepts during interviews. Can they adjust their explanation based on their audience’s technical background? Do they provide context and examples? Can they acknowledge when they don’t know something without becoming defensive?

The engineers who succeed long-term are those who can collaborate effectively across different skill levels and backgrounds. They can explain technical concepts to non-technical stakeholders, provide helpful code review feedback, and contribute to architectural discussions. These collaborative skills are often better predictors of success than pure technical ability.

Identify Team Players Through Contribution Patterns

The best predictor of how someone will behave on your team is how they’ve behaved on other teams. Rather than asking hypothetical questions about teamwork, look at concrete examples of how candidates have collaborated with others.

Open source contributions provide excellent insight into someone’s collaborative style. How do they handle feedback on their code? Do they contribute thoughtfully to discussions? Can they work within existing conventions and standards? Do they help other contributors or just focus on their own work?

For candidates without extensive open source history, look at how they talk about past team experiences. Do they credit others for successes? Can they describe situations where they helped colleagues or learned from feedback? How do they handle disagreement or conflict?

I’m particularly interested in candidates who show evidence of helping others grow. Engineers who mentor junior developers, contribute to team documentation, or improve development processes tend to have a positive impact that extends far beyond their individual contributions.

Evaluate Learning Ability Over Current Knowledge

Technology changes rapidly, which means the specific skills someone has today matter less than their ability to acquire new skills as needed. The engineers who thrive long-term are those who stay curious and adapt effectively to new challenges.

During interviews, I ask candidates about times they had to learn something completely new for a project. How did they approach unfamiliar technologies? What resources did they use? How did they validate their understanding? The process they describe reveals more about their potential than any specific technology they currently know.

I also look for evidence of intellectual humility. Can candidates acknowledge the limits of their knowledge? Do they ask thoughtful questions? Are they excited about learning from more experienced team members? Engineers who combine confidence in their abilities with openness to learning tend to grow quickly and integrate well with existing teams.

What This Means for Your Hiring Process

Identifying these qualities requires a different approach than traditional technical interviews. Instead of algorithm problems, focus on discussing real projects and technical decisions. Instead of whiteboard coding, review actual code they’ve written and ask them to explain their thinking.

Spend time on behavioral questions that reveal collaborative patterns and learning ability. Make time for informal conversation about what kind of work environment they thrive in and what they’re excited to learn next.

Most importantly, involve your team in the hiring process. The people who will work directly with your new hire are often better at assessing team fit than individual interviewers making isolated decisions.

Remember that hiring is ultimately about predicting future success, not just evaluating current abilities. The candidates who can build complete projects, think systematically about technical decisions, communicate effectively, and continue learning will contribute more to your team’s long-term success than those who simply perform well on coding tests.

Your perfect candidate isn’t necessarily the most technically skilled or the most knowledgeable about your domain. It’s the person who will grow with your team and contribute to the kind of collaborative, effective engineering culture that retains great people and delivers great software.

How to become a software developer

2021-01-05T04:50:07-06:00

As a Director of Engineering, I’m a software developer who hires and leads other software developers. It’s not surprising then that I get asked this question a lot, in various forms:

How do I become a software developer?
What language or framework should I learn first?
Where do I start?

While I’m certain there’s no one right answer for everyone, I’m also certain that the world needs more software developers and systems thinkers.

The best thing I can do to help you lead yourself, learn to code, and become a software developer is to share the most efficient parts of how I did it myself. This is the article I wish I had read when I started coding.

Depth matters

Software is exceedingly complex. Like a good novel that you wish you’d never finish reading, there’s always more to discover and learn. If you don’t want to miss the best parts, don’t be satisfied with surface-level explanations. Always go deeper! Ask why, why, and why again until you get to the fundamentals. Soon enough, you’ll start to see patterns.

By digging deeper, you’ll begin to understand the fundamentals of how things connect, what makes things “fast,” and facets of software operation that you probably can’t even imagine exist. It’s like peeking behind the curtain and seeing a whole world of systems and processes that most people are never aware of.

Going in-depth can expand your mind and your capacity for learning. Keep asking why. Follow every link. Let your curiosity guide you.

Hard stuff matters

Giving yourself the chance to be delighted through discovery doesn’t come for free. It takes a lot of hard work to read and compress complicated ideas into your meat brain.

It’s important not to gloss over the hard stuff. In fact, if something seems too hard to understand, you might benefit from doing it first. You might have to get creative to find ways to explain things to yourself, but when you succeed, it makes everything else easier later on.

Analogies are helpful for understanding hard concepts, but they’ll only help you start to understand concepts at a surface level. Remember to go in-depth. Don’t stop at the analogy.

Writing matters

Write right away. Create a habit of explaining everything you learn to yourself in long-form writing. Better than bullet points, writing with a conversational tone engages parts of your brain that help you to process and remember new information. It’s why humans like and remember stories, and it’s a superpower you get for free.

Start by writing for yourself. Write about what interests you. Try something new, even if it seems rudimentary, and write in-depth about what you learn. (One of my most popular posts is about iteration in Python. When I first wrote it, I considered myself a complete beginner.)

If you want to go a step further, share your writing with the world! Learn in public, like I do by writing on this site. I often get questions like, “how do I choose a theme for my blog?” or “what platform should I use?” or “what popular language/framework/topic should I focus on?” My answer is: don’t worry about it.

Don’t fret too much about your blog theme or platform. Pick the easiest option for you to get started with for now. All of that will change and improve as you learn, practice, and find your focus. Just start writing, ideally, yesterday.

Write for yourself by explaining what you’re doing, as if it were past-you teaching future-you — because it is. You will be your first reader, and the first judge of how useful your blog can be. Seek to impress yourself!

The language, framework, or version doesn’t matter

Why pigeonhole your abilities before you even start? Pick any software language, framework, or technology that seems to make sense to you when you first read it. Start there.

Remember that it’s important to dig deep and understand the fundamentals. Basic concepts of software transcend languages. Whichever first language you choose, understand functions, variables, return values, iteration, and how immutability works. You’ll find that learning these concepts will make it easier to recognize them in your second language, and learn that too.

Your portfolio doesn’t matter

If your first objective is to build a portfolio, you may be trying to run before you walk. Building a portfolio to showcase to potential employers is a great goal, but a terrible first step.

If you think of creating a polished portfolio as a first step, you’re liable to spend too much time making it pretty and presentable before focusing on the content. As someone who hires software developers, I can tell you wholeheartedly that I’d rather see clean and well-written code than a flashy front page.

Don’t confuse building a portfolio with building projects. Absolutely build projects, right from the beginning. There’s no better way to see the practical application of what you’re learning. Just treat them as first drafts, as training ground, and don’t worry about packaging them up for professional consumption.

By allowing yourself to build some draft projects first, you allow yourself the breathing room to learn from them. Focus on iteration, on making one small thing better each time, and you’ll build a portfolio without even realizing it.

Focus on what matters

Don’t follow this advice blindly; rather, incorporate it into your own systems. Experiment, make it work better than when you found it, then pay it forward by writing down what you’ve learned for someone else to read!

Here are my favorite books for reading or listening to if you want to cultivate a learning mindset. See non-coding books for coders.

If this article benefits you in some way, I encourage you to write about it! The process of learning how to learn is never finished. You can be the next iteration.

Be brave and build in public

2020-12-24T04:57:31-06:00

I used to think that when I wanted to make updates to a project, I ought to hold back and do a big re-launch with all the changes at once. I figured that seeing big changes would feel like Christmas!

Unfortunately, Christmas only comes once a year. After years in the tech and cybersecurity world, my perspective has changed.

I’ve found that people, including myself, value receiving small, constant, incremental improvements far more than big changes once or a few times a year. It makes sense if you think about it. The former constantly delights in small, unexpected ways that make the user experience better. The latter is invisible, except for a few times a year.

There are occasions when big changes make sense. Say you’re re-launching functionality, or coinciding with an event that deserves all the fanfare of an unveiling.

Other than that, and for most of us, holding back doesn’t serve us at all. It may even come from something far more insidious: fear of judgement.

Being brave

Thinking such as “I’ll show it to the world when it’s ready,” always leaves out the most important detail. What does “ready” mean?

If you haven’t written down your definition of “ready,” consider that you may be holding back for no good reason. What’s the worst that could happen, anyway, if you make your work public when it’s less than perfect?

I decided to find out when I started to build in public. Instead of holding back work, I released a first version as soon as it functioned as intended. I leaned on v0.0.* tags as a way to say, “This is available, but still in progress.” Or, I’d say so outright, in the README.

In the world of open source, building in public can be scary stuff. It feels like making yourself vulnerable. It’s opening up part of yourself, a creative part, for scrutiny and nitpicking – by strangers. Of course it’s not comfortable.

Once I overcame the discomfort, once I decided to be brave and appreciate even possibly negative feedback, something amazing happened.

I suddenly had help.

Yes, there was scrutiny and nitpicking – but I don’t think any of it was ill-intentioned. I found that there existed whole communities of people who wanted to help me build a project that they thought was interesting. In some cases, I was utterly amazed when people submitted pull requests for issues I’d opened on my own projects describing enhancements I’d like to have.

I’ve been fortunate to have wonderful experiences with open source so far. Based on these experiences, I’d like to share with you what I’ve discovered to be the most effective ways to be brave and generous when it comes to open source.

When unprompted strangers submit helpful comments, issues, and pull requests on your projects, it feels like Christmas. You can give the gift of helpfulness in your contributions as well.

Treat comments like a face-to-face conversation. Greet the person you’re addressing. Use full sentences. Think about whether what you’re writing will make someone’s day better or worse, and be nice.

When writing issues, include as much technical detail as possible. Screenshots, console logs, screenshots of console logs, your operating system, browser, screen resolution – all these can help maintainers quickly diagnose a root cause.

Pull requests are the best presents ever. They make maintainers happy when well done, and it’s a gift that gives back when your contribution gets merged! 🎉 Give your PR the best chance of getting accepted by looking for and following any project contribution guidelines.

Recognize the human

Our brains are slightly lacking in an evolutionary sense when it comes to interacting with other humans through tiny screens. It can be easy to forget that the actions you put out there will eventually reach one or more other people.

You can help to maintain a great open source community by remembering the humans that make it exist. When commenting, take the time to do it well (see below) and recognize the time that someone else has put in. When closing a thread or merging a contribution, remember to say thank you to the people who pitched in to help. I try to use first names instead of screen names, whenever possible.

You can build personal relationships, too. If you’re a project maintainer, you may choose to give people a way to contact you directly to ask questions or hash out complicated plans. Establishing one-to-one communications with regular contributors is also a great way to build a community around your project.

Recognizing the humans behind the open source community is a simple and meaningful way to give back.

Don’t rush

The vast majority of open source participants are volunteers, which means they don’t get paid for the time they spend building up projects. That sometimes means that other work takes priority. It’s okay if this describes you, too.

It’s important to remember that in most cases, a well-done contribution later is preferred over a half-done contribution sooner. If you’re too short on time now to write a thoughtful comment – don’t! Either draft a quick note and set it aside for later, or comment something along the lines of:

Hi there! Just wanted to let you know that I’ve seen this and I plan to help! I’ll respond in full as soon as I have the time to write a thoughtful comment.

Showing that you think a comment is worth the time to do well is something that open source contributors and repository maintainers both appreciate.

Build generously

When open source participants act with conscientiousness, every day feels like Christmas. Regardless of your type of contribution, you can help build this generous global community year-round.

The humans of open source, by self-selection, mostly consist of good people who want to help. If you build openly, share feedback generously, and try to do good in general, I think you fit in here.

I hope you have a very happy holiday season and give many gifts that keep on giving!

So you're the family tech support

2020-12-21T08:42:24-05:00

🎄🌟 Happy holidays! 🌟🎄

For those of you seeing relatives this season, chances are that you’re the designated family tech support. If part of your time home for the holidays is spent on software updates and troubleshooting WiFi, here are a few other quick wins to help boost your family’s online privacy and security.

1. Set up a VPN

Using a VPN is Online Safety 101. Choose a reputable provider with a strict no-logging policy, or if you’re up for it, roll your own.

2. Introduce a password manager

If your family member uses the same password everywhere (+, same as last year) because passwords are hard to remember, introduce them to their new best friend, 1Password. Help your family get set up with secure passwords they don’t have to write down on Post-It notes – just one master pass(phrase) is all you need.

When choosing a passphrase, avoid using information easily found on social media accounts, like pet names, favorite sports teams, favorite brands, or birthdays.

3. Switch to DuckDuckGo

Help fight the Internet search monopoly by getting your family to use a search engine that respects their privacy. Go to your browser Settings and set your Default Search Engine (that uses the URL bar) to DuckDuckGo. Break the ice with an instant answer feature, like searching “calendar” so you can count down to Christmas.

(You might want to search for “classic cocktails cheat sheet” after all this.)

4. Install a better browser and blocker

While I prefer a Pi-hole, setting one up can be complex. Instead, help set up a privacy-preserving browser like Firefox or a wide-spectrum blocking extension like uBlock Origin (GitHub source).

Your family will get faster page load times, fewer advertisements interrupting articles and videos, and fewer sneaky trackers leaking browsing habits to big tech, all with near-zero maintenance.

Be a home-for-the-holidays hero!

Help improve your family’s security posture this holiday season. A little beefed-up cybersecurity may be one of the best gifts you can give!

I’m keeping it short-and-sweet this week. My annual Christmas post drops on December 24, full of warm fuzzy goodness and a tech tip or two. Thank you for being a subscriber – stay tuned!

How to Write Good Documentation

2020-12-14T04:53:10-05:00

If you’ve ever half-written a software project before taking a few days off, this is the article you’ll discover you needed when you reopen that IDE.

In the technology teams I lead, we make a constant effort to document all the things. Documentation lives alongside the code as an equal player. This helps ensure that no one needs to make assumptions about how something works, or call lengthy meetings to gain working knowledge of a feature. Good documentation saves us a lot of time and hassle.

That said, and contrary to popular belief, the most valuable software documentation is not primarily written for other people. As I said in this well-received tweet:

The secret to good documentation is to write it while you're writing the code. You are your first audience. Explain what you're doing to yourself. Future you will thank you!
— Victoria Drake November 24, 2020

Here are three concrete steps you can take to write good documentation before it’s too late.

1. Start with accurate notes

As you work out ideas in code, ensure you don’t soon forget important details by starting with accurate notes. While you will want to explain things to yourself in long-form later, short-form notes will suffice to capture details without interrupting your coding session flow.

Don’t rely on inline comments that often fail to make sense once you’ve forgotten the context. Keep a document open alongside your code and write down things like commands, decisions, and sources you use. This can include:

Prompts or shell commands you used
Why you chose a particular method over another
Links you visited for help or *cough* copy-paste *cough* inspiration
The order in which you did things

Don’t worry about full sentences at this point. Just ensure you accurately capture context, relevant code snippets, and helpful URLs. It can also be helpful to turn on any auto-save option available.

2. Explain decisions in long form

The ideal time to tackle this step is when you take a break from coding, but before you completely go out to lunch on whatever it is you’re working on at the moment. You want to ensure that context, ideas, and decisions are all still fresh in your mind when you explain them to yourself.

Go over the short-form notes you took and start expanding them into conversational writing. Be your own rubber duck. Describe what you’re doing as if you were teaching it to someone else. You might cover topics such as:

Quirky-looking decisions: “I would normally do it this way, but I chose to do something different because…”
Challenges you ran into and how you overcame them
Architectural decisions that support your project goals

Stick to the main points. Long-form writing doesn’t mean you’ll be paid by the word! Just use full sentences, and write as if explaining your project to a colleague. You’re explaining to future you, after all.

3. Don’t neglect prerequisite knowledge

This step is best done after a long lunch break, or even the next day (but probably not two). Re-read your document and fill in any blanks that become apparent after putting some distance between yourself and the project.

Take extra care to fill in or at least link to prerequisite knowledge, especially if you frequently use different languages or tools. Even an action as small as pasting in a link to the API documentation you used can save hours of future searching.

Write down or link to READMEs, installation steps, and relevant support issues. For frequently performed command-line actions, you can use a self-documenting Makefile to avoid having to man common tasks each time you come back to a project.

It’s easy to forget supporting details after even just a short break from your project. Capture anything you found helpful this time around.

Document all the things

The next time you catch yourself thinking, “I’m sure I’ll remember this part, no need to write it down,” just recall this emoji: 🤦‍♀️

Software projects are made up of a lot more than just their code. To best set up your future self for success, document all the things! Whether it’s a process you’ve established, Infrastructure as Code, or a fleeting future roadmap idea — write it down! Future you will thank you for it.

If you enjoyed this post, there’s a lot more where that came from! I write about developer ergonomics for high-performing teams and building beautiful, maintainable software in the age of AI. You can subscribe below to see new posts first.

Do One Thing: Mastering Prioritization for High-Performing Teams

2020-12-07T15:01:25-06:00

In the engineering teams I lead, “priority” has no plural form. This drives some people slightly crazy, especially those who like to hedge their bets with phrases like “top priorities” or “critical priorities.” But I’ve learned that the moment you allow multiple top priorities, you’ve essentially created zero priorities.

I discovered this the hard way while working with a team that was constantly context-switching between “urgent” projects. Everyone was busy, morale was decent, but we weren’t actually shipping much of value. During one particularly frustrating week, I counted seventeen different tasks that had been labeled as “high priority” by various stakeholders. Our standups felt like disaster reports, and I realized we’d created a system where being busy had become more important than being effective.

The solution turned out to be surprisingly simple, though not easy to implement: put everything into a single, ordered list where only one thing can be most important at any given time.

The Radical Transparency of a Central List

Most teams I’ve encountered operate like a collection of individual to-do lists with some coordination meetings sprinkled on top. Engineering works on technical debt, product pushes for new features, leadership wants infrastructure improvements, and everyone optimizes their own piece of the puzzle. The result is a lot of activity that doesn’t add up to meaningful progress.

A single, centralized, prioritized list changes the entire dynamic. Everyone can see what’s actually being worked on, what’s coming next, and most importantly, what’s not getting done and why. This visibility creates natural conversations about trade-offs that simply don’t happen when work is siloed.

I’ve watched teams discover they were working on competing solutions to the same problem, simply because no one had a complete view of active work. Others realized they were delaying important projects because someone assumed “someone else” was handling the dependency. When everything is visible and ordered, these coordination problems become obvious and fixable.

The transparency also creates a different kind of accountability. When priorities are public and explicit, it becomes much harder to justify working on pet projects or avoiding difficult tasks. The list becomes a shared source of truth that guides decisions rather than each person interpreting priorities through their own lens.

Autonomy Within Structure

One concern I hear frequently is that a single priority list will turn people into order-takers rather than creative problem-solvers. In practice, I’ve found exactly the opposite happens when you implement it correctly.

The key is encouraging people to choose the highest-priority task they can effectively tackle rather than assigning specific tasks to specific people. Someone might skip over the absolute top item because it requires domain knowledge they don’t have, but they can pick up the second or third item that lets them contribute meaningfully while learning something new.

This approach leverages the fact that your team members understand their own capabilities and growth goals better than you do. A senior engineer might choose to mentor a junior developer on a complex task. A frontend specialist might want to tackle a backend task to broaden their skills. These decisions create better outcomes in the long term than top-down task assignment while still maintaining focus on organizational priorities.

The autonomy comes from trusting people to make good decisions about how to contribute most effectively, while the structure comes from ensuring those contributions align with actual business needs.

The Art of Making Yourself Redundant

If your team frequently asks you what they should work on next, you’ve accidentally created a bottleneck—and it’s you. This is one of the most common scaling problems I see with engineering leaders who transition from individual contributor roles.

The goal is building a system where intelligent people can make good decisions without constant input from leadership. This requires making context painfully available—team goals, product strategy, architectural decisions, customer feedback, and anything else that influences prioritization should be accessible and current.

I’ve found that the difference between teams that scale smoothly and teams that hit velocity walls usually comes down to how well they’ve documented the reasoning behind decisions. When someone can understand not just what to build but why it matters and how it fits into the larger strategy, they can make smart trade-offs independently.

This redundancy becomes especially critical during high-pressure situations. When systems are down or deadlines are looming, you don’t want your team waiting for permission to take action. Teams that have practiced autonomous decision-making within clear constraints can respond quickly and effectively without requiring heroic coordination efforts.

The Cultural Transformation

What surprises most leaders is how much this simple change affects team culture. When priorities are clear and transparent, several things happen that go far beyond improved task management.

First, political conversations about priority disappear. There’s no point in lobbying for your favorite project when the criteria for prioritization are explicit and the current order is visible to everyone. Energy that was spent on organizational maneuvering gets redirected toward actual work.

Second, people start thinking about their contributions differently. Instead of optimizing for individual productivity, they begin considering how their work fits into team objectives. This naturally leads to better collaboration and knowledge sharing.

Third, the team develops a shared sense of progress and momentum. When everyone can see important work getting completed in priority order, it creates a satisfying rhythm that isolated individual achievements can’t match.

Implementation Reality

The biggest challenge isn’t creating the list—it’s maintaining the discipline to use it consistently. Teams often start strong but gradually drift back to multiple priority tracks when pressure increases or when compelling new opportunities arise.

I’ve learned to treat priority discipline like any other technical practice that requires ongoing attention. That means scheduling regular review sessions to reorder the list, having explicit discussions about what we’re choosing not to do, and consistently communicating why keeping a single-priority focus helps maintain development velocity.

The payoff: teams that ship more valuable work with less stress and confusion. When everyone understands what matters most and feels empowered to contribute effectively, both productivity and job satisfaction improve dramatically.

Most importantly, single-priority focus creates sustainable high performance rather than the boom-and-bust cycles that come from constantly shifting between competing urgent demands. Teams learn to work steadily toward important goals rather than reacting to whatever feels most pressing in the moment.

OWASP Web Security Testing Guide v4.2 released

2020-12-03T16:02:33-06:00

I’m very happy and proud to share that the Open Web Application Security Project (OWASP) Web Security Testing Guide v4.2 is now available! This update is the result of a lot of hard work by the repository team and many dedicated contributors. With a team like this, I’m honored to be a core maintainer and co-author.

Here’s a reprint of the announcement I wrote for owasp.org. If you’re interested in security testing for web applications and APIs, this is an update you’ll definitely want to check out!

You can become a contributor yourself by joining us on GitHub!

Web Security Testing Guide v4.2 Released

Thursday, December 3, 2020

The OWASP Web Security Testing Guide team is proud to announce version 4.2 of the Web Security Testing Guide (WSTG)! In keeping with a continuous delivery mindset, this new minor version adds content as well as improves the existing tests.

In recent years, the Web Security Testing Guide has sought to remain your foremost open source resource for web application testing. Our previous release marked a move from a cumbersome wiki platform to the highly collaborative world of GitHub. Since then, over 61 new contributors pushing over 600 commits have helped to make the WSTG better than ever.

Version 4.2 of the Web Security Testing Guide introduces new testing scenarios, updates existing chapters, and offers an improved reading experience with a clearer writing style and chapter layout. Readers will enjoy easier navigation and consistent testing instructions.

With new improvements to our development workflow, new contributors will find it easier than ever to help build future versions of the WSTG. A clear and concise contributor’s guide and style guide can help you write new tests or ensure existing scenarios stay current. Core maintainers Rick Mitchell, Elie Saad, Rejah Rehim, and Victoria Drake have implemented modern processes like continuous integration with GitHub Actions. New workflows help to build PDFs and make reviewing new additions and updates easier.

We couldn’t be happier to share this new version with you, and we don’t plan to slow down anytime soon. The dedicated volunteers who’ve made this release possible are already hard at work on the next major version of the WSTG. Come join us and become a contributor!

You can read the Web Security Testing Guide v4.2 online or download a PDF on our project page. We greatly appreciate all the authors, editors, reviewers, and readers who make this open source security endeavor worthwhile.

Thank you for being a part of the WSTG!

What is TCP/IP? Layers and protocols explained

2020-11-29T04:01:22-04:00

A significant part of the process of creation is the ability to imagine things that do not yet exist. This skill was instrumental to the creation of the Internet. If no one had imagined the underlying technology that most now take for granted every day, there would be no cat memes.

To make the Internet possible, two things that needed imagining are layers and protocols. Layers are conceptual divides that group similar functions together. The word “protocol” means “the way we’ve agreed to do things around here,” more or less. In short, both layers and protocols can be explained to a five-year-old as “ideas that people agreed sounded good, and then they wrote them down so that other people could do things with the same ideas.”

The Internet Protocol Suite is described in terms of layers and protocols. Collectively, the suite refers to the communication protocols that enable our endless scrolling. It’s often called by its foundational protocols: the Transmission Control Protocol (TCP) and the Internet Protocol (IP). Lumped together as TCP/IP, these protocols describe how data on the Internet is packaged, addressed, sent, and received.

Here’s why the Internet Protocol Suite, or TCP/IP, is an imaginary rainbow layer cake.

Layers are imaginary

If you consider the general nature of a rainbow layer sponge cake, it’s mostly made up of soft, melt-in-your-mouth vanilla-y goodness. This goodness is itself composed of something along the lines of eggs, butter, flour, and sweetener.

There isn’t much to distinguish one layer of a rainbow sponge cake from another. Often, the only difference between layers is the food-coloring and a bit of frosting. When you think about it, it’s all cake from top to bottom. The rainbow layers are only there because the baker thought they ought to be.

Similar to cake ingredients, layers in the context of computer networking are mostly composed of protocols, algorithms, and configurations, with some data sprinkled in. It can be easier to talk about computer networking if its many functions are split up into groups, so certain people came up with descriptions of layers, which we call network models. TCP/IP is just one network model among others. In this sense, layers are concepts, not things.

Some of the people in question are part of the Internet Engineering Task Force (IETF). They created the RFC-1122 publication, discussing the Internet’s communications layers. Half of a whole, the standard:

…covers the communications protocol layers: link layer, IP layer, and transport layer; its companion RFC-1123 covers the application and support protocols.

The layers described by RFC-1122 and RFC-1123 each encapsulate protocols that satisfy the layer’s functionality. Let’s look at each of these communications layers and see how TCP and IP stack up in this model of the Internet layer cake.

Link layer protocols

The link layer is the most basic, or lowest-level, classification of communication protocol. It deals with sending information between hosts on the same local network, and translating data from the higher layers to the physical layer. Protocols in the link layer describe how data interacts with the transmission medium, such as electronic signals sent over specific hardware. Unlike other layers, link layer protocols are dependent on the hardware being used.

Internet layer protocols

Protocols in the Internet layer describe how data is sent and received over the Internet. The process involves packaging data into packets, addressing and transmitting packets, and receiving incoming packets of data.

The most widely known protocol in this layer gives TCP/IP its last two letters. IP is a connectionless protocol, meaning that it provides no guarantee that packets are sent or received in the right order, along the same path, or even in their entirety. Reliability is handled by other protocols in the suite, such as in the transport layer.

There are currently two versions of IP in use: IPv4 and IPv6. Both versions describe how devices on the Internet are assigned IP addresses, which are used when navigating to cat memes. IPv4 is more widely used, but has only 32 bits for addressing, allowing for about 4.3 billion (ca. 4.3×10⁹) possible addresses. These are running out, and IPv4 will eventually suffer from address exhaustion as more and more people use more devices on the Internet.

The successor version IPv6 aims to solve address exhaustion by using 128 bits for addresses. This provides, um, a lot more address possibilities (ca. 3.4×10³⁸).
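For the curious, both figures follow directly from the number of address bits. Each additional bit doubles the number of possible values, so:

```latex
\begin{align*}
\text{IPv4: } 2^{32} &= 4{,}294{,}967{,}296 \approx 4.3 \times 10^{9} \\
\text{IPv6: } 2^{128} &\approx 3.4 \times 10^{38} \\
\frac{2^{128}}{2^{32}} &= 2^{96} \approx 7.9 \times 10^{28}
\end{align*}
```

In other words, IPv6 has room for roughly 7.9×10²⁸ addresses for every single address IPv4 can offer.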

Transport layer protocols

In May 1974, Vint Cerf and Bob Kahn (collectively often called “the fathers of the Internet”) published a paper entitled A Protocol for Packet Network Intercommunication. This paper contained the first description of a Transmission Control Program, a concept encompassing what would eventually be known as the Transmission Control Protocol (TCP) and User Datagram Protocol (UDP). (I had the pleasure of meeting Vint and can personally confirm that yes, he does look exactly like The Architect in the Matrix movies.)

The transport layer presently encapsulates TCP and UDP. Like IP, UDP is connectionless and can be used to prioritize time over reliability. TCP, on the other hand, is a connection-oriented transport layer protocol that prioritizes reliability over latency, or time. TCP describes how to deliver data in the same order as it was sent, retransmit lost packets, and control the rate of data transmission.

Application layer protocols

The application layer describes the protocols that software applications interact with most often. The specification includes descriptions of the remote login protocol Telnet, the File Transfer Protocol (FTP), and the Simple Mail Transfer Protocol (SMTP).

Also included in the application layer are the Hypertext Transfer Protocol (HTTP) and its secure extension, Hypertext Transfer Protocol Secure (HTTPS). HTTPS is secured by Transport Layer Security, or TLS, which can be said to be the top-most layer of the networking model described by the Internet protocol suite. If you’d like to further understand TLS and how this protocol secures your cat meme viewing, I invite you to read my article about TLS and cryptography.

The Internet cake is still baking

Like a still-rising sponge cake, descriptions of layers, better protocols, and new models are being developed every day. The Internet, or whatever it will become in the future, is still in the process of being imagined.

If you enjoyed learning from this post, there’s a lot more where this came from! I write about computing, cybersecurity, and building great technical teams. You can subscribe below to see new posts first.

Responsive pages and color themes with minimal CSS

2020-11-17T06:04:58-05:00

Hello, do come in! If you’re reading this on my website, you may notice I’ve spruced up a bit. Victoria.dev can now better respond to your devices and preferences!

Most modern devices and web browsers allow users to choose either a light or dark theme for the user interface. With CSS media queries, you can have your own website’s styles change to match this user setting!

Media queries are also a common way to have elements on web pages change to suit different screen sizes. This is an especially powerful tool when combined with custom properties set on the root element.

Here’s how to use CSS media queries and custom properties to improve your visitor’s browsing experience with just a few lines of CSS.

Catering to color preferences

The prefers-color-scheme media feature can be queried to serve up your user’s color scheme of choice. The light option is the default when the user has set no active preference, and the media feature has decent support across modern browsers.

Additionally, users reading on certain devices can also set light and dark color themes based on a schedule. For example, my phone uses light colors throughout its UI during the daytime, and dark colors at night. You can make your website follow suit!

Avoid repeating a lot of CSS by setting custom properties for your color themes on your :root pseudo-class. You can specify the themes available with the color-scheme property (currently part of a draft specification, but I like to write my articles to age well). Create a version for each theme you wish to support. Here’s a quick example you can build on:

:root {
    color-scheme: light dark;
}

@media (prefers-color-scheme: light) {
    :root {
        --text-primary: #24292e;
        --background: white;
        --shadow: rgba(0, 0, 0, 0.15) 0px 2px 5px 0px;
    }
}

@media (prefers-color-scheme: dark) {
    :root {
        --text-primary: white;
        --background: #24292e;
        --shadow: rgba(0, 0, 0, 0.35) 0px 2px 5px 0px;
    }
}

As you can see, you can use custom properties to set all kinds of values. To use these as variables in other CSS rules, use the var() function:

header {
    color: var(--text-primary);
    background-color: var(--background);
    box-shadow: var(--shadow);
}

In this quick example, the header element will now display your user’s preferred colors according to their browser settings!
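One caveat worth sketching: a browser that doesn’t understand prefers-color-scheme matches neither media query, so the custom properties above would never be set there. A minimal way around this, assuming you’re happy with light as the fallback, is to declare the light values on :root unconditionally and override them only for dark. The values are the same ones used above:

```css
/* Default (light) theme: applies everywhere, including browsers
   that don't support prefers-color-scheme */
:root {
    color-scheme: light dark;
    --text-primary: #24292e;
    --background: white;
    --shadow: rgba(0, 0, 0, 0.15) 0px 2px 5px 0px;
}

/* Override only when the user asks for dark */
@media (prefers-color-scheme: dark) {
    :root {
        --text-primary: white;
        --background: #24292e;
        --shadow: rgba(0, 0, 0, 0.35) 0px 2px 5px 0px;
    }
}
```

This variation also drops the separate light query, which leaves one less block to keep in sync.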

Preferred color schemes are set by the user in different ways, depending on the browser. Here are a couple of examples.

Firefox

You can test out light and dark modes in Firefox by typing about:config into the address bar. Accept the warning if it pops up, then type ui.systemUsesDarkTheme into the search.

Choose a Number value for the setting, then input a 1 for dark or 0 for light.

Brave

If you’re using Brave, find color theme settings in Settings > Appearance > Brave colors.

Variable scaling

You can also use a custom property to effortlessly adjust the size of text or other elements depending on your user’s screen size. The width media feature tests the width of the viewport. While width: _px will match an exact size, you can also use the min- and max- prefixes to create ranges.

Query with min-width: _px to match any viewport at least _ pixels wide, and max-width: _px to match any viewport up to _ pixels wide.
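To target a band of widths, the two can be combined with the and keyword. Here’s a small sketch; the 768px and 1023px breakpoints and the body rule inside are arbitrary, illustrative choices:

```css
/* Matches viewports from 768px up to and including 1023px wide */
@media (min-width: 768px) and (max-width: 1023px) {
    body {
        /* Illustrative only: keep the content column narrow on tablets */
        max-width: 720px;
        margin: 0 auto;
    }
}
```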

Use these queries to set a custom property on the :root to create a ratio:

@media (min-width: 360px) {
    :root {
        --scale: 0.8;
    }
}

@media (min-width: 768px) {
    :root {
        --scale: 1;
    }
}

@media (min-width: 1024px) {
    :root {
        --scale: 1.2;
    }
}

Then make an element responsive by using the calc() function. Here are a few examples:

h1 {
    font-size: calc(42px * var(--scale));
}

h2 {
    font-size: calc(26px * var(--scale));
}

img {
    width: calc(200px * var(--scale));
}

In this example, multiplying an initial value by your --scale custom property allows the size of headings and images to magically adjust to your user’s device width.
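To make the arithmetic concrete, here is a minimal sketch in Python that mimics how the three min-width queries above resolve. The function names are hypothetical. It assumes the rules appear in the order shown, so the last matching rule wins, and it returns None below 360px because none of the queries set --scale there.

```python
# Breakpoints copied from the media queries above: (min-width in px, --scale value).
BREAKPOINTS = [(360, 0.8), (768, 1.0), (1024, 1.2)]


def scale_for(width_px):
    """Return the --scale that applies at a viewport width, or None if no query matches."""
    scale = None
    for min_width, value in BREAKPOINTS:
        if width_px >= min_width:
            # Later rules override earlier ones, like the CSS cascade.
            scale = value
    return scale


def scaled_size(base_px, width_px):
    """Mimic calc(base * var(--scale)) for a given viewport width."""
    scale = scale_for(width_px)
    return None if scale is None else base_px * scale
```

For a 42px heading, scaled_size(42, 800) gives 42 because the 768px rule applies, while a viewport narrower than 360px has no --scale at all, so you may want a default on :root.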

The relative unit rem will have a similar effect. You can use it to define sizes for elements relative to the font size declared at the root element.

h1 {
    font-size: calc(5rem * var(--scale));
}

h2 {
    font-size: calc(1.5rem * var(--scale));
}

p {
    font-size: calc(1rem * var(--scale));
}

Of course, you can also multiply two custom properties. For example, setting the --max-img as a custom property on the :root can help to save you time later on by not having to update a pixel value in multiple places:

img {
    max-width: calc(var(--max-img) * var(--scale));
}

Raise your responsiveness game

Try out these easy wins for a website that caters to your visitor’s devices and preferences. I’ve put them to good use now on victoria.dev. I invite you to let me know how you like it!

Build your own serverless subscriber list with Go and AWS

2020-11-10T04:52:50-05:00

You can now subscribe to my email list on victoria.dev! Here’s how I lovingly built a subscription sign up flow with email confirmation that doesn’t suck. You can too.

If you’re interested in managing your own mailing list or newsletter, you can set up Simple Subscribe on your own AWS resources to collect email addresses. This open source API is written in Go, and runs on AWS Lambda. Visitors to your site can sign up to your list, which is stored in a DynamoDB table, ready to be queried or exported at your leisure.

When someone signs up, they’ll receive an email asking them to confirm their subscription. This is sometimes called “double opt-in,” although I prefer the term “verified.” Simple Subscribe works on serverless infrastructure and uses an AWS Lambda to handle subscription, confirmation, and unsubscribe requests.

You can find the Simple Subscribe project, with its fully open-source code, on GitHub. I encourage you to pull up the code and follow along! In this post I’ll share each build step, the thought process behind the API’s single-responsibility functions, and security considerations for an AWS project like this one.

Building a verified subscription flow

A non-verified email sign up process is straightforward. Someone puts their email into a box on your website, then that email goes into your database. However, if I’ve taught you anything about not trusting user input, the very idea of a non-verified sign up process should raise your hackles. Spam may be great when fried in a sandwich, but no fun when it’s running up your AWS bill.

While you can use a strategy like a CAPTCHA or puzzle for is-it-a-human verification, these can create enough friction to turn away your potential subscribers. Instead, a confirmation email can help to ensure both address correctness and user sentience.

To build a subscription flow with email confirmation, create single-responsibility functions that satisfy each logical step. Those are:

Accept an email address and record it.
Generate a token associated with that email address and record it.
Send a confirmation email to that email address with the token.
Accept a verification request that has both the email address and token.

To achieve each of these goals, Simple Subscribe uses the official AWS SDK for Go to interact with DynamoDB and SES.

At each stage, consider what the data looks like and how you store it. This can help to handle conundrums like, “What happens if someone tries to subscribe twice?” or even threat-modeling such as, “What if someone subscribes with an email they don’t own?”

Ready? Let’s break down each step and see how the magic happens.

Subscribing

The subscription process begins with a humble web form, like the one on my site’s main page. A form input with attributes type="email" required helps with validation, thanks to the browser. When submitted, the form sends a GET request to the Simple Subscribe subscription endpoint.

Simple Subscribe receives a GET request to this endpoint with a query string containing the intended subscriber’s email. It then generates an id value and adds both email and id to your DynamoDB table.

The table item now looks like:

email	confirm	id	timestamp
`subscriber@example.com`	false	`uuid-xxxxx`	2020-11-01 00:27:39

The confirm column, which holds a boolean, indicates that the item is a subscription request that has not yet been confirmed. To verify an email address in the database, you’ll need to find the correct item and change confirm to true.

As you work with your data, consider the goal of each manipulation and how you might compare an incoming request to existing data.

For example, if someone made a subsequent subscription request for the same email address, how would you handle it? You might say, “Create a new line item with a new id.” However, this might not be the best strategy when your serverless application database is paid for by request volume.

Since DynamoDB Pricing depends on how much data you read and write to your tables, it’s advantageous to avoid piling on excess data.

With that in mind, it would be prudent to handle subscription requests for the same email by performing an update instead of adding a new line. Simple Subscribe actually uses the same function to either add or update a database item. This is typically referred to as, “update or insert.”

In a database like SQLite this is accomplished with the UPSERT syntax. In the case of DynamoDB, you use an update operation. For the Go SDK, its syntax is UpdateItem.

When a duplicate subscription request is received, the database item is matched on the email only. If an existing line item is found, its id and timestamp are overridden, which updates the existing database record and avoids flooding your table with duplicate requests.
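The “update or insert” idea can be sketched without any AWS calls. This is not Simple Subscribe’s Go code. It’s a minimal Python illustration that uses a plain dict keyed on email as a hypothetical stand-in for the DynamoDB table, with a made-up function name.

```python
import uuid
from datetime import datetime, timezone


def upsert_subscription(table, email, now=None):
    """Insert a pending item for email, or refresh id and timestamp if one exists."""
    now = now or datetime.now(timezone.utc)
    # Match on the email only; create a pending item if none exists yet.
    item = table.setdefault(email, {"email": email, "confirm": False})
    # Override the id and timestamp whether the item is new or existing.
    item["id"] = str(uuid.uuid4())
    item["timestamp"] = now
    return item
```

Calling this twice with the same address leaves one item in the table with a fresh id, rather than two line items.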

Verifying email addresses

After submitting the form, the intended subscriber then receives an email from SES containing a link. This link is built using the email and id from the table, and takes the format:

/?email=subscriber@example.com&id=uuid-xxxxx

In this setup, the id is a UUID that acts as a secret token. It provides an identifier that you can match that is sufficiently complex and hard to guess. This approach deters people from subscribing with email addresses they don’t control.

Visiting the link sends a request to your verification endpoint with the email and id in the query string. This time, it’s important to compare both the incoming email and id values to the database record. This verifies that the recipient of the confirmation email is initiating the request.

The verification endpoint ensures that these values match an item in your database, then performs another update operation to set confirm to true, and update the timestamp. The item now looks like:

email	confirm	id	timestamp
`subscriber@example.com`	true	`uuid-xxxxx`	2020-11-01 00:37:39
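Here’s a rough sketch of the verification step in Python, again with a dict standing in for the table. The base URL and function names are hypothetical, and the constant-time comparison is my own choice for the illustration rather than something the project is known to do.

```python
import hmac
from datetime import datetime, timezone
from urllib.parse import parse_qs, urlencode, urlsplit


def confirmation_link(base_url, item):
    """Build a link carrying the email and id as a query string."""
    query = urlencode({"email": item["email"], "id": item["id"]})
    return f"{base_url}/?{query}"


def verify(table, link, now=None):
    """Set confirm to True only when both email and id match a stored item."""
    query = parse_qs(urlsplit(link).query)
    email = query.get("email", [""])[0]
    token = query.get("id", [""])[0]
    item = table.get(email)
    # Compare as bytes so unexpected characters in the query can't raise an error.
    if item is None or not hmac.compare_digest(item["id"].encode(), token.encode()):
        return False
    item["confirm"] = True
    item["timestamp"] = now or datetime.now(timezone.utc)
    return True
```

Note that urlencode percent-encodes the @ in the address, and parse_qs decodes it again on the way in.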

Querying for emails

You can now query your table to build your email list. Depending on your email sending solution, you might do this manually, with another Lambda, or even from the command line.

Since data for requested subscriptions (where confirm is false) is stored in the table alongside confirmed subscriptions, it’s important to differentiate this data when querying for email addresses to send to. You’ll want to ensure you only return emails where confirm is true.
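As a small illustration of that filter, assume you have already pulled the table items into a list of dicts (how you fetch them depends on your tooling). The function name is hypothetical.

```python
def confirmed_emails(items):
    """Return only the addresses whose confirm flag is True, sorted for stable output."""
    return sorted(item["email"] for item in items if item.get("confirm") is True)
```

Pending requests stay in the table, but never make it into the list you send to.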

Providing unsubscribe links

Similar to verifying an email address, Simple Subscribe uses email and id as arguments to the function that deletes an item from your DynamoDB table in order to unsubscribe an email address. To allow people to remove themselves from your list, you’ll need to provide a URL in each email you send that includes their email and id as a query string to the unsubscribe endpoint. It would look something like:

/?email=subscriber@example.com&id=uuid-xxxxx

When the link is clicked, the query string is passed to the unsubscribe endpoint. If the provided email and id match a database item, that item will be deleted.
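The same matching logic applies to unsubscribing. Here’s a minimal sketch with a dict as a hypothetical stand-in for the table. It isn’t the project’s Go implementation.

```python
import hmac


def unsubscribe(table, email, token):
    """Delete the item only when both the email and the id match."""
    item = table.get(email)
    if item is None or not hmac.compare_digest(item["id"].encode(), token.encode()):
        return False
    del table[email]
    return True
```

Requiring the id as well as the email means a stranger can’t unsubscribe someone just by knowing their address.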

Providing a method for your subscribers to automatically remove themselves from your list, without any human intervention necessary, is part of an ethical and respectful philosophy towards handling the data that’s been entrusted to you.

Caring for your data

Once you decide to accept other people’s data, it becomes your responsibility to care for it. This is applicable to everything you build. For Simple Subscribe, it means maintaining the security of your database, and periodically pruning your table.

In order to avoid retaining email addresses where confirm is false past a certain time frame, it would be a good idea to set up a cleaning function that runs on a regular schedule. This can be achieved manually, with an AWS Lambda function, or using the command line.

To clean up, find database items where confirm is false and timestamp is older than a particular point in time. Depending on your use case and request volumes, the frequency at which you choose to clean up will vary.
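A cleaning function along those lines might look something like this sketch. It works on a dict standing in for the table, and both the function name and the seven-day cutoff in the usage note are arbitrary assumptions.

```python
from datetime import datetime, timedelta, timezone


def prune_unconfirmed(table, max_age, now=None):
    """Delete unconfirmed items older than max_age and return the removed emails."""
    now = now or datetime.now(timezone.utc)
    cutoff = now - max_age
    stale = [
        email
        for email, item in table.items()
        if not item["confirm"] and item["timestamp"] < cutoff
    ]
    for email in stale:
        del table[email]
    return stale
```

For example, prune_unconfirmed(table, timedelta(days=7)) would drop requests that have sat unconfirmed for more than a week, while leaving confirmed subscribers alone no matter how old they are.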

Also depending on your use case, you may wish to keep backups of your data. If you are particularly concerned about data integrity, you can explore On-Demand Backup or Point-in-Time Recovery for DynamoDB.

Build your independent subscriber base

Building your own subscriber list can be an empowering endeavor! Whether you intend to start a newsletter, send out notifications for new content, or want to create a community around your work, there’s nothing more personal or direct than an email from me to you.

I encourage you to start building your subscriber base with Simple Subscribe today! Like most of my work, it’s open source and free for your personal use. Dive into the code at the GitHub repository.

WPA Key, WPA2, WPA3, and WEP Key: Wi-Fi security explained

2020-10-19T04:02:27-04:00

Setting up new Wi-Fi? Picking the type of password you need can seem like an arbitrary choice. After all, WEP, WPA, WPA2, and WPA3 all have mostly the same letters in them. A password is a password, so what’s the difference?

About 60 seconds to billions of years, as it turns out.

All Wi-Fi encryption is not created equal. Let’s explore what makes these four acronyms so different, and how you can best protect your home and organization Wi-Fi.

Wired Equivalent Privacy (WEP)

In the beginning, there was WEP.

Not to be confused with the name of a certain rap song.

Wired Equivalent Privacy is a deprecated security algorithm from 1997 that was intended to provide equivalent security to a wired connection. “Deprecated” means, “Let’s not do that anymore.”

Even when it was first introduced, it was known not to be as strong as it could have been, for two reasons: one, its underlying encryption mechanism; and two, World War II.

During World War II, the impact of code breaking (or cryptanalysis) was huge. Governments reacted by attempting to keep their best secret-sauce recipes at home. Around the time of WEP, U.S. Government restrictions on the export of cryptographic technology caused access point manufacturers to limit their devices to 64-bit encryption. Though this was later lifted to 128-bit, even this form of encryption offered a very limited possible key size.

This proved problematic for WEP. The small key size made the key easier to brute-force, especially when that key doesn’t often change.

WEP’s underlying encryption mechanism is the RC4 stream cipher. This cipher gained popularity due to its speed and simplicity, but that came at a cost. It’s not the most robust algorithm. WEP employs a single shared key among its users that must be manually entered on an access point device. (When’s the last time you changed your Wi-Fi password? Right.) WEP didn’t help matters either by simply concatenating the key with the initialization vector – which is to say, it sort of mashed its secret-sauce bits together and hoped for the best.

Initialization Vector (IV): fixed-size input to a low-level cryptographic algorithm, usually random.

Combined with the use of RC4, this left WEP particularly susceptible to related-key attacks. In the case of 128-bit WEP, your Wi-Fi password can be cracked by publicly available tools in around 60 seconds to three minutes.
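A little arithmetic shows why the key-plus-IV arrangement hurts. This sketch assumes the commonly documented 24-bit WEP IV, a figure not stated above, and that the advertised key size includes it. It also treats IVs as if they were chosen at random, which is a simplification.

```python
import math

IV_BITS = 24  # assumption: the 24-bit IV commonly documented for WEP


def secret_bits(advertised_bits, iv_bits=IV_BITS):
    """Bits of the key that are actually secret once the public IV is subtracted."""
    return advertised_bits - iv_bits


def iv_repeat_probability(packets, iv_bits=IV_BITS):
    """Approximate chance that at least two of the given random IVs are identical."""
    space = 2 ** iv_bits
    # Standard birthday-problem approximation.
    return 1 - math.exp(-packets * (packets - 1) / (2 * space))
```

Under these assumptions, “64-bit” WEP has 40 secret bits and “128-bit” WEP has 104, and iv_repeat_probability(5000) comes out a little over one half. On a busy network, a repeated IV shows up quickly.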

While some devices came to offer 152-bit or 256-bit WEP variants, this failed to solve the fundamental problems of WEP’s underlying encryption mechanism.

So, yeah. Let’s not do that anymore.

Wi-Fi Protected Access (WPA)

A new, interim standard sought to temporarily “patch” the problem of WEP’s (lack of) security. The name Wi-Fi Protected Access (WPA) certainly sounds more secure, so that’s a good start; however, WPA first started out with another, more descriptive name.

Ratified in a 2004 IEEE standard, Temporal Key Integrity Protocol (TKIP) uses a dynamically-generated, per-packet key. Each packet sent has a unique temporal 128-bit key (see? Descriptive!) that solves the susceptibility to related-key attacks brought on by WEP’s shared key mashing.

TKIP also implements other measures, such as a message authentication code (MAC). Sometimes known as a checksum, a MAC provides a cryptographic way to verify that messages haven’t been changed. In TKIP, an invalid MAC can also trigger rekeying of the session key. If the access point receives an invalid MAC twice within a minute, the attempted intrusion can be countered by changing the key an attacker is trying to crack.
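To illustrate the general idea of a MAC, here’s a sketch using HMAC-SHA256 from Python’s standard library. This is not the algorithm TKIP uses. It’s a swapped-in stand-in to show tagging and verification, and the FailureMonitor class is a hypothetical model of the “two invalid MACs within a minute” rule described above.

```python
import hashlib
import hmac


def make_tag(key: bytes, message: bytes) -> bytes:
    """Compute a MAC over the message with a shared secret key."""
    return hmac.new(key, message, hashlib.sha256).digest()


def is_authentic(key: bytes, message: bytes, tag: bytes) -> bool:
    """Recompute the MAC and compare it in constant time."""
    return hmac.compare_digest(make_tag(key, message), tag)


class FailureMonitor:
    """Signal a rekey when two MAC failures land within the time window."""

    def __init__(self, window_seconds=60):
        self.window = window_seconds
        self.last_failure = None

    def record_failure(self, at):
        """Record a failure at time `at` (seconds); return True if it's time to rekey."""
        rekey = self.last_failure is not None and at - self.last_failure <= self.window
        # Start counting afresh after a rekey.
        self.last_failure = None if rekey else at
        return rekey
```

Changing even one byte of the message makes is_authentic return False, which is what lets the receiver notice tampering.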

Unfortunately, in order to preserve compatibility with the existing hardware that WPA was meant to “patch,” TKIP retained the use of the same underlying encryption mechanism as WEP – the RC4 stream cipher. While it certainly improved on the weaknesses of WEP, TKIP eventually proved vulnerable to new attacks that extended previous attacks on WEP. These attacks take a little longer to execute by comparison: for example, twelve minutes in the case of one, and 52 hours in another. This is more than sufficient, however, to deem TKIP no longer secure.

WPA, or TKIP, has since been deprecated as well. So let’s also not do that anymore.

Which brings us to…

Wi-Fi Protected Access II (WPA2)

Rather than spend the effort to come up with an entirely new name, the improved Wi-Fi Protected Access II (WPA2) standard instead focuses on using a new underlying cipher. Instead of the RC4 stream cipher, WPA2 employs a block cipher called Advanced Encryption Standard (AES) to form the basis of its encryption protocol. The protocol itself, abbreviated CCMP, draws most of its security from the length of its rather long name (I’m kidding): Counter Mode Cipher Block Chaining Message Authentication Code Protocol, which shortens to Counter Mode CBC-MAC Protocol, or CCM mode Protocol, or CCMP. 🤷

CCM mode is essentially a combination of a few good ideas. It provides data confidentiality through CTR mode, or counter mode. To vastly oversimplify, this adds complexity to plaintext data by encrypting the successive values of a count sequence that does not repeat. CCM also integrates CBC-MAC, a block cipher method for constructing a MAC.
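Counter mode is easier to picture with a toy. The sketch below is emphatically not AES or CCMP, and not secure. It uses SHA-256 from the standard library as a stand-in block function so that it runs without extra packages, and the function names are hypothetical. It shows only the shape of the idea: encrypt successive counter values to get a keystream, then XOR that with the data.

```python
import hashlib

BLOCK_SIZE = 32  # bytes produced by the SHA-256 stand-in


def keystream_block(key: bytes, nonce: bytes, counter: int) -> bytes:
    """Toy stand-in for encrypting one counter value with a block cipher."""
    return hashlib.sha256(key + nonce + counter.to_bytes(8, "big")).digest()


def ctr_xor(key: bytes, nonce: bytes, data: bytes) -> bytes:
    """XOR data with the keystream; the same call both encrypts and decrypts."""
    out = bytearray()
    for offset in range(0, len(data), BLOCK_SIZE):
        block = keystream_block(key, nonce, offset // BLOCK_SIZE)
        chunk = data[offset:offset + BLOCK_SIZE]
        out.extend(b ^ k for b, k in zip(chunk, block))
    return bytes(out)
```

Because the operation is just XOR with a keystream, running ctr_xor twice with the same key and nonce returns the original data. That’s also why a key and nonce pair must never be reused.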

AES itself is on good footing. The AES specification was established in 2001 by the U.S. National Institute of Standards and Technology (NIST) after a five-year competitive selection process during which fifteen proposals for algorithm designs were evaluated. As a result of this process, a family of ciphers called Rijndael (Dutch) was selected, and a subset of these became AES. For the better part of two decades, AES has been used to protect everyday Internet traffic as well as certain levels of classified information in the U.S. Government.

While possible attacks on AES have been described, none have yet been proven to be practical in real-world use. The fastest attack on AES in public knowledge is a key-recovery attack that improved on brute-forcing AES by a factor of about four. How long would it take? Some billions of years.
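For a rough sense of scale, here’s a back-of-the-envelope sketch. The function name is hypothetical, the guessing rate you pass in is an arbitrary assumption, and the speedup parameter simply models the “factor of about four” mentioned above.

```python
SECONDS_PER_YEAR = 365.25 * 24 * 60 * 60


def years_to_search(key_bits, guesses_per_second, speedup=1):
    """Years needed to try every key of the given size at a fixed guessing rate."""
    return 2 ** key_bits / speedup / guesses_per_second / SECONDS_PER_YEAR
```

Even at an assumed trillion guesses per second, years_to_search(128, 1e12, speedup=4) is vastly more than a billion years, while a 40-bit key at the same rate falls in about a second.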

Wi-Fi Protected Access III (WPA3)

The next installment of the WPA trilogy has been required for new devices since July 1, 2020. Expected to further enhance the security of WPA2, the WPA3 standard seeks to improve password security by being more resilient to word list or dictionary attacks.

Unlike its predecessors, WPA3 will also offer forward secrecy. This adds the considerable benefit of protecting previously exchanged information even if a long-term secret key is compromised. Forward secrecy is already provided by protocols like TLS by using asymmetric keys to establish shared keys. You can learn more about TLS in this post.

As WPA2 has not been deprecated, both WPA2 and WPA3 remain your top choices for Wi-Fi security.

If the other ones suck, why are they still around?

You may be wondering why your access point even allows you to choose an option other than WPA2 or WPA3. The likely reason is that you’re using legacy hardware, which is what tech people call your mom’s router.

Since the deprecation of WEP and WPA occurred (in old-people terms) rather recently, it’s possible in large organizations as well as your parents’ house to find older hardware that still uses these protocols. Even newer hardware may have a business need to support these older protocols.

While I may be able to convince you to invest in a shiny new top-of-the-line Wi-Fi appliance, most organizations are a different story. Unfortunately, many just aren’t yet cognizant of the important role cybersecurity plays in meeting customer needs and boosting that bottom line. Additionally, switching to newer protocols may require new internal hardware or firmware upgrades. Especially on complex systems in large organizations, upgrading devices can be financially or strategically difficult.

Boost your Wi-Fi security

If it’s an option, choose WPA2 or WPA3. Cybersecurity is a field that evolves by the day, and getting stuck in the past can have dire consequences.

If you can’t use WPA2 or WPA3, do the best you can to take additional security measures. The best bang for your buck is to use a Virtual Private Network (VPN). Using a VPN is a good idea no matter which type of Wi-Fi encryption you have. On open Wi-Fi (coffee shops) and using WEP, it’s plain irresponsible to go without a VPN. Kind of like shouting out your bank details as you order your second cappuccino.

When possible, ensure you only connect to known networks that you or your organization control. Many cybersecurity attacks are executed when victims connect to an imitation public Wi-Fi access point, also called an evil twin attack, or Wi-Fi phishing. These fake hotspots are easily created using publicly accessible programs and tools. A reputable VPN can help mitigate damage from these attacks as well, but it’s always better not to take the risk. If you travel often, consider purchasing a portable hotspot that uses a cellular data plan, or using data SIM cards for all your devices.

Much more than just acronyms

WEP, WPA, WPA2, and WPA3 mean a lot more than a bunch of similar letters – in some cases, it’s a difference of billions of years minus about 60 seconds.

On more of a now-ish timescale, I hope I’ve taught you something new about the security of your Wi-Fi and how you can improve it!

Your cybersecurity starter pack

2020-10-04T04:30:12-04:00

Readers of my blog typically know more about technology and cybersecurity than most people. This article is for most people. If someone you know could benefit from a simple and straightforward introduction to cybersecurity tools, please share this article with them – it benefits everyone!

If you’ve ever said to yourself:

“There’s no one targeting lil ol’ me.”
“I have nothing to hide, anyway.”
“I’m too busy to learn all this stuff. Why can’t someone just give me a simple summary of best practices that I can skim in approximately seven minutes?”

First of all, you might want to stop talking to yourself in public. Secondly, here is a simple summary of best practices that you can skim in approximately seven minutes.

Introducing your three-step starter pack

While there are many different degrees of security, privacy, and anonymity, these three basics are accessible to all:

Use a VPN
Use multifactor authentication
Develop a healthy sense of skepticism

I’ll discuss each of these and help you get started with your security upgrade. But first…

Why is cybersecurity important?

Would you let just anyone walk into your house, or even look through your open doorway from across the street? If not, you might appreciate that the cybersecurity practices we’ll discuss today are not that different from locking your front door.

Cybersecurity isn’t about finding some magic spell that completely secures your online activities – that would be nice, but it’s unrealistic. Good security practices are about employing some thoughtful habits that make your online activities more secure than the next guy, in much the same way as you learned to lock your front door.

Security breaches and incidents happen every day. Most of them occur because an automated scanner cast a wide net and found a person or company with lax security that a hacker could then exploit. Don’t be that guy.

1. Use a VPN

Let’s say you send a lot of mail, but never bother to put your letters in envelopes or even fold them in half. Anyone who bothers to look can read all your dirty secrets (not that you have any).

When you use a Virtual Private Network, or VPN, especially if you often connect to public WiFi, it’s like putting your letters into cryptographically-sealed envelopes and sending them via a special invisible courier service. No one but the intended recipient can read your letters, and no one but you and the courier know to whom the letters are sent.

Encrypted mail still won’t stop you from the accidental reply all, unfortunately.

VPNs prevent others from reading your communications. This may include opportunistic attackers who scan open WiFi, and even your own internet service provider (ISP) who may sell your usage data for advertising dollars.

Choosing a VPN

A few important differentiating factors can help you choose a VPN provider.

Is it free? VPNs cost money to operate; if one is offered for free, consider what they might be doing in order to cover their costs. Generally, I recommend avoiding free VPN apps and services; they’ll typically cost you much more than you’ll know. Expect to pay between $5 and $10 USD monthly for the service.
Where is it based? Understand where your VPN provider is based, and what that country’s laws allow them to do with your data.
Do they keep logs? Part of the philosophy of using a VPN is that no one has any business getting into your business when it comes to online activities. When a VPN provider keeps logs of your usage, that defeats the purpose. Instead of your ISP knowing just what you’re up to online, that knowledge is simply transferred to the logging VPN. Look for VPN providers with a strict no-logging policy, or if you’re up for it, roll your own.

2. Use multifactor authentication

Passwords are dead. Computationally, they are a solved problem. Cracking your password is just a matter of time.

Unfortunately, many people still help to speed up the process by using the same compromised passwords for multiple accounts, putting themselves at further risk.

The answer, at least for now, is multifactor authentication (MFA). MFA is made up of three kinds of authentication factors:

Something you know, like a pass phrase;
Something you have, like a chip-and-PIN card or phone; and
Something that you are, like your face or fingerprint.

Also the name of my next beatboxing team.

Two or more of these factors are infinitely better than a password alone, especially if your password is on this list.

Multiple authentication factors are now widely supported by account providers and social media sites. If you have the choice, avoid using text messages, or SMS, as a way of receiving authentication codes. SMS authentication leaves you vulnerable to the SIM swap attack - please direct further questions to Jack Dorsey.

Instead, use a One Time Password (OTP) app such as Authy to generate codes on your device. This ensures that you alone, using that particular device, will have the correct authentication code.
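If you’re curious how those codes are produced, many OTP apps implement the time-based scheme from RFC 6238, which builds on the counter-based scheme in RFC 4226. Here’s a minimal sketch using only Python’s standard library. It assumes the common defaults of HMAC-SHA1, 30-second steps, and six digits, and isn’t meant to describe any particular app’s internals.

```python
import hashlib
import hmac
import struct
import time


def hotp(secret: bytes, counter: int, digits: int = 6) -> str:
    """Counter-based one-time password (RFC 4226)."""
    digest = hmac.new(secret, struct.pack(">Q", counter), hashlib.sha1).digest()
    # Dynamic truncation: the low 4 bits of the last byte pick a 4-byte window.
    offset = digest[-1] & 0x0F
    code = struct.unpack(">I", digest[offset:offset + 4])[0] & 0x7FFFFFFF
    return str(code % 10 ** digits).zfill(digits)


def totp(secret: bytes, at=None, step: int = 30, digits: int = 6) -> str:
    """Time-based one-time password (RFC 6238): HOTP over the current time step."""
    at = time.time() if at is None else at
    return hotp(secret, int(at // step), digits)
```

Your device and the account provider share the secret once, usually via a QR code. After that, both sides can compute the same code from the current time without sending anything over SMS.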

You can also use hardware authentication keys such as the YubiKey, but these aren’t yet as widely supported as OTP apps.

3. Develop a healthy sense of skepticism

Social engineering, sometimes SE, is the use of psychological persuasion to get an unwitting target to give up access or information. This can take the form of phishing emails, letters, or phone calls (vishing) as well as far more sophisticated spear-phishing attacks of high-value targets, like company executives.

While some attacks are easier to spot, others use cognitive biases very effectively and are difficult even for security professionals to avoid. No human is immune.

Ultimately, the weakest link in your cybersecurity defense is you. All the VPNs and MFA on the Internet won’t protect you if a scam can trick you into opening the front gates. Always look a Trojan gift horse in the mouth.

Yes, I know it’s a very nice looking wooden horse. Also free. Did you order it? No? Then it can stay outside.

Develop the habit of second-guessing things delivered to your virtual doorstep. Email, phone, and messaging scams range in sophistication. Even security professionals can fall for a good scam.

One way to protect yourself is to practice a healthy sense of skepticism. Question communications that ask you to click on links or visit a website, even if they come from someone you know or a company you use.

If you’re not certain that your bank or mother sent this email, pick up the phone and call them. Even if you think you are certain, pick up the phone and double check. You don’t call your mother enough, anyway.

Oh, and if the person on the phone is from your local tax office or the IRS or the CRA and they’re about to freeze your accounts because a case of mistaken identity has resulted in you being criminally charged for not repaying a loan on a 600-foot yacht in Malibu, just hang up. You know better than that. Tax agencies don’t have phones.

A safer Internet

Congratulations! You now have three tools to make your personal cybersecurity better than the next guy’s. If enough people do that, the whole neighborhood (or in this case, the Internet) will benefit as a result.

If this article piqued your interest, you can go further and outsource your security with a password manager and temporary virtual credit cards.

Cheat sheets and other resources

I’ll leave you with a few resources that I’ve enjoyed:

The Electronic Frontier Foundation website Surveillance Self Defense offers many great guides and how-to’s, such as setting up the encrypted messaging app Signal on your mobile device, and protecting yourself on social media.
The Cybersecurity and Infrastructure Security Agency (CISA) offers many shareable starter resources.
Working from home? The National Security Agency Central Security Service has Telework and Mobile Security Guides that discuss best practices for an unprecedented era of remote work.

Increase developer confidence with a great Django test suite

2020-10-01T05:50:37-04:00

Done correctly, tests are one of your application’s most valuable assets.

The Django framework in particular offers your team the opportunity to create an efficient testing practice. Based on the Python standard library unittest, proper tests in Django are fast to write, faster to run, and can offer you a seamless continuous integration solution for taking the pulse of your developing application.

With comprehensive tests, developers have higher confidence when pushing changes. I’ve seen firsthand in my own teams that good tests can boost development velocity as a direct result of a better developer experience.

In this article, I’ll share my own experiences in building useful tests for Django applications, from the basics to the best possible execution.

What to test

Tests are extremely important. Far beyond simply letting you know if a function works, tests can form the basis of your team’s understanding of how your application is intended to work.

Here’s the main goal: if you hit your head and forgot everything about how your application works tomorrow, you should be able to regain most of your understanding by reading and running the tests you write today.

Here are some questions that may be helpful to ask as you decide what to test:

What is our customer supposed to be able to do?
What is our customer not supposed to be able to do?
What should this method, view, or logical flow achieve?
When, how, or where is this feature supposed to execute?

Tests that make sense for your application can help build developer confidence. With these sensible safeguards in place, developers make improvements more readily, and feel confident introducing innovative solutions to product needs. The result is an application that comes together faster, and features that are shipped often and with confidence.

Where to put tests

If you only have a few tests, you may organize your test files similarly to Django’s default app template by putting them all in a file called tests.py. This straightforward approach is best for smaller applications.

As your application grows, you may like to split your tests into different files, or test modules. One method is to use a directory to organize your files, such as projectroot/app/tests/. The name of each test file within that directory should begin with test, for example, test_models.py.

Besides being aptly named, these files will be found by Django’s built-in test discovery, which is based on the unittest module. All files in your application with names that begin with test will be collected into a test suite.

This convenient test discovery allows you to place test files anywhere that makes sense for your application. As long as they’re correctly named, Django’s test utility can find and run them.
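To see which file names would be picked up, here’s a small sketch that checks names against test*.py, the default discovery pattern. The helper name is hypothetical, and it only imitates the file name matching, not Django’s full discovery.

```python
from fnmatch import fnmatch

DEFAULT_PATTERN = "test*.py"  # the default file name pattern for test discovery


def is_discovered(filename, pattern=DEFAULT_PATTERN):
    """Return True if a file name matches the discovery pattern."""
    return fnmatch(filename, pattern)
```

Both tests.py and test_models.py match, while a name like models_test.py would be skipped.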

How to document a test

Use docstrings to explain what a test is intended to verify at a high level. For example:

def test_create_user(self):
    """Creating a new user object should also create an associated profile object"""
    # ...

These docstrings help you quickly understand what a test is supposed to be doing. Besides helping you navigate the codebase, they make it obvious when a test doesn’t verify what the docstring says it should.

Docstrings are also shown when the tests are being run, which can be helpful for logging and debugging.
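You can see where that description comes from with plain unittest, which Django’s TestCase builds on. This sketch uses a hypothetical UserTest class with a placeholder assertion. It only demonstrates that the first line of the docstring is what the test reports as its description.

```python
import unittest


class UserTest(unittest.TestCase):
    def test_create_user(self):
        """Creating a new user object should also create an associated profile object"""
        self.assertTrue(True)  # placeholder assertion for the sketch


# shortDescription() returns the first line of the test method's docstring.
description = UserTest("test_create_user").shortDescription()
```

Keeping the first line of each docstring meaningful pays off, since that’s the line that ends up in verbose test output.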

What a test needs to work

Django tests can be quickly set up using data created in the setUpTestData() method. You can use various approaches to create your test data, such as utilizing external files, or even hard-coding silly phrases or the names of your staff. Personally, I much prefer to use a fake-data-generation library, such as faker.

The proper setup of arbitrary testing data can help you ensure that you’re testing your application functionality instead of accidentally testing test data. Because generators like faker add some degree of unexpectedness to your inputs, your test data can be more representative of real-world use.

Here is an example setup for a test:

from django.test import TestCase
from faker import Faker

from app.models import MyModel, AnotherModel

fake = Faker()


class MyModelTest(TestCase):
    @classmethod
    def setUpTestData(cls):
        """Quickly set up data for the whole TestCase"""
        cls.user_first = fake.first_name()
        cls.user_last = fake.last_name()

    def test_create_models(self):
        """Creating a MyModel object should also create AnotherModel object"""
        # In test methods, use the variables created above
        test_object = MyModel.objects.create(
            first_name=self.user_first,
            last_name=self.user_last,
            # ...
        )
        another_model = AnotherModel.objects.get(my_model=test_object)
        self.assertEqual(another_model.first_name, self.user_first)
        # ...

Tests pass or fail based on the outcome of the assertion methods. You can use Python’s unittest methods, and Django’s assertion methods.

For further guidance on writing tests, see Testing in Django.

Best possible execution for running your tests

Django’s test suite is manually run with:

./manage.py test

I rarely run my Django tests this way.

The best, or most efficient, testing practice is one that occurs without you or your developers ever thinking, “I need to run the tests first.” The beauty of Django’s near-effortless test suite set up is that it can be seamlessly run as a part of regular developer activities. This could be in a pre-commit hook, or in a continuous integration or deployment workflow.

I’ve previously written about how to use pre-commit hooks to improve your developer ergonomics and save your team some brainpower. Django’s speedy tests can be run this way, and they become especially efficient if you can run tests in parallel.

Tests that run as part of a CI/CD workflow, for example, on pull requests with GitHub Actions, require no regular effort from your developers to remember to run tests at all. I’m not sure how plainly I can put it – this one’s literally a no-brainer.

Testing your way to a great Django application

Tests are extremely important, and underappreciated. They can catch logical errors in your application. They can help explain and validate how concepts and features of your product actually function. Best of all, tests can boost developer confidence and development velocity as a result.

The best tests are ones that are relevant, help to explain and define your application, and are run continuously without a second thought. I hope I’ve now shown you how testing in Django can help you to achieve these goals for your team!

Delightful Django Development: Setup, Hooks, and CI/CD

2020-09-22T04:55:19-04:00

Do you want your team to enjoy your development workflow? Do you think building software should be fun and existentially fulfilling? If so, this is the post for you!

I’ve been developing with Django for years, and I’ve never been happier with my Django project set up than I am right now. Here’s how I’m making a day of developing with Django the most relaxing and enjoyable development experience possible for myself and my engineering team.

A custom CLI tool for your Django project

Instead of typing:

python3 -m venv env
source env/bin/activate
pip install -r requirements.txt
python3 manage.py makemigrations
python3 manage.py migrate
python3 manage.py collectstatic
python3 manage.py runserver

Wouldn’t it be much nicer to type:

make start

…and have all that happen for you? I think so!

We can do that with a self-documenting Makefile! Here’s one I frequently use when developing my Django applications, like ApplyByAPI.com:

VENV := env
BIN := $(VENV)/bin
PYTHON := $(BIN)/python
SHELL := /bin/bash

include .env

.PHONY: help
help: ## Show this help
    @egrep -h '\s##\s' $(MAKEFILE_LIST) | awk 'BEGIN {FS = ":.*?## "}; {printf "\033[36m%-20s\033[0m %s\n", $$1, $$2}'

.PHONY: venv
venv: ## Make a new virtual environment
    python3 -m venv $(VENV) && source $(BIN)/activate

.PHONY: install
install: venv ## Make venv and install requirements
    $(BIN)/pip install --upgrade -r requirements.txt

freeze: ## Pin current dependencies
    $(BIN)/pip freeze > requirements.txt

migrate: ## Make and run migrations
    $(PYTHON) manage.py makemigrations
    $(PYTHON) manage.py migrate

db-up: ## Pull and start the Docker Postgres container in the background
    docker pull postgres
    docker-compose up -d

db-shell: ## Access the Postgres Docker database interactively with psql. Pass in DBNAME=.
    docker exec -it container_name psql -d $(DBNAME)

.PHONY: test
test: ## Run tests
    $(PYTHON) manage.py test application --verbosity=0 --parallel --failfast

.PHONY: run
run: ## Run the Django server
    $(PYTHON) manage.py runserver

start: install migrate run ## Install requirements, apply migrations, then start development server

You’ll notice the presence of the line include .env above. This ensures make has access to environment variables stored in a file called .env. This allows Make to utilize these variables in its commands, for example, the name of my virtual environment, or to pass in $(DBNAME) to psql.

What’s with that weird “##” comment syntax? A Makefile like this gives you a handy suite of command-line aliases you can check in to your Django project. It’s very useful so long as you’re able to remember what all those aliases are.

The help command above, which runs by default, prints a helpful list of available commands when you run make or make help:

help                 Show this help
venv                 Make a new virtual environment
install              Make venv and install requirements
freeze               Pin current dependencies
migrate              Make and run migrations
db-up                Pull and start the Docker Postgres container in the background
db-shell             Access the Postgres Docker database interactively with psql
test                 Run tests
run                  Run the Django server
start                Install requirements, apply migrations, then start development server

All the usual Django commands are covered, and we’ve got a test command that runs our tests with the options we prefer. Brilliant.

You can read my full post about self-documenting Makefiles here, which also includes an example Makefile using pipenv.

Save your brainpower with pre-commit hooks

I previously wrote about some technical ergonomics that can make it a lot easier for teams to develop great software.

One area that’s a no-brainer is using pre-commit hooks to lint code prior to checking it in. This helps to ensure the quality of the code your developers check in, but most importantly, ensures that no one on your team is spending time trying to remember if it should be single or double quotes or where to put a line break.

The confusingly-named pre-commit framework is an otherwise fantastic way to keep hooks (which are not included in cloned repositories) consistent across local environments.

Here is my configuration file, .pre-commit-config.yaml, for my Django projects:

fail_fast: true
repos:
  - repo: https://github.com/pre-commit/pre-commit-hooks
    rev: v3.1.0
    hooks:
      - id: detect-aws-credentials
  - repo: https://github.com/psf/black
    rev: 19.3b0
    hooks:
      - id: black
  - repo: https://github.com/asottile/blacken-docs
    rev: v1.7.0
    hooks:
      - id: blacken-docs
        additional_dependencies: [black==19.3b0]
  - repo: local
    hooks:
      - id: markdownlint
        name: markdownlint
        description: "Lint Markdown files"
        entry: markdownlint '**/*.md' --fix --ignore node_modules --config "./.markdownlint.json"
        language: node
        types: [markdown]

These hooks check for accidental secret commits, format Python files using Black, format Python snippets in Markdown files using blacken-docs, and lint Markdown files as well. To install them, just type pre-commit install.

There are likely even more useful hooks available for your particular use case: see supported hooks to explore.

Useful gitignores

An underappreciated way to improve your team’s daily development experience is to make sure your project uses a well-rounded .gitignore file. It can help prevent files containing secrets from being committed, and can additionally save developers hours of tedium by ensuring you’re never sifting through a git diff of generated files.

To efficiently create a gitignore for Python and Django projects, Toptal’s gitignore.io can be a nice resource for generating a robust .gitignore file.

I still recommend examining the generated results yourself to ensure that ignored files suit your use case, and that nothing you want ignored is commented out.

Continuous testing with GitHub Actions

If your team works on GitHub, setting up a testing process with Actions is low-hanging fruit.

Tests that run in a consistent environment on every pull request can help eliminate “works on my machine” conundrums, as well as ensure no one’s sitting around waiting for a test to run locally.

A hosted CI environment like GitHub Actions can also help when running integration tests that require using managed services resources. You can use encrypted secrets in a repository to grant the Actions runner access to resources in a testing environment, without worrying about creating testing resources and access keys for each of your developers to use.

I’ve written on many occasions about setting up Actions workflows, including using one to run your Makefile, and how to integrate GitHub event data. GitHub even interviewed me about Actions once.

For Django projects, here’s a GitHub Actions workflow that runs tests with a consistent Python version whenever someone opens a pull request in the repository.

name: Run Django tests

on: pull_request

jobs:
  test:

    runs-on: ubuntu-latest

    steps:
      - uses: actions/checkout@v2
      - name: Set up Python
        uses: actions/setup-python@v2
        with:
          python-version: '3.8'
      - name: Install dependencies
        run: make install
      - name: Run tests
        run: make test

For the installation and test commands, I’ve simply utilized the Makefile that’s been checked in to the repository. A benefit of using your Makefile commands in your CI test workflows is that you only need to keep them updated in one place – your Makefile! No more “why is this working locally but not in CI??!?” headaches.

If you want to step up your security game, you can add Django Security Check as an Action too.

Set up your Django project for success

Want to help keep your development team happy? Set them up for success with these best practices for Django development. Remember, an ounce of brainpower is worth a pound of software!

Manipulating data with Django migrations

2020-09-14T02:12:57-04:00

Growing, successful applications are a lovely problem to have. As a product develops, it tends to accumulate complication the way your weekend cake project accumulates layers of frosting. Thankfully, Django, my favorite batteries-included framework, handles complexity pretty well.

Django models help humans work with data in a way that makes sense to our brains, and the framework offers plenty of classes you can inherit to help you rapidly develop a robust application from scratch. As for developing on existing Django applications, there’s a feature for that, too. In this article, we’ll cover how to use Django migrations to update your existing models and database.

What’s under the hood

Django migrations are Python files that help you add and change things in your database tables to reflect changes in your Django models. To understand how Django migrations help you work with data, it may be helpful to understand the underlying structures we’re working with.

What’s a database table

If you’ve laid eyes on a spreadsheet before, you’re already most of the way to understanding a database table. In a relational database, for example, a PostgreSQL database, you can expect to see data organized into columns and rows. A relational database table may have a set number of columns and any number of rows.

In Django, each model is its own table. For example, here’s a Django model:

from django.db import models


class Lunch(models.Model):
    left_side = models.CharField(max_length=100, null=True)
    center = models.CharField(max_length=100, null=True)
    right_side = models.CharField(max_length=100, null=True)

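Each field is a column, and each row is a Django object instance of that model. Here’s a representation of a database table for the Django model “Lunch” above. For simplicity, we’ll call it lunch_table; by default, Django actually names the table after the app label and the model, for example backend_lunch for an app called backend.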

id	left_side	center	right_side
1	Fork	Plate	Spoon

The model Lunch has three fields: left_side, center, and right_side. One instance of a Lunch object would have “Fork” for the left_side, a “Plate” for the center, and “Spoon” for the right_side. Django automatically adds an id field if you don’t specify a primary key.

If you wanted to change the name of your Lunch model, you would do so in your models.py code. For example, change “Lunch” to “Dinner,” then run python manage.py makemigrations. You’ll see:

python manage.py makemigrations
Did you rename the backend.Lunch model to Dinner? [y/N] y
Migrations for 'backend':
  backend/migrations/0003_auto_20200922_2331.py
    - Rename model Lunch to Dinner

Django automatically generates the appropriate migration files. The relevant line of the generated migrations file in this case would look like:

migrations.RenameModel(old_name="Lunch", new_name="Dinner"),

This operation would rename our “Lunch” model to “Dinner” while keeping everything else the same. But what if you also wanted to change the structure of the database table itself, its schema, as well as make sure that existing data ends up in the right place on your Dinner table?

Let’s explore how to turn our Lunch model into a Dinner model that looks like this:

from django.db import models


class Dinner(models.Model):
    top_left = models.CharField(max_length=100, null=True)
    top_center = models.CharField(max_length=100, null=True)
    top_right = models.CharField(max_length=100, null=True)
    bottom_left = models.CharField(max_length=100, null=True)
    bottom_center = models.CharField(max_length=100, null=True)
    bottom_right = models.CharField(max_length=100, null=True)

…with a database table that would look like this:

id	top_left	top_center	top_right	bottom_left	bottom_center	bottom_right
1	Bread plate	Spoon	Glass	Fork	Plate	Knife

Manipulating data with Django migrations

Before you begin to manipulate your data, it’s always a good idea to create a backup of your database that you can restore in case something goes wrong. There are various ways to do this depending on the database you’re using. You can typically find instructions by searching for the name of your database along with keywords like backup, recovery, or snapshot.

In order to design your migration, it’s helpful to become familiar with the available migration operations. Migrations are run step-by-step, and each operation is some flavor of adding, removing, or altering data. Like a strategic puzzle, it’s important to make model changes one step at a time so that the generated migrations have the correct result.

We’ve already renamed our model successfully. Now, we’ll rename the fields that hold the data we want to retain:

class Dinner(models.Model):
    bottom_left = models.CharField(max_length=100, null=True)
    bottom_center = models.CharField(max_length=100, null=True)
    top_center = models.CharField(max_length=100, null=True)

Django is sometimes smart enough to determine the old and new field names correctly. You’ll be asked for confirmation:

python manage.py makemigrations
Did you rename dinner.center to dinner.bottom_center (a CharField)? [y/N] y
Did you rename dinner.left_side to dinner.bottom_left (a CharField)? [y/N] y
Did you rename dinner.right_side to dinner.top_center (a CharField)? [y/N] y
Migrations for 'backend':
  backend/migrations/0004_auto_20200914_2345.py
    - Rename field center on dinner to bottom_center
    - Rename field left_side on dinner to bottom_left
    - Rename field right_side on dinner to top_center

In some cases, you’ll want to rename one field at a time, running makemigrations after each change.

Now that the existing fields have been migrated to their new names, add the remaining fields to the model:

class Dinner(models.Model):
    top_left = models.CharField(max_length=100, null=True)
    top_center = models.CharField(max_length=100, null=True)
    top_right = models.CharField(max_length=100, null=True)
    bottom_left = models.CharField(max_length=100, null=True)
    bottom_center = models.CharField(max_length=100, null=True)
    bottom_right = models.CharField(max_length=100, null=True)

Running makemigrations again now gives us:

python manage.py makemigrations
Migrations for 'backend':
  backend/migrations/0005_auto_20200914_2351.py
    - Add field bottom_right to dinner
    - Add field top_left to dinner
    - Add field top_right to dinner

You’re done! By generating Django migrations, you’ve successfully set up your dinner_table and moved existing data to its new spot.
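To see what those three steps do to the stored data, here is a rough sketch using Python’s built-in sqlite3 module. It is not the SQL Django generates; the plain table names lunch and dinner are simplifications chosen for illustration, and the column renames assume SQLite 3.25 or newer.

```python
import sqlite3

# In-memory database standing in for the real one (illustration only)
conn = sqlite3.connect(":memory:")
conn.row_factory = sqlite3.Row
conn.execute(
    "CREATE TABLE lunch ("
    "id INTEGER PRIMARY KEY, left_side TEXT, center TEXT, right_side TEXT)"
)
conn.execute(
    "INSERT INTO lunch (left_side, center, right_side) "
    "VALUES ('Fork', 'Plate', 'Spoon')"
)

# Step 1: rename the model (roughly what RenameModel achieves)
conn.execute("ALTER TABLE lunch RENAME TO dinner")

# Step 2: rename the fields that hold data we want to keep
renames = [
    ("left_side", "bottom_left"),
    ("center", "bottom_center"),
    ("right_side", "top_center"),
]
for old_name, new_name in renames:
    conn.execute(f"ALTER TABLE dinner RENAME COLUMN {old_name} TO {new_name}")

# Step 3: add the remaining, initially empty, fields
for new_column in ("top_left", "top_right", "bottom_right"):
    conn.execute(f"ALTER TABLE dinner ADD COLUMN {new_column} TEXT")

# The existing row survives, with its values under the new column names
row = dict(conn.execute("SELECT * FROM dinner").fetchone())
print(row)
```

The existing values stay in the row under their new names, while the newly added columns start out as NULL, which is why the fields allow null=True.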

Additional complexity

You’ll notice that our Lunch and Dinner models are not very complex. Out of Django’s many model field options, we’re just using CharField. We also set null=True to let Django store empty values as NULL in the database.

Django migrations can handle additional complexity, such as changing field types, and whether a blank or null value is permitted. I keep Django’s model field reference handy as I work with varying types of data and different use cases.

De-mystified migrations

I hope this article has helped you better understand Django migrations and how they work!

Now that you can change models and manipulate existing data in your Django application, be sure to use your powers wisely! Back up your database, research and plan your migrations, and always run tests before working with customer data. By doing so, you have the potential to enable your application to grow – with manageable levels of complexity.

What is TLS? Transport Layer Security encryption explained in plain english

2020-09-05T04:48:39-06:00

If you want to have a confidential conversation with someone you know, you might meet up in person and find a private place to talk. If you want to send data confidentially over the Internet, you might have a few more considerations to cover.

TLS, or Transport Layer Security, refers to a protocol. “Protocol” is a word that means, “the way we’ve agreed to do things around here,” more or less. The “transport layer” part of TLS simply refers to host-to-host communication, such as how a client and a server interact, in the Internet protocol suite model.

The TLS protocol attempts to solve these fundamental problems:

How do I know you are who you say you are?
How do I know this message from you hasn’t been tampered with?
How can we communicate securely?

Here’s how TLS works, explained in plain English. As with many successful interactions, it begins with a handshake.

Getting to know you

The basic process of a TLS handshake involves a client, such as your web browser, and a server, such as one hosting a website, establishing some ground rules for communication. It begins with the client saying hello. Literally. It’s called a ClientHello message.

The ClientHello message tells the server which TLS protocol version and cipher suites it supports. While “cipher suite” sounds like a fancy hotel upgrade, it just refers to a set of algorithms that can be used to secure communications. The server, in a similarly named ServerHello message, chooses the protocol version and cipher suite to use from the choices offered. Other data may also be sent, for example, a session ID if the server supports resuming a previous handshake.
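If you’re curious what a client has to offer, Python’s standard ssl module can list the cipher suites enabled in a default client configuration. This is only a sketch: the output depends on the OpenSSL build on your machine, and it approximates rather than reproduces the exact contents of a ClientHello message.

```python
import ssl

# A default client-side configuration, similar to what an HTTPS client would use
context = ssl.create_default_context()

# Each entry describes one cipher suite the local OpenSSL build has enabled
offered = [cipher["name"] for cipher in context.get_ciphers()]

print(f"Minimum protocol version: {context.minimum_version.name}")
print(f"{len(offered)} cipher suites enabled, for example:")
for name in offered[:5]:
    print(" ", name)
```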

Depending on the cipher suite chosen, the client and server exchange further information in order to establish a shared secret. Often, this process moves the exchange from asymmetric cryptography to symmetric cryptography with varying levels of complexity. Let’s explore these concepts at a general level and see why they matter to TLS.

Asymmetric beginnings

This is asymmetry:

Small egg, big egg.

Asymmetric cryptography is one method by which you can perform authentication. When you authenticate yourself, you answer the fundamental question, “How do I know you are who you say you are?”

In an asymmetric cryptographic system, you use a pair of keys in order to achieve authentication. These keys are asymmetric. One key is your public key, which, as you would guess, is public. The other is your private key, which – well, you know.

Typically, during the TLS handshake, the server will provide its public key via its digital certificate, sometimes still called its SSL certificate, though TLS replaces the deprecated Secure Sockets Layer (SSL) protocol. Digital certificates are provided and verified by trusted third parties known as Certificate Authorities (CA), which are a whole other article in themselves.

While anyone may encrypt a message using your public key, only your private key can then decrypt that message. The security of asymmetric cryptography relies only on your private key staying private, hence the asymmetry. It’s also asymmetric in the sense that it’s a one-way trip. Alice can send messages encrypted with your public key to you, but neither of your keys will help you send an encrypted message to Alice.
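To make the one-way trip concrete, here is a toy sketch of textbook RSA, one well-known asymmetric scheme. The tiny primes and the numeric message are assumptions chosen for readability; real keys are vastly larger and real systems add padding, so treat this purely as an illustration.

```python
# Toy key generation with tiny primes (illustration only; never use in practice)
p, q = 61, 53
n = p * q                  # modulus shared by both keys
phi = (p - 1) * (q - 1)    # used to derive the private exponent
e = 17                     # public exponent
d = pow(e, -1, phi)        # private exponent: modular inverse (Python 3.8+)

public_key = (n, e)        # safe to share with anyone
private_key = (n, d)       # must stay private


def apply_key(value, key):
    """Raise value to the key's exponent, modulo the key's modulus."""
    modulus, exponent = key
    return pow(value, exponent, modulus)


message = 65                                    # a message encoded as a small number
ciphertext = apply_key(message, public_key)     # anyone can do this step
recovered = apply_key(ciphertext, private_key)  # only the private key undoes it

print(ciphertext, recovered)
```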

Symmetric secrets

Asymmetric cryptography also requires more computational resources than symmetric cryptography. Thus when a TLS handshake begins with an asymmetric exchange, the client and server will use this initial communication to establish a shared secret, sometimes called a session key. This key is symmetric, meaning that both parties use the same shared secret and must maintain that secrecy for the encryption to be secure.

Wise man say: share your public key, but keep your shared keys private.

By using the initial asymmetric communication to establish a session key, the client and server can rely on the session key being known only to them. For the rest of the session, they’ll both use this same shared key to encrypt and decrypt messages, which speeds up communication.

Secure sessions

A TLS handshake may use asymmetric cryptography or other cipher suites to establish the shared session key. Once the session key is established, the handshaking portion is complete and the session begins.

The session is the duration of encrypted communication between the client and server. During this time, messages are encrypted and decrypted using the session key that only the client and server have. This ensures that communication is secure.

The integrity of exchanged information is maintained by using a checksum. Messages exchanged using session keys have a message authentication code (MAC) attached. This is not the same thing as your device’s MAC address. The MAC is generated and verified using the session key. Because of this, either party can detect if a message has been changed before being received. This solves the fundamental question, “How do I know this message from you hasn’t been tampered with?”
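Here is a minimal sketch of that idea using Python’s standard hmac module. The randomly generated session_key and the helper names are assumptions for illustration; real TLS derives its keys during the handshake and uses its own record format.

```python
import hashlib
import hmac
import secrets

# Stand-in for the session key both parties hold after the handshake
session_key = secrets.token_bytes(32)


def make_mac(key, message):
    """Compute a message authentication code over the message."""
    return hmac.new(key, message, hashlib.sha256).digest()


def is_untampered(key, message, mac):
    """Recompute the MAC and compare it in constant time."""
    return hmac.compare_digest(make_mac(key, message), mac)


message = b"Meet me at the usual place."
mac = make_mac(session_key, message)

# The receiver, holding the same key, verifies what arrived
original_ok = is_untampered(session_key, message, mac)
tampered_ok = is_untampered(session_key, b"Meet me at the other place.", mac)

print(original_ok, tampered_ok)
```

Anyone without the session key cannot produce a valid MAC for an altered message, which is how either party detects tampering.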

Sessions can end deliberately, due to network disconnection, or from the client staying idle for too long. Once a session ends, it must be re-established via a new handshake or through previously established secrets called session IDs that allow resuming a session.

TLS and you

Let’s recap:

TLS is a cryptographic protocol for providing secure communication.
The process of creating a secure connection begins with a handshake.
The handshake establishes a shared session key that is then used to secure messages and provide message integrity.
Sessions are temporary, and once ended, must be re-established or resumed.

This is just a surface-level skim of the very complex cryptographic systems that help to keep your communications secure. For more depth on the topic, I recommend exploring cipher suites and the various supported algorithms.

The TLS protocol serves a very important purpose in your everyday life. It helps to secure your emails to family, your online banking activities, and the connection by which you’re reading this article. The HTTPS communication protocol is encrypted using TLS. Every time you see that little lock icon in your URL bar, you’re experiencing firsthand all the concepts you’ve just read about in this article. Now you know the answer to the last question: “How can we communicate securely?”

Deceptively simple search-and-replace across multiple files

2020-08-25T04:48:39-06:00

While a multitude of methods exist to search for and replace words in a single file, what do you do when you’ve got a string to update across multiple unrelated files, all with different names? You harness the power of command line tools, of course!

First, you’ll need to find all the files you want to change. Stringing together what are effectively search queries for find is really only limited by your imagination. Here’s a simple example that finds Python files:

find . -name '*.py'

The -name test searches for a pattern, such as all files ending in .py, but find can do a lot more with other test conditions, including -regex tests. Run find --help to see the multitude of options.

Further tune your search by using grep to get only the files that contain the string you want to change, such as by adding:

grep -le '\<a whale\>'

The -l option gives you just the file names for all files containing a pattern (denoted with -e) that matches “a whale”.
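For comparison, the same find-then-filter idea can be sketched in Python with the standard library. The files_containing helper, the temporary demo files, and the use of \b to match “a whale” as whole words are all assumptions made for this illustration.

```python
import re
import tempfile
from pathlib import Path


def files_containing(root, pattern, glob="*.py"):
    """Return files under root matching glob whose text matches pattern."""
    regex = re.compile(pattern)
    return sorted(
        path
        for path in Path(root).rglob(glob)
        if path.is_file()
        and regex.search(path.read_text(encoding="utf-8", errors="ignore"))
    )


# Build a small throwaway directory tree to search
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "pkg").mkdir()
    (root / "pkg" / "a.py").write_text("x = 'a whale falls'\n", encoding="utf-8")
    (root / "b.py").write_text("x = 'a whaler sails'\n", encoding="utf-8")
    (root / "notes.txt").write_text("a whale\n", encoding="utf-8")

    found = [
        path.relative_to(root).as_posix()
        for path in files_containing(root, r"\ba whale\b")
    ]

print(found)
```

Only the Python file containing the whole phrase is listed; “a whaler” and the text file are skipped.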

Using Vim’s impressive :bufdo lets you run the same command across multiple buffers, interactively working with all of these files without the tedium of opening, saving, and closing each file, one at a time.

Let’s plug your powerful find+grep results into Vim with:

vim `find . -name '*.py' \
-exec grep -le '\<a whale\>' {} \;`

Using backtick-expansion to pass our search to Vim opens up multiple buffers ready to go. (Do :h backtick-expansion in Vim for more.) Now you can apply the Vim command :bufdo to all of these files and perform actions such as interactive search-and-replace:

:bufdo %s/a whale/a bowl of petunias/gce

The g for “global” will change every occurrence of the pattern on a line, not just the first, while the % before the s applies the command to all lines. The e will suppress errors if the pattern is not found. The c option makes this interactive; if you’re feeling confident, you can omit it to make the changes without reviewing each one.

If one of the patterns contains a / character, you can substitute the separator in the above command to make it more readable. Vim will assume the character following the %s is the separator, so for example:

:bufdo %s_a whale_a bowl of peonies/petunias_gce

When you’ve finished going through all the buffers, save all the work you’ve completed with:

:bufdo wq!

Then bask in the glory of your saved time and effort.

How GitHub Codespaces increases productivity and lowers barriers

2020-08-15T16:08:08-04:00

The most recent integration between Visual Studio Code and GitHub can help make development accessible and welcoming: Codespaces in GitHub!

Now in beta, GitHub Codespaces provides an online, in-the-browser IDE powered by Visual Studio Code. This lets you use this full-featured IDE, complete with extensions, terminal, Git commands, and all the settings you’re accustomed to, on any machine. You can now bring your development workflow anywhere using a tablet or other browser-based device.

Codespaces is great news for open source contributors, too. Adding a codespace configuration to your project is a great way to invite new folks to easily start contributing.

A new open source contributor or new hire at your organization can quickly fire up a codespace and get hacking on a good first issue with no local environment set up or installations necessary!

We’ve added codespace configuration settings over at the OWASP Web Security Testing Guide (WSTG). Want to take it for a spin? See our open issues.

Configuring Codespaces

You can use Visual Studio Code’s .devcontainer folder to configure a development container for your repository as well.

Many pre-built containers are available – just copy the .devcontainer you need to your repository root. If your repository doesn’t have one, a default base Linux image will be used.

Here’s a reason to remove .vscode from your .gitignore file. Any new codespaces created in your repository will now respect settings found at .vscode/settings.json. This means that your online IDE can have the same Workspace configuration as you have on your local machine. Isn’t that useful!

Making Codespaces personal

For next-level dotfiles personalization, consider committing relevant files from your local dotfiles folder as a public GitHub repository at yourusername/dotfiles.

When you create a new codespace, this brings in your configurations, such as shell aliases and preferences, by creating symlinks to dotfiles in your codespace $HOME. This personalizes all the codespaces you create in your account.

Need some inspiration? Browse my dotfiles repository on GitHub.

Developing in a codespace is a familiar experience for Visual Studio Code users, right down to running an application locally.

Thanks to port forwarding, when I run an application in a codespace terminal, clicking on the resulting localhost URL takes me to the appropriate port as output from my codespace.

When I’m working on this website in my codespace, for example, I run hugo serve then click the provided localhost:1313 link to see a preview of my changes in another browser tab.

Want to stay in sync between devices? There’s an extension for that. You can connect to your codespace from Visual Studio Code on your local machine so you can always pick up right where you left off.

Develop anywhere

Codespaces is a super exciting addition to my GitHub workflow. It allows me to access my full development process pretty much anywhere, using devices like my iPad.

It’ll also make it easier for new open source contributors or new hires at your organization to hit the ground running with a set-up IDE. If you have access to the limited beta, I invite you to spin up a codespace and try contributing to the WSTG, or to an issue on one of my open source projects.

I’m looking forward to general availability and seeing what the open source community will dream up for GitHub Codespaces next!

And yes – codespaces support your favorite Visual Studio Code theme. 😈

Screenshot of a codespace with the Kabukichō theme for Visual Studio Code

How to create a self-documenting Makefile

2020-08-05T08:55:19-04:00

My new favorite way to completely underuse a Makefile? Creating personalized, per-project repository workflow command aliases that you can check in.

Can a Makefile improve your DevOps and keep developers happy? How awesome would it be if a new developer working on your project didn’t start out by copying and pasting commands from your README? What if instead of:

pip3 install pipenv
pipenv shell --python 3.8
pipenv install --dev
npm install
pre-commit install --install-hooks
# look up how to install Framework X...
# copy and paste from README...
npm run serve

… you could just type:

make start

…and then start working?

Making a difference

I use make every day to take the tedium out of common development activities like updating programs, installing dependencies, and testing. To do all this with a Makefile (GNU make), we use Makefile rules and recipes. Similar parallels exist for POSIX flavor make, like Target Rules; here’s a great article on POSIX-compatible Makefiles.

Here are some examples of things we can make easier (sorry):

update: ## Do apt upgrade and autoremove
    sudo apt update && sudo apt upgrade -y
    sudo apt autoremove -y

env:
    pip3 install pipenv
    pipenv shell --python 3.8

install: ## Install or update dependencies
    pipenv install --dev
    npm install
    pre-commit install --install-hooks

serve: ## Run the local development server
    hugo serve --enableGitInfo --disableFastRender --environment development

initial: update env install serve ## Install tools and start development server

Now we have some command-line aliases that you can check in! Great idea! If you’re wondering what’s up with that weird ## comment syntax, it gets better.

A self-documenting Makefile

Aliases are great, if you remember what they all are and what they do without constantly typing cat Makefile. Naturally, you need a help command:

.PHONY: help
help: ## Show this help
    @egrep -h '\s##\s' $(MAKEFILE_LIST) | sort | awk 'BEGIN {FS = ":.*?## "}; {printf "\033[36m%-20s\033[0m %s\n", $$1, $$2}'

With a little command-line magic, this egrep command searches the files named in MAKEFILE_LIST for lines containing the ## pattern and sorts them. It then uses awk to split each line into the target name and its comment, and prints a helpful formatted version.
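If the one-liner is hard to read, the same logic can be sketched in Python. This is an approximation rather than a replacement: the sample MAKEFILE_TEXT and the helper names are made up for the example, the regular expression is a little stricter than the egrep pattern, and the color codes are left out.

```python
import re

# A small stand-in for a real Makefile (hypothetical contents)
MAKEFILE_TEXT = """\
update: ## Do apt upgrade and autoremove
\tsudo apt update
env:
\tpip3 install pipenv
serve: ## Run the local development server
\thugo serve
help: ## Show this help
\t@echo placeholder
"""


def list_targets(makefile_text):
    """Collect (target, description) pairs from lines with a ## comment."""
    entries = []
    for line in makefile_text.splitlines():
        match = re.match(r"^([\w-]+):.*?\s##\s(.*)$", line)
        if match:
            entries.append((match.group(1), match.group(2)))
    return sorted(entries)


def format_help(entries):
    """Pad each target name to 20 characters, like awk's %-20s."""
    return "\n".join(f"{name:<20} {description}" for name, description in entries)


entries = list_targets(MAKEFILE_TEXT)
print(format_help(entries))
```

Targets without a ## comment, like env above, are simply left out of the help text.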

We’ll put it at the top of the file so it’s the default target. Now to see all our handy shortcuts and what they do, we just run make, or make help:

help                 Show this help
initial              Install tools and start development server
install              Install or update dependencies
serve                Run the local development server
update               Do apt upgrade and autoremove

Now we have our very own personalized and project-specific CLI tool!

The possibilities for improving your DevOps flow with a self-documenting Makefile are almost endless. You can use one to simplify any workflow and produce some very happy developers.

Please enjoy the (live!) Makefile I use to manage and develop this Hugo site. I hope it inspires you!

My Hugo site Makefile

Climbing Mt. Fuji

2020-08-02T06:35:45-04:00

In 2017, I climbed Mt. Fuji, in Japan.

Mt. Fuji is, some folks would say, the cakewalk of mountain climbing. Physically, the hardest portions amount to scrambling over some big boulders; most of it is no more taxing than a hike or climbing a set of stairs. For spiritual reasons, some Japanese folks make the climb at ages upwards of 80 years. There are huts to stop at along the way where you can rent a sleeping bag inside, and buy food and water. Naturally, having done this research and deciding it sounded like a fun outing, I arrived at basecamp in sneakers.

Most of the way up was amazing and thoroughly enjoyable. I saw sights I’d never seen before, like the glow of a city under the sun through a break in the clouds, from above. Walking a path through a cloud was like taking a road into nothingness, with blank grey on all sides that weren’t a mountain. Every time we hit a station marker, I felt pride and accomplishment.

Until it was time to summit.

Most of the people who climb Mt. Fuji wish to reach the summit at sunrise. Some for spiritual reasons, others for Instagram, and for those like myself, it just seemed like the thing to do. Regardless, it was because of these other 5,000 average daily climbers that I found myself in an actual queue that snaked the entire path from the last station hut to the summit – in the pitch-black pre-dawn cold. It took hours, for most of which we stood stock-still, going nowhere. I took to doing calisthenics to stave off frostbite from the cold that threatened my sneaker-shod toes.

We did, eventually, reach the summit, and before sunrise. It remains one of the most beautiful sunrises I’ve seen – a pink-gold light that lit up the peak like breathing life into a painting, and that brought, mercifully, a degree of warmth. I was extremely happy, and felt pride and accomplishment.

Until it was time to descend.

There is a Japanese proverb: “A wise man will climb Mt Fuji once; a fool will climb Mt Fuji twice.” It is my own suspicion that this saying is based entirely on the difficulty of climbing down. The descent is essentially a loosely-packed, dirt and gravel road – on a decline. It is not, I imagine, significantly taxing with proper hiking boots, maybe snow tread, and a couple good spiked hiking poles thrown in. Wearing a pair of flat-soled street shoes, however, I fell. I fell often, and hard, about every three steps, for hours. I tried to take larger steps; it didn’t help. I tried to take smaller steps; that didn’t help, either. I tried cunningly to find a way to surf-slide my way down the mountainside and nearly ended up with a mouthful of dirt. As if literally rubbing salt into my wounds, without the gaiters I hadn’t brought, sand found its way into my shoes. It was without a doubt the most stupefyingly discouraging experience of my life.

On several occasions, more seasoned (smarter? well-prepared?) hikers passed me, a good many of them at least twice my age. I’m hard-pressed to remember another time in my life where I have been so thoroughly shown up by someone who might have been my grandmother, plunking hiking poles into the earth and sauntering past at a steady pace while I picked myself up, elbows scratched and covered in dirt, for the umpteenth time.

Eventually, we reached the bottom. At a tiny basecamp gift shop, I ate a delicious bowl of ramen and the tastiest sponge cake in the shape of a mountain that I’ll likely ever have.

The experience drove home two lessons that have gone on to serve me well: one, that all the good research in the world will not guarantee your experience; and two, that even when faced with a discouraging situation that you can’t seem to think yourself out of and thus the only way is “through,” there may still be something to learn from it, and there may be really good cake at the bottom.

The Descent Is Harder Than the Climb

2020-08-02T06:35:45-04:00

In 2017, I climbed Mt. Fuji in sneakers. This was not a deliberate choice to increase the challenge—it was the result of excellent research and poor judgment about what that research actually meant.

Everything I’d read suggested that Mt. Fuji was the “cakewalk of mountain climbing.” Physically, the hardest portions amounted to scrambling over some big boulders. Most of the climb was no more taxing than hiking or climbing stairs. Japanese folks in their eighties made the journey for spiritual reasons. There were huts along the way for rest, food, and water. Based on this research, I concluded that sneakers would be perfectly adequate.

The ascent was everything I’d been promised. I experienced sights I’d never imagined—cities glowing through breaks in clouds from above, walking through paths of grey nothingness where the trail disappeared into cloud cover. Each station marker brought genuine pride and accomplishment. Even the pre-dawn summit queue with 5,000 other climbers, standing in freezing darkness for hours, felt manageable. We reached the summit before sunrise, and it remains one of the most beautiful moments I’ve experienced.

Then came the descent. That’s where I learned that all the research in the world about reaching goals doesn’t prepare you for what comes after you achieve them.

When Success Becomes the Setup for Failure

The descent from Mt. Fuji is essentially a loosely-packed dirt and gravel road on a steep decline. With proper hiking boots and trekking poles, it’s probably manageable. In flat-soled street shoes, I fell constantly, and fell hard—every three steps, for hours. I tried to take larger steps; it didn’t help. I tried to take smaller steps; that didn’t help, either. I tried cunningly to find a way to surf-slide my way down the mountainside and nearly ended up with a mouthful of dirt. As if literally rubbing salt into my wounds, without the gaiters I hadn’t brought, sand found its way into my shoes. It was without a doubt the most stupefyingly discouraging experience of my life.

As I picked myself up repeatedly, covered in dirt with scratched elbows, seasoned hikers passed me with ease. Many of them could have been my grandparents, using proper equipment and technique to descend at a steady pace while I struggled and stopped to pour tiny rocks out of my sneakers. The contrast was humbling and instructive.

This experience taught me something crucial about leadership that I’ve applied countless times since: the skills and preparation that get you to success are often different from the skills required to maintain or scale that success. The descent is frequently harder than the climb, and most people don’t prepare for it adequately.

The Post-Achievement Challenge

In business and team leadership, I’ve watched this pattern repeat consistently. The energy, skills, and resources required to achieve a goal are usually well-understood and planned for. But the challenges that come after success—maintaining market position, scaling team culture, or managing the operational complexity of growth—often catch leaders unprepared.

I’ve seen teams that executed brilliant product launches struggle with customer support and maintenance. Startups that successfully raised funding stumble when it comes to executing on their promises to investors. Engineering teams that built innovative solutions fail to create sustainable systems for maintaining and scaling those solutions. The problem isn’t lack of capability—it’s that the descent requires different preparation and different skills than the ascent. What gets you to the summit (innovation, speed, breakthrough thinking) often isn’t what gets you safely back to basecamp (consistency, processes, systematic execution).

Learning from Those Who’ve Made the Journey

Watching those experienced hikers pass me on Mt. Fuji was initially frustrating, but it became one of the most valuable parts of the experience. They had proper equipment, understood the terrain, and moved with confidence that came from experience. Most importantly, they had prepared specifically for the descent, not just the climb.

In leadership roles, I’ve learned to actively seek out people who’ve successfully navigated the “descent” phase of challenges I’m facing. Entrepreneurs who’ve managed hypergrowth. Product managers who’ve maintained market leadership over multiple years. Engineering leaders who’ve scaled teams from ten to fifty people, or CEOs who’ve scaled companies from fifty to five hundred.

These conversations can reveal patterns you may not have discovered on your own. Successful scaling requires different organizational structures than startup growth. Maintaining team culture during rapid hiring requires intentional systems that don’t emerge naturally. Sustaining innovation while managing operational complexity demands new kinds of leadership skills.

People who’ve successfully managed the descent often have hard-won wisdom about preparation and technique that isn’t captured in most “how to reach the summit” advice.

Building Skills Before You Need Them

The most effective leaders I know prepare for post-success challenges while they’re still climbing toward their initial goals. They think systematically about what will be required to maintain and scale whatever they’re building, not just achieve it.

This means building operational capabilities alongside product capabilities. Developing team management skills in individual contributors. Creating sustainable processes while you’re still in startup mode. Planning for the maintenance and evolution of systems as part of their initial implementation.

It also means recognizing that the mindset and skills that drive breakthrough achievements—risk-taking, speed, creative problem-solving—need to be balanced with different capabilities like consistency, systematic thinking, and process optimization. I’ve learned to explicitly ask: “What will success look like, and what challenges will that create?” This question reveals preparation gaps that aren’t obvious when you’re focused entirely on reaching your goals.

When You Find Yourself Unprepared

Despite best intentions, you’ll sometimes find yourself in descent mode without proper preparation—leading a team through unexpected growth, managing a product that succeeded beyond projections, or scaling systems that weren’t designed for current loads. The Mt. Fuji experience taught me how to navigate these situations effectively.

First, acknowledge the reality of your situation without wasting energy on regret about preparation gaps. You can’t change what you didn’t know or plan for previously, but you can adapt your approach based on current conditions. Take the time to solidify new goals in writing, then evaluate whether your efforts are serving them effectively.

Second, focus on learning from people who are managing similar challenges successfully. This isn’t the time for pride or trying to figure everything out independently. The hikers who passed me weren’t showing off—they had practical knowledge that could help. Conversations you have with others who came before you can save you from a lot of stumbles.

Third, lift your gaze. While the ascent phase requires day-to-day tactical thinking, the descent phase requires a strategic longer-term outlook. Implementing systems and culture that support continued success will require patience, persistence, and often a completely different pace than what got you to the summit. Expecting it to be as expedient as the climb leads to frustration and poor decision-making.

Finding Meaning in the Difficult Parts

Eventually, I reached the bottom of Mt. Fuji, exhausted and humbled but intact. At a tiny basecamp shop, I ate the most delicious bowl of ramen and the tastiest mountain-shaped sponge cake I’ll likely ever have.

Even when you’re unprepared and struggling, there’s value in the journey itself. The descent taught me lessons about preparation, humility, and persistence that I’ve applied to all sorts of challenges for years since.

Preparing for Your Next Descent

There is a Japanese proverb: “A wise man will climb Mt Fuji once; a fool will climb Mt Fuji twice.” I suspect this wisdom is based entirely on the difficulty of the descent. But in leadership, you don’t get to choose how many times you’ll face descent challenges—they’re inevitable parts of any significant journey.

The key is recognizing that achieving your goals is often just the beginning of a different kind of challenge. Success creates new problems that require different skills, different preparation, and different mindsets than what got you there initially.

Whether you’re building teams, scaling products, or managing organizational growth, prepare for the descent while you’re planning the climb. Study what happens after success. Learn from people who’ve navigated similar transitions. Build operational capabilities alongside innovative ones.

Most importantly, remember that the descent is still part of the journey, not a failure of the ascent. The challenges that come with success are signs that you’ve accomplished something meaningful. Navigate them with patience, preparation, and the understanding that getting back to basecamp safely can be an even more important achievement than reaching the summit.

Go automate your GitHub profile README

2020-07-25T10:51:15-04:00

GitHub’s new profile page README feature is having the wonderful effect of bringing some personality to the Myspace pages of the developer Internet. Though Markdown lends itself best to standard static text content, that’s not stopping creative folks from working to create a next-level README. You can include GIFs and images to add some motion and pizazz (they’re covered in GitHub Flavored Markdown), but I’m thinking of something a little more dynamic.

Front and center on your GitHub profile, your README is a great opportunity to let folks know what you’re about, what you find important, and to showcase some highlights of your work. You might like to show off your latest repositories, tweet, or blog post. Keeping it up to date doesn’t have to be a pain either, thanks to continuous delivery tools like GitHub Actions.

My current README refreshes itself daily with a link to my latest blog post. Here’s how I’m creating a self-updating README.md with Go and GitHub Actions.

Reading and writing files with Go

I’ve been writing a lot of Python lately, but for some things I really like using Go. You could say it’s my go-to language for just-for-func projects. Sorry. Couldn’t stop myself.

To create my README.md, I’m going to get some static content from an existing file, mash it together with some new dynamic content that we’ll generate with Go, then bake the whole thing at 400 degrees until something awesome comes out.

Here’s how we read in a file called static.md and put it in string form:

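// Unwrap Markdown content
// (os.ReadFile replaces the deprecated ioutil.ReadFile as of Go 1.16)
content, err := os.ReadFile("static.md")
if err != nil {
    // Return the error to the caller; log.Fatalf would exit before any return ran
    return fmt.Errorf("cannot read file: %w", err)
}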

// Make it a string
stringyContent := string(content)

The possibilities for your dynamic content are only limited by your imagination! Here, I’ll use the github.com/mmcdole/gofeed package to read the RSS feed from my blog and get the newest post.

fp := gofeed.NewParser()
feed, err := fp.ParseURL("https://victoria.dev/index.xml")
if err != nil {
    log.Fatalf("error getting feed: %v", err)
}
// Get the freshest item
rssItem := feed.Items[0]

To join these bits together and produce stringy goodness, we use fmt.Sprintf() to create a formatted string.

// Whisk together static and dynamic content until stiff peaks form
blog := "Read my latest blog post: **[" + rssItem.Title + "](" + rssItem.Link + ")**"
data := fmt.Sprintf("%s\n%s\n", stringyContent, blog)

Then to create a new file from this mix, we use os.Create(). There are more things to know about deferring file.Close(), but we don’t need to get into those details here. We’ll add file.Sync() to ensure our README gets written.

// Prepare file with a light coating of os
file, err := os.Create("README.md")
if err != nil {
    return err
}
defer file.Close()

// Bake at n bytes per second until golden brown
_, err = io.WriteString(file, data)
if err != nil {
    return err
}
return file.Sync()

View the full code here in my README repository.

Mmmm, doesn’t that smell good? 🍪 Let’s make this happen on the daily with a GitHub Action.

Running your Go program on a schedule with Actions

You can create a GitHub Actions workflow that triggers both on a push to your master branch and on a daily schedule. Here’s a slice of the .github/workflows/update.yaml that defines this; the cron expression runs the workflow every day at 11:00 UTC:

on:
  push:
    branches:
      - master
  schedule:
    - cron: '0 11 * * *'

To run the Go program that rebuilds our README, we first need a copy of our files. We use actions/checkout for that:

steps:
    - name: 🍽️ Get working copy
      uses: actions/checkout@master
      with:
        fetch-depth: 1

This step runs our Go program:

- name: 🍳 Shake & bake README
  run: |
    cd ${GITHUB_WORKSPACE}/update/
    go run main.go

Finally, we push the updated files back to our repository. Learn more about the variables shown at Using variables and secrets in a workflow.

- name: 🚀 Deploy
  run: |
    git config user.name "${GITHUB_ACTOR}"
    git config user.email "${GITHUB_ACTOR}@users.noreply.github.com"
    git add .
    git commit -am "Update dynamic content"
    git push --all -f https://${{ secrets.GITHUB_TOKEN }}@github.com/${GITHUB_REPOSITORY}.git

View the full code for this Action workflow here in my README repository.

Go forth and auto-update your README

Congratulations and welcome to the cool kids’ club! You now know how to build an auto-updating GitHub profile README. You may now go forth and add all sorts of neat dynamic elements to your page – just go easy on the GIFs, okay?

Writing efficient Django

2020-07-09T04:02:47-05:00

I like Django. It’s a well-considered and intuitive framework with a name I can pronounce out loud. You can use it to quickly spin up a weekend-sized project, and you can still use it to run full-blown production applications at scale. I’ve done both these things, and over the years I’ve discovered how to use some of Django’s features for maximum efficiency. These are:

Class-based versus function-based views
Django models
Retrieving objects with queries

These main features are the building blocks for maximizing development efficiency with Django. Understanding them builds the foundation for you to test efficiently and create an awesome development experience for your engineers. Let’s look at how these tools let you create a performant Django application that’s pleasant to build and maintain.

Class-based versus function-based views

Remember that Django is all Python under the hood. When it comes to views, you’ve got two choices: view functions (sometimes called “function-based views”), or class-based views.

Years ago when I first built ApplyByAPI, it was initially composed entirely of function-based views. These offer granular control, and are good for implementing complex logic; just as in a Python function, you have complete control (for better or worse) over what the view does. With great control comes great responsibility, and function-based views can be a little tedious to use. You’re responsible for writing all the necessary methods for the view to work - this is what allows you to completely tailor your application.

In the case of ApplyByAPI, there were only a sparse few places where that level of tailored functionality was really necessary. Everywhere else, function-based views began making my life harder. Writing what is essentially a custom view for run-of-the-mill operations like displaying data on a list page became tedious, repetitive, and error-prone.

With function-based views, you’ll need to figure out which Django methods to implement in order to handle requests and pass data to views. Unit testing can take some work to write. In short, the granular control that function-based views offer also requires some granular tedium to properly implement.

I ended up holding back ApplyByAPI while I refactored the majority of the views into class-based views. This was not a small amount of work and refactoring, but when it was done, I had a bunch of tiny views that made a huge difference. I mean, just look at this one:

class ApplicationsList(ListView):
    model = Application
    template_name = "applications.html"

It’s three lines. My developer ergonomics, and my life, got a lot easier.

You may think of class-based views as templates that cover most of the functionality any app needs. There are views for displaying lists of things, for viewing a thing in detail, and editing views for performing CRUD (Create, Read, Update, Delete) operations. Because implementing one of these generic views takes only a few lines of code, my application logic became dramatically succinct. This gave me less repeated code, fewer places for something to go wrong, and a more manageable application in general.

Class-based views are fast to implement and use. The built-in class-based generic views may require less work to test, since you don’t need to write tests for the base view Django provides. (Django does its own tests for that; no need for your app to double-check.) To tweak a generic view to your needs, you can subclass a generic view and override attributes or methods. In my case, since I only needed to write tests for any customizations I added, my test files became dramatically shorter, as did the time and resources it took to run them.

When you’re weighing the choice between function-based or class-based views, consider the amount of customization the view needs, and the future work that will be necessary to test and maintain it. If the logic is common, you may be able to hit the ground running with a generic class-based view. If you need sufficient granularity that re-writing a base view’s methods would make it overly complicated, consider a function-based view instead.

Django models

Models organize your Django application’s central concepts to help make them flexible, robust, and easy to work with. If used wisely, models are a powerful way to collate your data into a definitive source of truth.

As with views, Django provides some built-in model types for the convenience of implementing basic authentication, including the User and Permission models. For everything else, you can create a model that reflects your concept by inheriting from a parent Model class.

class StaffMember(models.Model):
    user = models.OneToOneField(User, on_delete=models.CASCADE)
    company = models.OneToOneField(Company, on_delete=models.CASCADE)

    def __str__(self):
        return self.company.name + " - " + self.user.email

When you create a custom model in Django, you subclass Django’s Model class and take advantage of all its power. Each model you create generally maps to a database table. Each attribute is a database field. This gives you the ability to create objects to work with that humans can better understand.

You can make a model useful to you by defining its fields. Many built-in field types are conveniently provided. These help Django figure out the data type, the HTML widget to use when rendering a form, and even form validation requirements. If you need to, you can write custom model fields.

Database relationships can be defined using a ForeignKey field (many-to-one), or a ManyToManyField (I’ll give you three guesses). If those don’t suffice, there’s also a OneToOneField. Together, these allow you to define relations between your models with levels of complexity limited only by your imagination. (Depending on the imagination you have, this may or may not be an advantage.)

Retrieving objects with queries

Use your model’s Manager (objects by default) to construct a QuerySet. This is a representation of objects in your database that you can refine, using methods, to retrieve specific subsets. All available methods are in the QuerySet API and can be chained together for even more fun.

Post.objects.filter(
    type="new"
).exclude(
    title__startswith="Blockchain"
)

Some methods return new QuerySets, such as filter(), or exclude(). Chaining these can give you powerful queries without affecting performance, as QuerySets aren’t fetched from the database until they are evaluated. Operations that trigger a database query include get() and count(), as well as calling len(), list(), or bool() on a QuerySet.

Iterating over a QuerySet also evaluates it, so avoid doing so where possible to improve query performance. For instance, if you just want to know if an object is present, you can use exists() to avoid iterating over database objects.
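The lazy-evaluation idea can be sketched in plain Python, without Django installed. LazyQuery below is a hypothetical stand-in written for this illustration, not Django’s QuerySet; it takes predicate functions rather than keyword lookups, and its fetch_log exists only so you can see when data is actually read.

```python
class LazyQuery:
    """Hypothetical stand-in for a QuerySet; not Django's implementation."""

    def __init__(self, rows, predicates=(), fetch_log=None):
        self._rows = rows
        self._predicates = tuple(predicates)
        # Shared across chained copies so we can see when a "fetch" happens.
        self.fetch_log = [] if fetch_log is None else fetch_log

    def filter(self, predicate):
        # Return a new query with one more condition; no data is read yet.
        return LazyQuery(self._rows, self._predicates + (predicate,), self.fetch_log)

    def exclude(self, predicate):
        return self.filter(lambda row: not predicate(row))

    def __iter__(self):
        # Iteration is the point of evaluation: only now is data read.
        self.fetch_log.append("fetch")
        return (
            row for row in self._rows if all(check(row) for check in self._predicates)
        )

    def exists(self):
        # Stop at the first match instead of walking every row.
        return any(True for _ in self)


posts = [
    {"type": "new", "title": "Blockchain basics"},
    {"type": "new", "title": "Makefile tricks"},
    {"type": "old", "title": "Go recipes"},
]

query = LazyQuery(posts).filter(lambda p: p["type"] == "new").exclude(
    lambda p: p["title"].startswith("Blockchain")
)
print(query.fetch_log)  # still empty: chaining alone reads nothing
titles = [p["title"] for p in query]
print(titles, query.fetch_log)
```

Chaining filter() and exclude() only stacks up conditions; the single read happens when the list comprehension iterates. Django’s real QuerySet does far more (it builds SQL and caches results), but the shape of the behavior is the same.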

Use get() in cases where you want to retrieve a specific object. This method raises MultipleObjectsReturned if more than one object matches, as well as the DoesNotExist exception, if, take a guess.

If you’d like to get an object that may not exist in the context of a user’s request, use the convenient get_object_or_404() or get_list_or_404(), which raise Http404 instead of DoesNotExist (or, for get_list_or_404(), instead of returning an empty list). These helpful shortcuts are suited to just this purpose. To create an object that doesn’t exist, there’s also the convenient get_or_create().

Efficient essentials

You’ve now got a handle on these three essential tools for building your efficient Django application – congratulations! You can make Django work even better for you by learning about manipulating data with migrations, testing effectively, and setting up your team’s Django development for maximum happiness.

If you’re going to build on GitHub, you may like to set up my django-security-check GitHub Action. In the meantime, you’re well on your way to building a beautiful software project.

Look mom, I'm a GitHub Action Hero

2020-06-27T09:06:33-04:00

GitHub recently interviewed me for their blog editorial entitled GitHub Action Hero: Victoria Drake. Here’s a behind-the-scenes peek at the original interview questions and my answers.

What is the name of your Action? Please include a link too.

Among the several Actions I’ve built, I have two current favorites. One is hugo-remote, which lets you continuously deploy a Hugo static site from a private source repository to a public GitHub Pages repository. This keeps the contents of the source repository private, such as your unreleased drafts, while still allowing you to have a public open source site using GitHub Pages.

The second is django-security-check. It’s an effortless way to continuously check that your production Django application is free from a variety of security misconfigurations. You can think of it as your little CI/CD helper for busy projects – a security linter!

Tell us a little bit more about yourself—how did you get started in software tools?

When I was a kid, I spent several summer vacations coding a huge medieval fantasy world MUD (Multi-User Dungeon, like a multiplayer role-playing game) written in LPC, with friends. It was entirely text-based, and built and played via Telnet. I fell in love with the terminal and learned a lot about object-oriented programming and prototype-based programming early on.

I became a freelance developer and had the privilege of working on a wide variety of client projects. Realizing the difficulty that companies have with hiring experienced developers, I built ApplyByAPI.com to help. As you might imagine, it allows candidates to apply for jobs via API, instead of emailing a resume. It’s based on the Django framework, so in the process, I learned even more about building reusable units of software.

When I became a co-author and a core maintainer for the Open Web Application Security Project (OWASP) Web Security Testing Guide (WSTG), I gained an even broader appreciation for how a prototype-based, repeatable approach can help build secure web applications. Organizations worldwide consider the WSTG the foremost open source resource for testing the security of web applications. We’ve applied this thinking via the use of GitHub Actions in our repository – I’ll tell you more about that later.

Whether I’m creating an open source tool or leading a development team, my childhood experience still informs how I think about programming today. I strive to create repeatable units of software like GitHub Actions – only now, I make them for large enterprises in the real world!

What is the story behind your built GitHub Action? (Why did you build this?)

Developers take on a lot of responsibility when it comes to building secure applications these days. I’m a full-time senior software developer at a cybersecurity company. I’ve found that I’m maximally productive when I create systems and processes that help myself and my team make desired outcomes inevitable. So I spend my free time building tools that make it easy for other developers to build secure software as well. My Actions help to automate contained, repeatable units of work that can make a big difference in a developer’s day.

Do you have future plans for this or other Actions?

Yes! I’m always finding ways for tools like GitHub Actions to boost the velocity of technical teams, whether at work or in my open source projects. Remember the Open Web Application Security Project? In the work I’ve led with OWASP, I’ve championed the effort to increase automation using GitHub Actions to maintain quality, securely deploy new versions to the web, and even build PDFs of the WSTG. We’re constantly looking into new ways that GitHub Actions can make our lives easier and our readers’ projects more secure.

What has been your favorite feature of GitHub Actions?

I like that I can build an Action using familiar and portable technologies, like Docker. Actions are easy for collaborators to work on too, since in the case of a Dockerized Action, you can use any language your team is comfortable with. This is especially useful in large organizations with polyglot teams and environments. There aren’t any complicated dependencies for running these portable tasks, and you don’t need to learn any special frameworks to get started.

One of my first blog posts about GitHub Actions even describes how I used an Action to run a Makefile! This is especially useful for large legacy applications that want to modernize their pipeline by using GitHub Actions.

What are the biggest challenges you’ve faced while building your GitHub Action?

The largest challenge of GitHub Actions isn’t really in GitHub Actions, but in the transition of legacy software and company culture.

Migrating legacy software is always challenging, particularly with large legacy applications. Moving to modern CI/CD processes requires changes at the software level, team level, and even a shift in thinking when it comes to individual developers. It can help to have a tool like GitHub Actions, which is at once seamlessly modern and familiar, when transitioning legacy code to a modern pipeline.

I’m happiest when I’m solving a challenge that makes developing secure software less challenging in the future, both for myself and for the technology organization I’m leading. With tools like GitHub Actions, a lot of mental overhead can be offloaded to automatic processes – like getting a whole other brain, for free! This can massively help organizations that are ready to scale up their development output.

In the realm of cybersecurity, not only does creating portable and reusable software make developers’ lives easier, it helps to make whole workflows repeatable, which in turn makes software development processes more secure. With smart processes in place, technical teams are happier. As an inevitable result, they’ll build better software for customers, too.

Technical ergonomics for the efficient developer

2020-06-22T06:33:28-04:00

This article isn’t going to tell you about saving your neck with a Roost stand, or your wrists with a split keyboard - I’ve already done that. This article is about saving your brain.

When I first began to program full time, I found myself constantly tired from the mental exertion. Programming is hard! Thankfully, you can take some solace in knowing it gets easier with practice, and with a great supporting cast. Some very nice folks who preceded us both came up with tools to make the difficult bits of communicating with computers much easier on our poor human meatbrains.

I invite you to explore these super helpful technical tools. They’ll improve your development setup and alleviate much of the mental stress of programming. You soon won’t believe you could have done without them.

Not your average syntax highlighting

If you’re still working with syntax highlighting that just picks out variable and class names for you, that’s cute. Time to turn it up a notch.

In all seriousness, syntax highlighting can make it much easier to find what you’re looking for on your screen: the current line, where your current code block starts and ends, or the absolutely game-changing which-bracket-set-am-I-in highlight. I primarily use Visual Studio Code, but similar extensions can be found for the major text editors.

The theme pictured in Visual Studio Code above is Kabukichō. I made it.

Use Git hooks

I previously brought you an interactive pre-commit checklist in the style of infomercials that’s both fun and useful for reinforcing the quality of your commits. But that’s not all!

Git hooks are scripts that run automatically at pre-determined points in your workflow. Use them well, and you can save a ton of brainpower. A pre-commit hook remembers to do things like lint and format code, and even runs local tests for you before you indelibly push something embarrassing. Hooks can be a little annoying to share (the .git/hooks directory isn’t tracked and thus omitted when you clone or fork a repository) but there’s a framework for that: the confusingly-named pre-commit framework, which allows you to create a sharable configuration file of Git hook plugins, not just for pre-commit.

I spend a majority of my time these days coding in Python, so here is my current .pre-commit-config.yaml:

fail_fast: true
repos:
  - repo: https://github.com/DavidAnson/markdownlint-cli2
    rev: v0.1.3
    hooks:
    - id: markdownlint-cli2
      name: markdownlint-cli2
      description: "Checks the style of Markdown/CommonMark files."
      entry: markdownlint-cli2
      language: node
      types: [markdown]
      minimum_pre_commit_version: 0.15.0

There are tons of supported hooks to explore.
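To give a sense of what a hook actually does, here is a minimal sketch of a check written in Python. The function names and the trailing-whitespace rule are my own illustrative assumptions; they are not part of Git or the pre-commit framework.

```python
from pathlib import Path


def find_trailing_whitespace(text: str) -> list[int]:
    """Return the 1-based numbers of lines that end in spaces or tabs."""
    return [
        number
        for number, line in enumerate(text.splitlines(), start=1)
        if line != line.rstrip(" \t")
    ]


def check_files(paths: list[str]) -> int:
    """Print each offending line and return an exit status (0 means pass)."""
    status = 0
    for path in paths:
        try:
            text = Path(path).read_text(encoding="utf-8")
        except (UnicodeDecodeError, OSError):
            continue  # skip binary, unreadable, or deleted files
        for number in find_trailing_whitespace(text):
            print(f"{path}:{number}: trailing whitespace")
            status = 1
    return status
```

To wire this up as a real hook, you would feed check_files the file names from git diff --cached --name-only and hand its return value to sys.exit. Git aborts the commit when a pre-commit hook exits with a non-zero status.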

Use a type system

If you write in languages like Python and JavaScript, get yourself an early birthday present and start using a static type system. Not only will this help improve the way you think about code, it can help make type errors clear before running a single line.

For Python, I like using mypy for static type checking. You can set it up as a pre-commit hook (see above) and it’s supported in Visual Studio Code too.
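Here is a small sketch of the kind of mistake a static type checker can catch before the code ever runs. The function names and the sample data are hypothetical, made up for illustration.

```python
from typing import Optional


def find_user_email(users: dict[str, str], name: str) -> Optional[str]:
    """Return the email for name, or None if the user is unknown."""
    return users.get(name)


def email_domain(email: str) -> str:
    """Return the part of an email address after the @ sign."""
    return email.split("@", 1)[1]


users = {"ada": "ada@example.com"}

# mypy would flag the next line, because find_user_email may return None
# while email_domain expects a str:
# domain = email_domain(find_user_email(users, "grace"))

# Handling the None case explicitly satisfies the type checker.
email = find_user_email(users, "ada")
domain = email_domain(email) if email is not None else "unknown"
```

Without the type hints, the commented-out call would only fail at runtime, and only when an unknown user happened to come along.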

TypeScript is my preferred way to write JavaScript. You can run the compiler on the command line using Node.js (see instructions in the repo), it works pretty well with Visual Studio Code out of the box, and of course there are multiple options for extension integrations.

Quit unnecessarily beating up your meatbrain

I mean, you wouldn’t stand on your head all day to do your work. It would be rather inconvenient to read things upside down all the time (at least until your brain adjusted), and in any case you’d likely get uncomfortably congested in short order. Working without taking advantage of the technical ergonomic tools I’ve given you today is a little like unnecessary inversion - why would you, if you don’t have to?

How to choose and care for a secure open source project

2020-05-25T05:53:09-04:00

There is a rather progressive sect of the software development world that believes that most people would be a lot happier and get a lot more work done if they just stopped building things that someone else has already built and is offering up for free use. They’re called the open source community. They want you to take their stuff.

Besides existing without you having to lift a finger, open source tools and software have some distinct advantages. Especially in the case of well-established projects, it’s highly likely that someone else has already worked out all the most annoying bugs for you. Thanks to the ease with which users can view and modify source code, it’s also more likely that a program has been tinkered with, improved, and secured over time. When many developers contribute, they bring their own unique expertise and experiences. This can result in a product far more robust and capable than one a single developer can produce.

Of course, being as varied as the people who build them, not all open source projects are created equal, nor maintained to be equally secure. There are many factors that affect a project’s suitability for your use case. Here are a few general considerations that make a good starting point when choosing an open source project.

How to choose an open source project

As its most basic requirements, a good software project is reliable, easy to understand, and has up-to-date components and security. There are several indicators that can help you make an educated guess about whether an open source project satisfies these criteria.

Who’s using it

Taken in context, the number of people already using an open source project may be indicative of how good it is. If a project has a hundred users, for instance, it stands to reason that someone has tried to use it at least a hundred times before you found it. Thus by the ancient customs of “I don’t know what’s in that cave, you go first,” it’s more likely to be fine.

You can draw conclusions about a project’s user base by looking at available statistics. Depending on your platform, these may include the number of downloads, reviews, issues or tickets, comments, contributions, forks, or “stars,” whatever those are.

Evaluate social statistics on platforms like GitHub with a grain of salt. They can help you determine how popular a project may be, but only in the same way that restaurant review apps can help you figure out if you should eat at Foo’s Grill & Bar. Depending on where Foo’s Grill & Bar is, when it opened, and how likely people are to be near it when the invariable steak craving should call, having twenty-six reviews may be a good sign or a terrible one. While you would not expect a project that addresses a very obscure use case or technology to have hundreds of users, having a few active users is, in such a case, just as confidence-inspiring.

External validation can also be useful. For example, packages that are included in a Linux operating system distribution (distro) must conform to stringent standards and undergo vetting. Choosing software that is included in a distro’s default repositories can mean it’s more likely to be secure.

Perhaps one of the best indications to look for is whether a project’s development team is using their own project. Look for issues, discussions, or blog posts that show that the project’s creators and maintainers are using what they’ve built themselves. Commonly referred to as “eating your own dog food,” or “dogfooding,” it’s an indicator that the project is most likely to be well-maintained by its developers.

Who’s building it

The main enemy of good open source software is usually a lack of interest. The parties involved in an open source project can make the difference between a flash-in-the-pan library and a respected long-term utility. Multiple committed maintainers, even making contributions in their spare time, have a much higher success rate of sustaining a project and generating interest.

Projects with healthy interest are usually supported by, and in turn cultivate, a community of contributors and users. New contributors may be actively welcomed, clear guides are available explaining how to help, and project maintainers are available and approachable when people have inevitable questions. Some communities even have chat rooms or forums where people can interact outside of contributions. Active communities help sustain project interest, relevance, and its ensuing quality.

In a less organic fashion, a project can also be sustained through organizations that sponsor it. Governments and companies with financial interest are open source patrons too, and a project that enjoys public sector use or financial backing has added incentive to remain relevant and useful.

How alive is it

The recency and frequency of an open source project’s activity is perhaps the best indicator of how much attention is likely paid to its security. Look at releases, commit history, changelogs, or documentation revisions to determine if a project is active. As projects vary in size and scope, here are some general things to look for.

Maintaining security is an ongoing endeavor that requires regular monitoring and updates, especially for projects with third-party components. These may be libraries or any part of the project that relies on something outside itself, such as a payment gateway integration. An inactive project is more likely to have outdated code or use outdated versions of components. For a more concrete determination, you can research a project’s third-party components and compare their most recent patches or updates with the project’s last updates.

Projects without third-party components may have no outside updates to apply. In these cases, you can use recent activity and release notes to determine how committed a project’s maintainers may be. Generally, active projects should show updates within the last few months, with a notable release within the last year. This can be a good indication of whether the project is using an up-to-date version of its language or framework.
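If you like your rules of thumb written down, the two checks above could be sketched roughly like this. The function names, the sample component names, and the 90-day window for recent updates are all assumptions of mine; adjust them to the size and scope of the project you are evaluating.

```python
from datetime import date, timedelta


def stale_components(project_updated: date,
                     component_releases: dict[str, date]) -> list[str]:
    """Name components that shipped a release after the project's last update."""
    return sorted(
        name
        for name, released in component_releases.items()
        if released > project_updated
    )


def looks_active(last_commit: date, last_release: date, today: date) -> bool:
    """Rule of thumb: recent updates, plus a release within the last year."""
    recent_commit = today - last_commit <= timedelta(days=90)  # assumed window
    recent_release = today - last_release <= timedelta(days=365)
    return recent_commit and recent_release


# Hypothetical dates for illustration only.
flagged = stale_components(
    date(2020, 1, 15),
    {"libfoo": date(2020, 3, 1), "libbar": date(2019, 12, 1)},
)
```

A flagged component is not proof of a problem, since the project may already accept newer versions. It is only a hint about where to look more closely.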

You can also judge how active a project may be by looking at the project maintainers themselves. Active maintainers quickly respond to feedback or new issues, even if it’s just to say, “We’re on it.” If the project has a community, its maintainers are a part of it. They may have a dedicated website or write regular blogs. They may offer ways to contact them directly and privately, especially to raise security concerns.

Can you understand it

Having documentation is a baseline requirement for a project that’s intended for anyone but its creator to use. Good open source projects have documentation that is easy to follow, honest, and thorough.

Having well-written documentation is one way a project can stand out and demonstrate the thoughtfulness and dedication of its maintainers. A “Getting Started” section may detail all the requirements and initial setup for running the project. An accurate list of topics in the documentation enables users to quickly find the information they need. A clear license statement leaves no doubt as to how the project can be used, and for what purposes. These are characteristic aspects of documentation that serves its users.

A project that is following sound coding practices likely has code that is as readable as its documentation. Code that is easy to read lends itself to being understood. Generally, it has clearly defined and appropriately-named functions and variables, a logical flow, and apparent purpose. Readable code is easier to fix, secure, and build upon.

How compatible is it

A few factors will determine how compatible a project is with your goals. These are objective qualities, and can be determined by looking at a project’s repository files. They include:

Code language
Specific technologies or frameworks
License compatibility

Compatibility doesn’t necessarily mean a direct match. Different code languages can interact with each other, as can various technologies and frameworks. You should carefully read a project’s license to understand if it permits usage for your goal, or if it is compatible with a license you would like to use.

Ultimately, a project that satisfies all these criteria may still not quite suit your use case. Part of the beauty of open source software, however, is that you may still benefit from it by making alterations that better suit your usage. If those alterations make the project better for everyone, you can pay it back and pay it forward by contributing your work to the project.

Proper care and feeding of an open source project

Once you adopt an open source project, a little attention is required to make sure it continues to be a boon to your goals. While its maintainers will look after the upstream project files, you alone are responsible for your own copy. Like all software, your open source project must be well-maintained in order to remain as secure and useful as possible.

Have a system that provides you with notifications when updates for your software are made available. Update software promptly, treating each patch as if it were vital to security; it may well be. Keep in mind that open source project creators and maintainers are, in most cases, acting only out of the goodness of their own hearts. If you’ve got a particularly awesome one, its developers may make updates and security patches available on a regular basis. It’s up to you to keep tabs on updates and promptly apply them.

As with most things in software, keeping your open source additions modular can come in handy. You might use git submodules, branches, or environments to isolate your additions. This can make it easier to apply updates or pinpoint the source of any bugs that arise.

So although an open source project may cost no money, caveat emptor, which means, “Jimmy, if we get you a puppy, it’s your responsibility to take care of it.”

If you want to build a treehouse, start at the bottom

2020-05-11T05:46:47-04:00

If you’ve ever watched a kid draw a treehouse, you have some idea of how applications are built when security isn’t made a priority. It’s far more fun to draw the tire swing, front porch, and swimming pool than to worry about how a ten-thousand-gallon bucket of water stays suspended in midair. With too much attention spent on fun and flashy features, foundations suffer.

Of course, spending undue hours building a back end like Fort Knox may not be necessary for your application, either. Being an advocate for security doesn’t mean always wearing your tinfoil hat (although you do look dashing in it) but does mean building in an appropriate amount of security.

How much security is appropriate? The answer, frustratingly, is, “it depends.” The right amount of security for your application depends on who’s using it, what it does, and most importantly, what undesirable things it could be made to do. It takes some analysis to make decisions about the kinds of risks your application faces and how you’ll prepare to handle them. Okay, now’s a good time to don your tinfoil hat. Let’s imagine the worst.

Threat modeling: what’s the worst that could happen

A threat model is a stuffy term for the result of trying to imagine the worst things that could happen to an application. Using your imagination to assess risks (fittingly called risk assessment) is a conveniently non-destructive method for finding ways an application can be attacked. You won’t need any tools; just an understanding of how the application might work, and a little imagination. You’ll want to record your results with pen and paper. For the younger folks, that means the notes app on your phone.

A few different methodologies for application risk assessment can be found in the software world, including the in-depth NIST Special Publication 800-30. Each method’s framework has specific steps and output, and will go into various levels of detail when it comes to defining threats. If following a framework, first choose the one you’re most likely to complete. You can always add more depth and detail from there.

Even informal risk assessments are beneficial. Typically taking the form of a set of questions, they may be oriented around possible threats, the impact to assets, or ways a vulnerability could be exploited. Here are some examples of questions addressing each orientation:

What kind of adversary would want to break my app? What would they be after?
If the control of x fell into the wrong hands, what could an attacker do with it?
Where could a x vulnerability occur in my app?

A basic threat model explains the technical, business, and human considerations for each risk. It will typically detail:

The vulnerabilities or components that can cause the risk
The impact that a successful execution of the risk would have on the application
The consequences for the application’s users or organization

The result of a risk assessment exercise is your threat model; in other words, a list of things you would very much like not to occur. It is usually sorted in a hierarchy of risks, from the worst to the mildest. The worst risks have the most negative impact, and are most important to protect against. The mildest risks are the most acceptable - while still an undesirable outcome, they have the least negative impact on the application and users.

You can use this resulting hierarchy as a guide to determine how much of your cybersecurity efforts to apply to each risk area. An appropriate amount of security for your application will eliminate (where possible) or mitigate the worst risks.
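If the notes app isn’t your style, the same hierarchy can be kept as a small data structure. This is only a sketch: the Risk fields, the 1-to-5 impact scale, and the example risks are assumptions for illustration, not part of any formal methodology.

```python
from dataclasses import dataclass


@dataclass
class Risk:
    description: str
    impact: int  # 1 (mildest) to 5 (worst); the scale is an assumption


def prioritize(risks: list[Risk]) -> list[Risk]:
    """Sort risks from the worst to the mildest by their impact."""
    return sorted(risks, key=lambda risk: risk.impact, reverse=True)


# Hypothetical risks for an imaginary web application.
threat_model = prioritize([
    Risk("Verbose error pages reveal the framework version", impact=1),
    Risk("An attacker reads the user database via SQL injection", impact=5),
    Risk("Session tokens leak through server logs", impact=3),
])
```

The items at the top of threat_model are the ones to eliminate or mitigate first.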

Pushing left

Although it sounds like a dance move meme, pushing left refers instead to building in as much of your planned security as possible in the early stages of software development.

Building software is a lot like building a treehouse, just without the pleasant fresh air. You start with the basic supporting components, such as attaching a platform to a tree. Then comes the framing, walls, and roof, and finally, your rustic-modern Instagram-worthy wall hangings and deer bust.

The further along in the build process you are, the harder and more costly it becomes to make changes to a component that you’ve already installed. If you discover a problem with the walls only after the roof is put in place, you may need to change or remove the roof in order to fix it. Similar parallels can be drawn for software components, only without similar ease in detangling the attached parts.

In the case of a treehouse, it’s rather impossible to start with decorations or even a roof, since you can’t really suspend them in midair. In the case of software development, it is, unfortunately, possible to build many top-layer components and abstractions without a sufficient supporting architecture. A push-left approach views each additional layer as adding cost and complication. Pushing left means attempting to mitigate security risks as much as possible at each development stage before proceeding to the next.

Building bottom-to-top

By considering your threat model in the early stages of developing your application, you reduce the chances of necessitating a costly remodel later on. You can make choices about architecture, components, and code that support the main security goals of your particular application.

While it’s not possible to foresee all the functionality your application may one day need to support, it is possible to prepare a solid foundation that allows additional functionality to be added more securely. Building in appropriate security from the bottom to the top will help make mitigating security risks much easier in the future.

Hugo vs Jekyll: an epic battle of static site generator themes

2020-04-27T06:34:41-04:00

I recently took on the task of creating a documentation site theme for two projects. Both projects needed the same basic features, but one uses Jekyll while the other uses Hugo.

In typical developer rationality, there was clearly only one option. I decided to create the same theme in both frameworks, and to give you, dear reader, a side-by-side comparison.

This post isn’t a comprehensive theme-building guide, but it is intended to familiarize you with the process of building a theme in either generator. Here’s what we’ll cover:

How theme files are organized
Where to put content
How templating works
Creating a top-level menu with the pages object
Creating a menu with nested links from a data list
Putting the template together
Create a stylesheet
- Sass and CSS in Jekyll
- Sass and Hugo Pipes in Hugo
Configure and deploy to GitHub Pages
Showtime
Wait who won

Here’s a crappy wireframe of the theme I’m going to create.

If you’re planning to build along, it may be helpful to serve the theme locally as you build it; both generators offer this functionality. For Jekyll, run jekyll serve, and for Hugo, hugo serve.

There are two main elements: the main content area, and the all-important sidebar menu. To create them, you’ll need template files that tell the site generator how to generate the HTML page. To organize theme template files in a sensible way, you first need to know what directory structure the site generator expects.

How theme files are organized

Jekyll supports gem-based themes, which users can install like any other Ruby gems. This method hides theme files in the gem, so for the purposes of this comparison, we aren’t using gem-based themes.

When you run jekyll new-theme <name>, Jekyll will scaffold a new theme for you. Here’s what those files look like:

.
├── assets
├── Gemfile
├── _includes
├── _layouts
│   ├── default.html
│   ├── page.html
│   └── post.html
├── LICENSE.txt
├── README.md
├── _sass
└── .gemspec

The directory names are appropriately descriptive. The _includes directory is for small bits of code that you reuse in different places, in much the same way you’d put butter on everything. (Just me?) The _layouts directory contains templates for different types of pages on your site. The _sass folder is for Sass files used to build your site’s stylesheet.

You can scaffold a new Hugo theme by running hugo new theme <name>. It has these files:

.
├── archetypes
│   └── default.md
├── layouts
│   ├── 404.html
│   ├── _default
│   │   ├── baseof.html
│   │   ├── list.html
│   │   └── single.html
│   ├── index.html
│   └── partials
│       ├── footer.html
│       ├── header.html
│       └── head.html
├── LICENSE
├── static
│   ├── css
│   └── js
└── theme.toml

You can see some similarities. Hugo’s page template files are tucked into layouts/. Note that the _default page type has files for a list.html and a single.html. Unlike Jekyll, Hugo uses these specific file names to distinguish between list pages (like a page with links to all your blog posts on it) and single pages (like one of your blog posts). The layouts/partials/ directory contains the buttery reusable bits, and stylesheet files have a spot picked out in static/css/.

These directory structures aren’t set in stone, as both site generators allow some measure of customization. For example, Jekyll lets you define collections, and Hugo makes use of page bundles. These features let you organize your content multiple ways, but for now, let’s look at where to put some simple pages.

Where to put content

To create a site menu that looks like this:

Introduction
    Getting Started
    Configuration
    Deploying
Advanced Usage
    All Configuration Settings
    Customizing
    Help and Support

You’ll need two sections (“Introduction” and “Advanced Usage”) containing their respective subsections.

Jekyll isn’t strict with its content location. It expects pages in the root of your site, and will build whatever’s there. Here’s how you might organize these pages in your Jekyll site root:

.
├── 404.html
├── assets
├── Gemfile
├── _includes
├── index.markdown
├── intro
│   ├── config.md
│   ├── deploy.md
│   ├── index.md
│   └── quickstart.md
├── _layouts
│   ├── default.html
│   ├── page.html
│   └── post.html
├── LICENSE.txt
├── README.md
├── _sass
├── .gemspec
└── usage
    ├── customizing.md
    ├── index.md
    ├── settings.md
    └── support.md

You can change the location of the site source in your Jekyll configuration.

In Hugo, all rendered content is expected in the content/ folder. This prevents Hugo from trying to render pages you don’t want, such as 404.html, as site content. Here’s how you might organize your content/ directory in Hugo:

.
├── _index.md
├── intro
│   ├── config.md
│   ├── deploy.md
│   ├── _index.md
│   └── quickstart.md
└── usage
    ├── customizing.md
    ├── _index.md
    ├── settings.md
    └── support.md

To Hugo, _index.md and index.md mean different things. It can be helpful to know what kind of Page Bundle you want for each section: Leaf, which has no children, or Branch.

Now that you have some idea of where to put things, let’s look at how to build a page template.

How templating works

Jekyll page templates are built with the Liquid templating language. It uses braces to output variable content to a page, such as the page’s title: {{ page.title }}.

Hugo’s templates also use braces, but they’re built with Go Templates. The syntax is similar, but different: {{ .Title }}.

Both Liquid and Go Templates can handle logic. Liquid uses tags syntax to denote logic operations:

{% if user %}
  Hello {{ user.name }}!
{% endif %}

And Go Templates places its functions and arguments in its braces syntax:

{{ if .User }}
    Hello {{ .User }}!
{{ end }}

Templating languages allow you to build one skeleton HTML page, then tell the site generator to put variable content in areas you define. Let’s compare two possible default page templates for Jekyll and Hugo.

Jekyll’s scaffold default theme is bare, so we’ll look at their starter theme Minima. Here’s _layouts/default.html in Jekyll (Liquid):


<html lang="{{ page.lang | default: site.lang | default: "en" }}">

  {%- include head.html -%}

  <body>

    {%- include header.html -%}

    <main class="page-content" aria-label="Content">
      <div class="wrapper">
        {{ content }}
      </div>
    </main>

    {%- include footer.html -%}

  </body>

</html>

Here’s Hugo’s scaffold theme layouts/_default/baseof.html (Go Templates):


<html>
    {{- partial "head.html" . -}}
    <body>
        {{- partial "header.html" . -}}
        <div id="content">
        {{- block "main" . }}{{- end }}
        </div>
        {{- partial "footer.html" . -}}
    </body>
</html>

Different syntax, same idea. Both templates pull in reusable bits for head.html, header.html, and footer.html. These show up on a lot of pages, so it makes sense not to have to repeat yourself. Both templates also have a spot for the main content, though the Jekyll template uses a variable ({{ content }}) while Hugo uses a block ({{- block "main" . }}{{- end }}). Blocks are just another way Hugo lets you define reusable bits.

Now that you know how templating works, you can build the sidebar menu for the theme.

Creating a top-level menu with the `pages` object

You can programmatically create a top-level menu from your pages. It will look like this:

Introduction
Advanced Usage

Let’s start with Jekyll. You can display links to site pages in your Liquid template by iterating through the site.pages object that Jekyll provides and building a list:

<ul>
    {% for page in site.pages %}
    <li><a href="{{ page.url | absolute_url }}">{{ page.title }}</a></li>
    {% endfor %}
</ul>

This returns all of the site’s pages, including all the ones that you might not want, like 404.html. You can filter for the pages you actually want with a couple more tags, such as conditionally including pages if they have a section: true parameter set:

<ul>
    {% for page in site.pages %}
    {%- if page.section -%}
    <li><a href="{{ page.url | absolute_url }}">{{ page.title }}</a></li>
    {%- endif -%}
    {% endfor %}
</ul>

You can achieve the same effect with slightly less code in Hugo. Loop through Hugo’s .Pages object using Go Template’s range action:

<ul>
{{ range .Pages }}
    <li>
        <a href="{{.Permalink}}">{{.Title}}</a>
    </li>
{{ end }}
</ul>

This template uses the .Pages object to return all the top-level pages in content/ of your Hugo site. Since Hugo uses a specific folder for the site content you want rendered, there’s no additional filtering necessary to build a simple menu of site pages.

Both site generators can use a separately defined data list of links to render a menu in your template. This is more suitable for creating nested links, like this:

Introduction
    Getting Started
    Configuration
    Deploying
Advanced Usage
    All Configuration Settings
    Customizing
    Help and Support

Jekyll supports data files in a few formats, including YAML. Here’s the definition for the menu above in _data/menu.yml:

section:
  - page: Introduction
    url: /intro
    subsection:
      - page: Getting Started
        url: /intro/quickstart
      - page: Configuration
        url: /intro/config
      - page: Deploying
        url: /intro/deploy
  - page: Advanced Usage
    url: /usage
    subsection:
      - page: Customizing
        url: /usage/customizing
      - page: All Configuration Settings
        url: /usage/settings
      - page: Help and Support
        url: /usage/support

Here’s how to render the data in the sidebar template:

{% for a in site.data.menu.section %}
<a href="{{ a.url }}">{{ a.page }}</a>
<ul>
    {% for b in a.subsection %}
    <li><a href="{{ b.url }}">{{ b.page }}</a></li>
    {% endfor %}
</ul>
{% endfor %}

This method allows you to build a custom menu, two nesting levels deep. The nesting levels are limited by the for loops in the template. For a recursive version that handles further levels of nesting, see Nested tree navigation with recursion.
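To show why recursion lifts that limit, here is a sketch of the idea in Python rather than Liquid. It is not the Jekyll implementation; render_menu is a hypothetical helper that walks data shaped like the menu.yml entries above, and it calls itself for each subsection, so the nesting depth comes from the data instead of from the number of loops you wrote.

```python
from html import escape


def render_menu(items: list[dict]) -> str:
    """Render menu items, and any nested subsections, as an HTML list."""
    if not items:
        return ""
    parts = ["<ul>"]
    for item in items:
        link = f'<a href="{escape(item["url"])}">{escape(item["page"])}</a>'
        # Recurse into the children; the code does not fix the depth.
        children = render_menu(item.get("subsection", []))
        parts.append(f"<li>{link}{children}</li>")
    parts.append("</ul>")
    return "".join(parts)


menu = [
    {"page": "Introduction", "url": "/intro", "subsection": [
        {"page": "Getting Started", "url": "/intro/quickstart"},
    ]},
]
menu_html = render_menu(menu)
```

Adding a third or fourth level to the data needs no change to the function.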

Hugo does something similar with its menu templates. You can define menu links in your Hugo site config, and even add useful properties that Hugo understands, like weighting. Here’s a definition of the menu above in config.yaml:

sectionPagesMenu: main

menu:
  main:
    - identifier: intro
      name: Introduction
      url: /intro/
      weight: 1
    - name: Getting Started
      parent: intro
      url: /intro/quickstart/
      weight: 1
    - name: Configuration
      parent: intro
      url: /intro/config/
      weight: 2
    - name: Deploying
      parent: intro
      url: /intro/deploy/
      weight: 3
    - identifier: usage
      name: Advanced Usage
      url: /usage/
    - name: Customizing
      parent: usage
      url: /usage/customizing/
      weight: 2
    - name: All Configuration Settings
      parent: usage
      url: /usage/settings/
      weight: 1
    - name: Help and Support
      parent: usage
      url: /usage/support/
      weight: 3

Hugo uses the identifier, which must match the section name, along with the parent variable to handle nesting. Here’s how to render the menu in the sidebar template:

<ul>
    {{ range .Site.Menus.main }}
    {{ if .HasChildren }}
    <li>
        <a href="{{ .URL }}">{{ .Name }}</a>
    </li>
    <ul class="sub-menu">
        {{ range .Children }}
        <li>
            <a href="{{ .URL }}">{{ .Name }}</a>
        </li>
        {{ end }}
    </ul>
    {{ else }}
    <li>
        <a href="{{ .URL }}">{{ .Name }}</a>
    </li>
    {{ end }}
    {{ end }}
</ul>

The range function iterates over the menu data, and Hugo’s .Children variable handles nested pages for you.
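If it helps to see the idea outside of a template, here is a rough Python sketch of how flat entries with identifier, parent, and weight values can be grouped into a two-level menu. It is a simplification for illustration and not Hugo’s actual code; build_menu_tree is a hypothetical name, and placing entries without a weight last is an assumption of this sketch.

```python
from collections import defaultdict


def build_menu_tree(entries: list[dict]) -> list[dict]:
    """Group flat menu entries under their parents, ordered by weight."""
    children = defaultdict(list)
    for entry in entries:
        # Top-level entries have no parent, so they are grouped under None.
        children[entry.get("parent")].append(entry)

    def by_weight(entry: dict) -> float:
        # Assumption: entries without a weight sort after weighted ones.
        return entry.get("weight", float("inf"))

    tree = []
    for top in sorted(children[None], key=by_weight):
        kids = children[top["identifier"]] if "identifier" in top else []
        names = [kid["name"] for kid in sorted(kids, key=by_weight)]
        tree.append({"name": top["name"], "children": names})
    return tree


entries = [
    {"identifier": "intro", "name": "Introduction", "weight": 1},
    {"name": "Configuration", "parent": "intro", "weight": 2},
    {"name": "Getting Started", "parent": "intro", "weight": 1},
    {"identifier": "usage", "name": "Advanced Usage"},
    {"name": "Customizing", "parent": "usage", "weight": 2},
    {"name": "All Configuration Settings", "parent": "usage", "weight": 1},
]
tree = build_menu_tree(entries)
```

The order of the entries in the config doesn’t matter here; the parent values decide the grouping and the weights decide the order.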

Putting the template together

With your menu in your reusable sidebar bit (_includes/sidebar.html for Jekyll and partials/sidebar.html for Hugo), you can add it to the default.html template.

In Jekyll:


<html lang="{{ page.lang | default: site.lang | default: "en" }}">

{%- include head.html -%}

<body>
    {%- include sidebar.html -%}

    {%- include header.html -%}

    <div id="content" class="page-content" aria-label="Content">
        {{ content }}
    </div>

    {%- include footer.html -%}

</body>

</html>

In Hugo:


<html>
{{- partial "head.html" . -}}

<body>
    {{- partial "sidebar.html" . -}}

    {{- partial "header.html" . -}}
    <div id="content" class="page-content" aria-label="Content">
        {{- block "main" . }}{{- end }}
    </div>
    {{- partial "footer.html" . -}}
</body>

</html>

When the site is generated, each page will contain all the code from your sidebar.html.

Create a stylesheet

Both site generators accept Sass for creating CSS stylesheets. Jekyll has Sass processing built in, and Hugo uses Hugo Pipes. Both options have some quirks.

Sass and CSS in Jekyll

To process a Sass file in Jekyll, create your style definitions in the _sass directory. For example, in a file at _sass/style-definitions.scss:

$background-color: #eef !default;
$text-color: #111 !default;

body {
  background-color: $background-color;
  color: $text-color;
}

Jekyll won’t generate this file directly, as it only processes files with front matter. To create the end-result filepath for your site’s stylesheet, use a placeholder with empty front matter where you want the .css file to appear. For example, assets/css/style.scss. In this file, simply import your styles:

---
---

@import "style-definitions";

This rather hackish configuration has an upside: you can use Liquid template tags and variables in your placeholder file. This is a nice way to allow users to set variables from the site _config.yml, for example.

The resulting CSS stylesheet in your generated site has the path /assets/css/style.css. You can link to it in your site’s head.html using:

<link rel="stylesheet" href="{{ "/assets/css/style.css" | relative_url }}" media="screen">

Sass and Hugo Pipes in Hugo

Hugo uses Hugo Pipes to process Sass to CSS. You can achieve this by using Hugo’s asset processing function, resources.ToCSS, which expects a source in the assets/ directory. It takes the SCSS file as an argument. With your style definitions in a Sass file at assets/sass/style.scss, here’s how to get, process, and link your Sass in your theme’s head.html:

{{ $style := resources.Get "/sass/style.scss" | resources.ToCSS }}
<link rel="stylesheet" href="{{ $style.RelPermalink }}" media="screen">

Hugo asset processing requires extended Hugo, which you may not have by default. You can get extended Hugo from the releases page.

Configure and deploy to GitHub Pages

Before your site generator can build your site, it needs a configuration file to set some necessary parameters. Configuration files live in the site root directory. Among other settings, you can declare the name of the theme to use when building the site.

Configure Jekyll

Here’s a minimal _config.yml for Jekyll:

title: Your awesome title
description: >- # this means to ignore newlines until "baseurl:"
  Write an awesome description for your new site here. You can edit this
  line in _config.yml. It will appear in your document head meta (for
  Google search results) and in your feed.xml site description.
baseurl: "" # the subpath of your site, e.g. /blog
url: "" # the base hostname & protocol for your site, e.g. http://example.com
theme: # for gem-based themes
remote_theme: # for themes hosted on GitHub, when used with GitHub Pages

With remote_theme, any Jekyll theme hosted on GitHub can be used with sites hosted on GitHub Pages.

Jekyll has a default configuration, so any parameters added to your configuration file will override the defaults. Here are additional configuration settings.

Configure Hugo

Here’s a minimal example of Hugo’s config.yaml:

baseURL: https://example.com/ # The full domain your site will live at
languageCode: en-us
title: Hugo Docs Site
theme: # theme name

Hugo makes no assumptions, so if a necessary parameter is missing, you’ll see a warning when building or serving your site. Here are all configuration settings for Hugo.

Deploy to GitHub Pages

Both generators build your site with a command.

For Jekyll, use jekyll build. See further build options here.

For Hugo, use hugo. You can run hugo help or see further build options here.

You’ll have to choose the source for your GitHub Pages site; once done, your site will update each time you push a new build. Of course, you can also automate your GitHub Pages build using GitHub Actions. Here’s one for building and deploying with Hugo, and one for building and deploying Jekyll.

Showtime

All the substantial differences between these two generators are under the hood; all the same, let’s take a look at the finished themes, in two color variations.

Here’s Hugo:

Here’s Jekyll:

Spiffy!

Wait who won

🤷

Both Hugo and Jekyll have their quirks and conveniences.

From this developer’s perspective, Jekyll is a workable choice for simple sites without complicated organizational needs. If you’re looking to render some one-page posts in an available theme and host with GitHub Pages, Jekyll will get you up and running fairly quickly.

Personally, I use Hugo. I like the organizational capabilities of its Page Bundles, and it’s backed by a dedicated and conscientious team that really seems to strive to facilitate convenience for their users. This is evident in Hugo’s many functions, and handy tricks like Image Processing and Shortcodes. They seem to release new fixes and versions about as often as I make a new cup of coffee.

If you still can’t decide, don’t worry. Many themes are available for both Hugo and Jekyll! Start with one, switch later if you want. That’s the benefit of having options.

Outsourcing security with 1Password, Authy, and Privacy.com

2020-03-16T08:12:32-04:00

We’ve already got enough to deal with without worrying about our cybersecurity. When humans are busy and under stress, we tend to get lax in less-obviously-pressing areas, like the integrity of our online accounts. These areas only become an obvious problem when it’s too late for prevention.

Cybersecurity can be fiddly and time-consuming. You might need to reset forgotten passwords, transfer multifactor authentication (MFA) codes to different devices, or deal with the fallout of compromised payment details in the event one of your accounts is still breached.

Thankfully, most of the work necessary to keep up our cybersecurity measures can be outsourced.

Here are three changes you can make to significantly reduce the chances of needing to fiddle with any of these things again.

1Password

I’ve historically avoided password managers because of an irrational knee-jerk reaction to putting all my eggs in one basket. You know what’s great for irrational reactions? Education.

To figure out if putting all my passwords into a password manager is more secure than not using one, I set out to see what some smart people wrote about it.

First, we need to know a thing or two about passwords. Troy Hunt figured out almost a decade ago that trying to remember strong passwords doesn’t work. In more recent times, Alex Weinert expanded on this in Your Pa$$word doesn’t matter. TL;DR: our brains aren’t better at passwords than computers, and please use MFA.

So passwords don’t matter, but complicated passwords are still better than memorable and guessable ones. Since I’ve next to no hope of remembering a dozen variations of p/q2-q4! (I’m not a chess player), this is a task I can outsource to 1Password. I’ll still need to remember one, long, complicated master password - 1Password uses this to encrypt my data, so I really can’t lose it - but I can handle just one.

Using 1Password specifically has another, decidedly obvious, advantage. I chose 1Password because of their Watchtower feature. Thanks to Troy Hunt’s Have I Been Pwned, Watchtower will alert you if any of your passwords show up in a breach so you can change them. Passwords still don’t completely work, but this is probably the best band-aid there is.

One last bonus is that using a password manager is a heck of a lot more convenient. I don’t need to take a few tries to type in a complicated password. I don’t end up spending time resetting passwords I’ve forgotten on sites I only rarely use.

When tasked with remembering all their own passwords, people typically create simpler passwords that are easier to remember – and easier to hack. This occurs most frequently on sites that are considered unimportant. Using 1Password and generated passwords, those sites are now also first-class citizens in the land of strong passwords, instead of being half-abandoned and half-open attack vectors.

So, yes, all my eggs are in one basket. A well-protected, complex, and monitored basket.

Authy

Okay - so it’s more like one-and-a-half baskets. 🤷🏻

Authy, from the folks over at Twilio, provides a 2FA solution that’s more secure than SMS. Unlike Google Authenticator, you can choose to back up your 2FA codes in case you lose or change your phone. (1Password offers 2FA functionality as well - but, you know, redundancies.)

With Authy, your backup is encrypted with your password, similarly to how 1Password works. This makes it the second password you can’t forget, if you don’t want to lose access to your codes. If you reset your account, they all go away. I can deal with remembering two passwords; I’ll take that trade.

I’ve tried other methods of MFA, including hardware keys, which can make accessing accounts on your phone more complicated than I care to put up with. To my knowledge, pairing 1Password with Authy offers the most practical balance of convenience and security available.

Privacy.com

Finally, there’s one last line of defense you can put in place in the unfortunate event that one of your accounts is still compromised. All the strong passwords and MFA in the world won’t help if you open the doors yourself, and scams and phishing are a thing.

Since it’s rather impractical to use a different real credit card every place you shop, virtual cards are just a great idea. There’s no good reason to spend an afternoon (or more) resetting your payment information on every account just to thwart a misbehaving merchant or patch up a data breach from that online shop for cute salt shakers you made a purchase at last year (just me?).

As a bonus, a partnership between 1Password and Privacy.com lets you easily create virtual credit cards using the 1Password extension.

By setting up a separate virtual card for each merchant, in the event that one of those merchants is compromised, you can simply pause or delete that card. None of your other accounts or actual bank details are caught up in the process. Cards can have time-based limits or be one-off burner numbers, making them ideal for setting up subscriptions.

This is the sort of basic functionality that I hope, one day, becomes more prevalent from banks and credit cards. In the meantime, I’ll keep using Privacy.com. That’s my referral link; if you’d like to thank me by using it, we’ll both get five bucks as a bonus.

Outsource better security

Altogether, implementing these changes will probably take up an afternoon, depending on how many accounts you have. It’s worth it for the time you’d otherwise spend resetting passwords, setting up new devices, or (knock on wood) recovering from compromised banking details. Best of all, you’ll have continual protection just running in the background.

We have the technology. Free up some brain cycles to focus on other things - or simply remove some unnecessary stress from your life by outsourcing the fiddly bits.

Want to give the gift of cybersecurity to someone you know? Get them started with a cybersecurity starter pack.

SQLite in Production with WAL

2020-03-05T10:14:43-05:00

Update: read the Hacker News discussion.

So you need a database. It’s going to handle a few hundred users and mostly read operations. Time to set up a PostgreSQL cluster, debate connection pooling strategies, configure replication, and design backup procedures… right?

When I say, “What about SQLite?”, the response is usually some variation of “That’s not a real database.”

This reaction reveals something important about how engineering teams make technology decisions. We often choose tools based on what sounds impressive rather than what solves our actual problems. SQLite represents an underappreciated truth in engineering leadership: sometimes the boring, simple solution is exactly what your team needs.

SQLite (“see-quell-lite”) is a lightweight SQL database engine that’s self-contained in a single file. It’s library, database, and data, all in one package. For certain applications, SQLite is a solid choice for a production database. It’s lightweight, ultra-portable, and has no external dependencies.

Matching Tools to Actual Requirements

As an engineering leader, one of your most important responsibilities is helping your team choose appropriate technology rather than impressive technology. SQLite excels in specific scenarios that are more common than most teams realize.

SQLite is best suited for production use in applications that:

Desire fast and simple setup
Require high reliability in a small package
Have, and want to retain, a small footprint
Are read-heavy but not write-heavy
Don’t need multiple user accounts or features like multiversion concurrency snapshots

These criteria describe a significant percentage of web applications, internal tools, and even customer-facing products. But teams often dismiss SQLite because it doesn’t match their mental model of what a “serious” database looks like.

Recognizing when your team’s technology choices are driven by resume-driven development rather than problem-solving can save you oodles of time and budget wiggle room. Complex solutions carry hidden costs in deployment complexity, operational overhead, and cognitive load that simple solutions avoid entirely.

Understanding the Technical Trade-offs

To guide these decisions effectively, it helps to understand the technical details well enough to evaluate trade-offs intelligently. In the case of SQLite, you can examine its performance characteristics to make this evaluation concrete:

Database Transaction Modes

The POSIX system call fsync() commits buffered data (data saved in the operating system cache) referred to by a specified file descriptor to permanent storage, such as a disk. This is relevant to understanding the difference between SQLite’s two modes, as fsync() will block until the device reports the transfer is complete.

SQLite uses atomic commits to batch database changes into single transactions, enabling apparent simultaneous writing of multiple operations. This is accomplished through one of two modes: rollback journal or write-ahead log (WAL).

Rollback Journal Mode

A rollback journal is essentially a back-up file created by SQLite before write changes occur on a database file. It has the advantage of providing high reliability by helping SQLite restore the database to its original state in case a write operation is compromised during the disk-writing process.

Assuming a cold cache, SQLite first needs to read the relevant pages from a database file before it can write to it. Information is read out into the operating system cache, then transferred into user space. SQLite obtains a reserved lock on the database file, preventing other processes from writing to the database. At this point, other processes may still read from the database.

SQLite creates a separate file, the rollback journal, with the original content of the pages that will be changed. Initially existing in the cache, the rollback journal is written to persistent disk storage with fsync() to enable SQLite to restore the database should its next operations be compromised.

SQLite then obtains an exclusive lock preventing other processes from reading or writing, and writes the page changes to the database file in cache. Since writing to disk is slower than interaction with the cache, writing to disk doesn’t occur immediately. The rollback journal continues to exist until changes are safely written to disk with a second fsync(). From a user-space process point of view, the change to the disk (the COMMIT, or end of the transaction) happens instantaneously once the rollback journal is deleted - hence, atomic commits. However, the two fsync() operations required to complete the COMMIT make this option, from a transactional standpoint, slower than SQLite’s lesser known WAL mode.

Write-ahead logging (WAL)

While the rollback journal method uses a separate file to preserve the original database state, the WAL method uses a separate WAL file to instead record the changes. Instead of a COMMIT depending on writing changes to disk, a COMMIT in WAL mode occurs when a record of one or more commits is appended to the WAL. This has the advantage of not requiring blocking read or write operations to the database file in order to make a COMMIT, so more transactions can happen concurrently.

WAL mode introduces the concept of the checkpoint, which is when the WAL file is synced to persistent storage before all its transactions are transferred to the database file. You can optionally specify when this occurs, but SQLite provides reasonable defaults. The checkpoint is the WAL version of the atomic commit.

In WAL mode, write transactions are performed faster than in the traditional rollback journal mode. Each transaction involves writing the changes only once to the WAL file instead of twice - to the rollback journal, and then to disk - before the COMMIT signals that the transaction is over.

For teams handling moderate write loads, WAL mode often provides the performance characteristics they actually need without the operational complexity of distributed databases.
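As a rough illustration, here is how WAL mode might be switched on using Python’s built-in sqlite3 module. The file name example.db and the notes table are placeholders invented for this sketch, and the explicit checkpoint is shown only to make the step visible; normally SQLite’s automatic checkpointing would handle it.

```python
import os
import sqlite3
import tempfile

# WAL needs a file-backed database; an in-memory database cannot use it.
db_path = os.path.join(tempfile.mkdtemp(), "example.db")
conn = sqlite3.connect(db_path)

# The pragma returns the journal mode now in effect.
mode = conn.execute("PRAGMA journal_mode=WAL").fetchone()[0]
print(mode)

conn.execute("CREATE TABLE notes (id INTEGER PRIMARY KEY, body TEXT)")
with conn:  # commits on success; the commit is appended to the WAL file
    conn.execute("INSERT INTO notes (body) VALUES (?)", ("hello",))

# Committed changes sit in example.db-wal until a checkpoint moves them.
wal_exists = os.path.exists(db_path + "-wal")

# Request a checkpoint explicitly rather than waiting for the automatic one.
conn.execute("PRAGMA wal_checkpoint(TRUNCATE)")
row_count = conn.execute("SELECT COUNT(*) FROM notes").fetchone()[0]
conn.close()
```

The journal mode setting is persistent for a database file, so it typically only needs to be set once rather than on every connection.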

The Performance Reality

Benchmarks tell a compelling story about SQLite’s practical capabilities. On modest hardware—the smallest EC2 instance with no provisioned IOPS—SQLite with WAL mode handles 400 write transactions per second and thousands of reads. For many applications, this represents more capacity than they need.

These numbers matter because they provide concrete data for technology discussions. Instead of theoretical conversations about “what if we need to scale,” you can evaluate whether 400 writes per second actually meets your requirements. Often, it does—with significant room for growth.
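One way to ground that conversation is to measure write throughput on your own hardware. The sketch below is a minimal, assumption-laden example: measure_write_tps, the events table, and the transaction count are all made up for illustration, and the result will vary with disk, filesystem, and SQLite settings, so it is not directly comparable to the benchmark cited above.

```python
import os
import sqlite3
import tempfile
import time


def measure_write_tps(db_path, transactions=200):
    """Return committed write transactions per second (hypothetical helper)."""
    conn = sqlite3.connect(db_path)
    conn.execute("PRAGMA journal_mode=WAL")
    conn.execute("CREATE TABLE IF NOT EXISTS events (id INTEGER PRIMARY KEY, payload TEXT)")
    start = time.perf_counter()
    for i in range(transactions):
        with conn:  # one INSERT per transaction, committed on exit
            conn.execute("INSERT INTO events (payload) VALUES (?)", (f"event-{i}",))
    elapsed = time.perf_counter() - start
    conn.close()
    return transactions / elapsed


path = os.path.join(tempfile.mkdtemp(), "bench.db")
print(f"{measure_write_tps(path):.0f} write transactions per second")
```

Comparing the printed figure against your expected peak write rate gives the discussion a concrete starting point.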

More importantly, SQLite eliminates entire categories of operational complexity: connection pooling, database server maintenance, backup procedures, replication configuration, and deployment coordination. The operational overhead you don’t have to manage often provides more value than the theoretical scalability you might need someday.

Making Strategic Technology Decisions

Engineering teams often equate complexity with sophistication and assume that simple solutions won’t scale or aren’t “enterprise-ready” without considering the actual requirements of the enterprise. The SQLite decision exemplifies a broader principle in engineering leadership: optimizing for actual constraints rather than imaginary ones. This requires understanding both the technical capabilities of your options and the real requirements of your systems.

This means asking your teams to articulate specific performance requirements, operational constraints, and growth projections rather than making technology choices based on industry trends or resume building. It means evaluating the total cost of ownership including deployment complexity, operational overhead, and team cognitive load.

Most importantly, it means recognizing that the best technology choice is often the one that solves your current problems effectively while remaining simple enough to understand, maintain, and evolve as requirements change.

Building a Culture of Appropriate Technology

Teams that consistently make good technology choices develop systematic approaches to evaluation rather than relying on instinct or industry hype. They start with requirements, evaluate options based on total cost of ownership, and choose solutions that match their actual needs rather than their aspirational ones.

This culture emerges when leaders model technical decision-making that prioritizes problem-solving over impressiveness. When you advocate for SQLite over PostgreSQL because it better matches your workload, you’re teaching your team to think critically about technology trade-offs.

The long-term impact is teams that build sustainable systems they can actually maintain and evolve. Simple solutions that solve real problems create more value than complex solutions that solve theoretical ones.

For medium-sized, read-heavy applications, SQLite with WAL mode represents exactly this kind of appropriate technology choice. It provides perfectly adequate capability in a perfectly compact package—which is often exactly what your application needs.

17 Minutes to 16 Seconds: a 60x Performance Improvement from… Python?!

2020-02-28T09:31:02-05:00

Engineering teams will spend weeks optimizing database queries that run in milliseconds while ignoring network requests that take hundreds of milliseconds. They’ll debate the performance implications of different sorting algorithms while their application spends seventeen minutes on network latency to process a few hundred requests. This misallocation of optimization effort reveals a common leadership challenge: helping teams identify and focus on the bottlenecks that actually matter.

When I developed Hydra, a multithreaded link checker written in Python, the performance requirements were clear. It needed to run as part of CI/CD processes, which meant speed was essential for developer productivity. Nobody wants to wait seventeen minutes to learn whether their build succeeded—that’s long enough to make coffee, check Twitter, question your career choices, and wonder if the process crashed.

The project became a case study in systematic performance optimization and the leadership decisions that guide technical implementation. Unlike many Python site crawlers that rely on external dependencies like BeautifulSoup, Hydra uses only standard libraries. This constraint required thinking carefully about how to achieve optimal performance within Python’s limitations.

Understanding the Performance Landscape

As an engineering leader, one of your most important responsibilities is helping your team understand where performance problems actually occur versus where they assume they occur. Most developers have an intuitive sense that network operations are slower than CPU operations, but the actual magnitude of these differences is staggering.

Here are approximate timings for tasks performed on a typical PC:

Component	Task	Time
CPU	execute typical instruction	1/1,000,000,000 sec = 1 nanosec
CPU	fetch from L1 cache memory	0.5 nanosec
CPU	branch misprediction	5 nanosec
CPU	fetch from L2 cache memory	7 nanosec
RAM	Mutex lock/unlock	25 nanosec
RAM	fetch from main memory	100 nanosec
Network	send 2K bytes over 1Gbps network	20,000 nanosec
RAM	read 1MB sequentially from memory	250,000 nanosec
Disk	fetch from new disk location (seek)	8,000,000 nanosec (8ms)
Disk	read 1MB sequentially from disk	20,000,000 nanosec (20ms)
Network	send packet US to Europe and back	150,000,000 nanosec (150ms)

Peter Norvig first published these numbers in Teach Yourself Programming in Ten Years. While hardware continues to evolve, the relative relationships remain humbling for anyone who’s ever spent time optimizing the wrong thing.

Notice that sending a simple packet over the Internet is over a million times slower than fetching from RAM. These aren’t small performance differences—they’re fundamental constraints that should guide every optimization decision you make.

For Hydra, parsing response data and assembling results happens on the CPU and is relatively fast. The overwhelming bottleneck—by over six orders of magnitude—is network latency. Any optimization effort that didn’t address network I/O would miss the point.
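The ratio is easy to check with a few lines of Python, using the approximate figures from the table. The count of 500 requests is an arbitrary number chosen for illustration, not a measurement from Hydra.

```python
# Approximate timings from the table above, in nanoseconds.
TIMINGS_NS = {
    "fetch from main memory": 100,
    "read 1MB sequentially from disk": 20_000_000,
    "send packet US to Europe and back": 150_000_000,
}

ram = TIMINGS_NS["fetch from main memory"]
packet = TIMINGS_NS["send packet US to Europe and back"]

ratio = packet / ram
print(f"Round trip vs. RAM fetch: {ratio:,.0f}x slower")

# Sequential cost of 500 round trips at 150 ms each, in seconds
# (500 is an arbitrary illustration, not a measured figure).
requests_count = 500
print(requests_count * packet / 1e9, "seconds")
```

Waiting on each round trip one after another adds up quickly, which is why overlapping the waits matters far more than shaving CPU time.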

Working Within Python’s Constraints

Python presents an interesting challenge for performance-critical applications. The Global Interpreter Lock (GIL) prevents multiple threads from executing Python bytecode simultaneously: each thread must wait for the GIL to be released by the currently executing thread. This protects the interpreter’s internal state from race conditions but also prevents true parallel execution of CPU-bound tasks.

For many engineering teams, this limitation becomes a reason to dismiss Python entirely for performance work. But effective technical leadership involves understanding how to work within constraints rather than avoiding tools with limitations.

The key insight is that Python’s GIL limitation doesn’t apply uniformly. While CPU-bound tasks suffer from the GIL, I/O-bound tasks can benefit from concurrent execution because the GIL is released during I/O operations. For Hydra’s use case—fetching web pages over the network—multithreading in Python can provide significant performance improvements despite the GIL.

This distinction matters for strategic technical decisions. Instead of automatically reaching for Go or Rust when performance requirements emerge, understanding Python’s actual constraints can enable better technology choices based on specific workload characteristics.

Choosing the Right Concurrency Model

Python provides multiple approaches to parallel execution, each suited for different types of bottlenecks. Making the right choice requires understanding both technical trade-offs and your application’s specific performance characteristics.

Multiple Processes

Python’s ProcessPoolExecutor uses worker subprocesses to bypass the GIL entirely. This approach maximizes parallelization for CPU-bound tasks by utilizing multiple processor cores effectively.

For compute-heavy operations—mathematical calculations, data processing, algorithm execution—multiple processes provide genuine parallel execution. However, this carries overhead costs in memory usage and inter-process communication that may not be justified for I/O-bound workloads.

Multiple Threads

Python’s ThreadPoolExecutor uses a pool of threads that can execute I/O operations concurrently. While threads can’t execute Python code in parallel due to the GIL, they can perform I/O operations concurrently because the GIL is released during system calls.

For I/O-bound applications—web scraping, API calls, file operations—threading provides excellent performance improvements with lower overhead than multiprocessing.
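A small experiment makes the difference visible. In this sketch, time.sleep stands in for network latency (it releases the GIL while waiting, much as a blocking socket read does); fake_request, the delay, and the worker count are assumptions chosen for illustration rather than anything taken from Hydra.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_request(delay=0.05):
    """Stand-in for a network call; time.sleep releases the GIL while waiting."""
    time.sleep(delay)
    return delay


def run_sequential(n):
    start = time.perf_counter()
    for _ in range(n):
        fake_request()
    return time.perf_counter() - start


def run_threaded(n, workers=10):
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # list() forces all results to be collected before timing stops
        list(pool.map(lambda _: fake_request(), range(n)))
    return time.perf_counter() - start


sequential = run_sequential(20)
threaded = run_threaded(20)
print(f"sequential: {sequential:.2f}s, threaded: {threaded:.2f}s")
```

The sequential run pays for every wait in full, while the threaded run overlaps them, so its total time is closer to the number of batches than the number of requests.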

Implementation Strategy

Here’s how Hydra uses ThreadPoolExecutor to achieve concurrent link checking:

from concurrent import futures
from queue import Empty, Queue


# Create the Checker class
class Checker:
    # Queue of links to be checked
    TO_PROCESS = Queue()
    # Maximum workers to run
    THREADS = 100
    # Maximum seconds to wait for HTTP response
    TIMEOUT = 60

    def __init__(self, url):
        ...
        # Create the thread pool
        self.pool = futures.ThreadPoolExecutor(max_workers=self.THREADS)

    def run(self):
        # Run until no new link arrives in TO_PROCESS for 2 seconds
        while True:
            try:
                target_url = self.TO_PROCESS.get(block=True, timeout=2)
                # If we haven't already checked this link
                if target_url["url"] not in self.visited:
                    # Mark it as visited
                    self.visited.add(target_url["url"])
                    # Submit the link to the pool
                    job = self.pool.submit(self.load_url, target_url, self.TIMEOUT)
                    job.add_done_callback(self.handle_future)
            except Empty:
                return
            except Exception as e:
                print(e)

The implementation reflects several engineering leadership principles. The thread pool size (100 workers) was determined through profiling and testing rather than guesswork. The timeout mechanism prevents slow requests from blocking overall progress. The callback pattern enables efficient result processing without blocking the main execution thread.
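The callback pattern can be tried in isolation. The following is a minimal sketch, not Hydra’s code: load_url and handle_future here are simplified, hypothetical stand-ins that simulate one successful and one failed request, and the URLs are placeholders.

```python
import time
from concurrent import futures

results = []


def load_url(url, timeout):
    """Hypothetical stand-in for a real fetch; fails for one URL on purpose."""
    time.sleep(0.01)
    if "broken" in url:
        raise ValueError(f"could not load {url}")
    return url, 200


def handle_future(future):
    """Runs when a job finishes; records the result or the error."""
    try:
        results.append(future.result())
    except Exception as error:
        results.append(("error", str(error)))


with futures.ThreadPoolExecutor(max_workers=4) as pool:
    for url in ["https://example.com/", "https://example.com/broken"]:
        job = pool.submit(load_url, url, 60)
        job.add_done_callback(handle_future)
# Leaving the with block waits for all submitted jobs to finish.

print(results)
```

Because the callback calls future.result() inside a try block, an exception raised in a worker is captured as data instead of being lost, and the submitting loop never has to wait on any individual job.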

Measuring Real Impact

Performance optimization discussions often remain theoretical without concrete measurements. For Hydra, the improvement was dramatic. Here’s a comparison between the run times for checking my website with a prototype single-thread program and using Hydra:

time python3 slow-link-check.py https://victoria.dev

real    17m34.084s
user    11m40.761s
sys     0m5.436s


time python3 hydra.py https://victoria.dev

real    0m15.729s
user    0m11.071s
sys     0m2.526s

The single-threaded implementation took over seventeen minutes. The multithreaded version completed in under sixteen seconds. That’s a performance improvement of more than 60x.
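The speedup can be worked out directly from the real times reported above. The to_seconds helper is a small convenience written for this example.

```python
def to_seconds(timing):
    """Convert a `time` value such as '17m34.084s' to seconds."""
    minutes, seconds = timing.rstrip("s").split("m")
    return int(minutes) * 60 + float(seconds)


single_threaded = to_seconds("17m34.084s")
multithreaded = to_seconds("0m15.729s")
speedup = single_threaded / multithreaded
print(f"{single_threaded:.3f}s / {multithreaded:.3f}s = {speedup:.1f}x")
```

That works out to roughly 1,054 seconds against under 16, or about 67 times faster.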

These aren’t marginal gains from micro-optimizations. They represent fundamental improvements in application efficiency that users immediately notice. While specific timings vary based on site size and network conditions, the order-of-magnitude improvement demonstrates the value of addressing actual bottlenecks systematically.

Leadership Lessons in Performance Optimization

The Hydra project illustrates several principles that engineering leaders can apply across different technologies and applications.

Focus on actual bottlenecks, not theoretical ones. Teams often optimize the wrong things because they focus on code that feels slow to write rather than code that’s actually slow to execute. Teaching teams to measure and identify real performance constraints prevents wasted optimization effort.

Understand your tools’ limitations and strengths. Python’s GIL is a constraint, but it doesn’t preclude high-performance applications in the right contexts. Effective technical leadership involves understanding how to work within technological constraints rather than avoiding tools with limitations.

Make optimization decisions based on requirements. Hydra needed to run quickly in CI/CD environments, which justified the development effort for custom multithreading. But this level of effort isn’t required in every Python application. Understanding your specific requirements helps allocate development efforts appropriately.

Measure improvement, don’t assume it. Performance optimization can introduce complexity and maintenance overhead. Concrete measurements ensure that optimization efforts provide sufficient value to justify their costs.

Building Performance-Conscious Teams

The most effective engineering teams develop systematic approaches to performance rather than relying on intuition or premature optimization. This requires creating culture and processes that encourage measurement, analysis, and strategic optimization decisions.

This means teaching teams to profile applications before optimizing them, helping them understand the performance characteristics of their technology choices, and ensuring that optimization efforts align with actual user requirements rather than theoretical concerns.

Most importantly, it means recognizing that performance optimization is a technical leadership skill that involves strategic thinking about trade-offs, constraints, and business requirements—not just implementation knowledge.

The Real Lesson

Hydra’s improvement from 17 minutes to 16 seconds teaches a lesson that applies far beyond Python: measure first, optimize second, and always focus on the constraint that’s actually limiting your system. Whether you’re debugging performance bottlenecks or organizational inefficiencies, the biggest wins come from addressing the right problem rather than optimizing the wrong one exceptionally well.

The next time your team debates whether to rewrite everything in Go for performance, remember Hydra’s 60x improvement using standard Python libraries. Sometimes the most effective optimization is the one you can implement this week rather than the solution you’ll build next quarter… or the quarter after that.

From 17 Minutes to 8 Seconds: Strategic Performance Optimization for Engineering Teams

2020-02-25T12:50:29-05:00

Leading engineering teams means constantly balancing technical excellence with organizational needs. I found myself facing a perfect example of this challenge when helping out the Open Web Application Security Project (OWASP). When I joined the core team for OWASP’s Web Security Testing Guide, I found a critical infrastructure problem that was silently undermining both our security mission and our ability to ship quality work efficiently.

OWASP is a big organization with an even bigger website to match. The site serves hundreds of thousands of visitors with cybersecurity resources that security professionals worldwide depend on. But beneath this successful exterior, we had a problem that most engineering leaders will recognize: broken processes that no one had time to fix, creating cascading inefficiencies across our entire development workflow.

OWASP.org lacked any centralized quality assurance processes and was riddled with broken links. Customers don’t like broken links; attackers really do. These weren’t just user experience issues—they represented real security vulnerabilities that could enable attacks like broken link hijacking and subdomain takeovers. Here we were, an organization dedicated to web security, with our own infrastructure exposing the exact vulnerabilities we taught others to prevent.

When Infrastructure Problems Become Leadership Problems

The broken link problem at OWASP had all the hallmarks of technical debt that had become organizational debt: volunteers avoided updating content because they knew links might break, and quality suffered because manual checking was impractical. Our CI/CD pipeline had a glaring gap where automated link validation should have been.

The underlying issue was both technical and strategic. We needed a solution that could integrate into our development workflow, scale with our volunteer contributor model, and actually get adopted by teams who were already stretched thin. This meant thinking beyond just building a tool; I needed to design a solution that addressed the human and process challenges alongside the technical ones.

Strategic Requirements Beyond Just “Make It Work”

When I proposed building an automated link checking solution, the requirements went far beyond technical functionality. As engineering leaders, we know that tools succeed or fail based on adoption, maintainability, and organizational fit. Our solution needed to:

Integrate seamlessly into existing CI/CD workflows without disrupting volunteer contributors
Provide actionable reports that non-technical content maintainers could understand and act on
Run efficiently enough to avoid becoming a bottleneck in our deployment process
Scale with OWASP’s distributed, volunteer-driven development model

The technical challenge was, essentially, to build a web crawler. The leadership challenge was ensuring it would actually solve our organizational problem rather than just creating another tool that sits unused.

This required making strategic decisions about language choice, architecture, and performance that balanced multiple constraints: team familiarity (Python was the common denominator), performance requirements (CI/CD integration demanded speed), and long-term maintainability (volunteers needed to be able to contribute to the codebase).

Understanding the Real Cost of Performance Bottlenecks

As engineering leaders, we need to think about performance in terms of organizational impact, not just technical metrics. The latency numbers that every developer should know tell a story about where bottlenecks hide and how they compound:

Type	Task	Time
CPU	execute typical instruction	1/1,000,000,000 sec = 1 nanosec
CPU	fetch from L1 cache memory	0.5 nanosec
CPU	branch misprediction	5 nanosec
CPU	fetch from L2 cache memory	7 nanosec
RAM	Mutex lock/unlock	25 nanosec
RAM	fetch from main memory	100 nanosec
RAM	read 1MB sequentially from memory	250,000 nanosec
Disk	fetch from new disk location (seek)	8,000,000 nanosec (8ms)
Disk	read 1MB sequentially from disk	20,000,000 nanosec (20ms)
Network	send packet US to Europe and back	150,000,000 nanosec (150ms)

Peter Norvig first published these numbers some years ago in Teach Yourself Programming in Ten Years. While technology changes over the decades, the order-of-magnitude differences between these numbers remain as devastatingly accurate as ever.

These numbers reveal something critical for engineering leaders to know: network operations are over a million times slower than memory operations. In our link checker, every HTTP request was a network operation, meaning we were dealing with the slowest possible operation for a process that needed to run fast and efficiently in CI/CD.

A single-thread crawler workflow creates an inherent bottleneck:

Fetch HTML from a page (network-bound operation)
Parse links from the HTML content
Validate each link by making HTTP requests (more network-bound operations)
Track visited links to avoid duplicate work
Repeat for every page found

Mapping out the execution flow makes the issue clear to see: this process was fundamentally serial, with network latency dominating every step. For a site like OWASP.org with over 12,000 links, this meant potential runtime measured in hours, not minutes.
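To make the serial shape of that workflow concrete, here is a minimal Python sketch. It is not the code of the original prototype: crawl_serially, fetch_links, and the in-memory SITE dictionary are hypothetical names used for illustration, and fetch_links stands in for the network-bound "fetch and parse" step so the example runs offline.

```python
from collections import deque
from typing import Callable, Iterable


def crawl_serially(
    start_url: str,
    fetch_links: Callable[[str], Iterable[str]],
) -> list[str]:
    """Visit every reachable page one at a time, in discovery order.

    fetch_links is a hypothetical stand-in for "fetch the HTML and parse
    its links"; in a real crawler this is the network-bound step.
    """
    visited: set[str] = set()  # track visited links to avoid duplicate work
    queue: deque[str] = deque([start_url])
    order: list[str] = []

    while queue:
        url = queue.popleft()
        if url in visited:
            continue
        visited.add(url)
        order.append(url)
        # Nothing else happens while this call waits on the network.
        for link in fetch_links(url):
            if link not in visited:
                queue.append(link)
    return order


# A tiny in-memory "site" so the sketch runs without network access.
SITE = {
    "/": ["/about", "/blog"],
    "/about": ["/"],
    "/blog": ["/about", "/blog/post"],
    "/blog/post": [],
}

print(crawl_serially("/", lambda url: SITE.get(url, [])))
# ['/', '/about', '/blog', '/blog/post']
```

Every iteration of the loop blocks on a single fetch, which is why total runtime grows linearly with the number of links.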

Checking these links serially would guarantee a performance bottleneck, and bottlenecks like this cascade through entire organizations, affecting developer productivity, deployment confidence, and ultimately, in the case of OWASP, our ability to deliver on our security mission effectively.

How Bottlenecks Cascade Through Engineering Organizations

How long would it have taken to check all 12,000 links on OWASP.org with a single-thread web crawler? We can make a rough estimate:

      150 milliseconds per network request
 x 12,000 links on OWASP.org
---------
1,800,000 milliseconds (30 minutes minimum)

A whole half hour, just for the network tasks. In the real world it would likely be much slower than that, since web pages are frequently much larger than one packet.
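The same back-of-the-envelope estimate can be written as a small Python function, which makes it easy to plug in other numbers. The function name and the workers parameter are my own additions for illustration; dividing by the number of workers assumes perfectly parallel requests with no overhead, which is an idealized lower bound rather than a prediction.

```python
def network_time_minutes(link_count: int, latency_ms: float, workers: int = 1) -> float:
    """Lower bound on time spent waiting on the network, in minutes.

    Assumes one round trip per link and, for workers > 1, ideal
    parallelism with no scheduling or rate-limiting overhead.
    """
    total_ms = link_count * latency_ms / workers
    return total_ms / 1000 / 60


print(network_time_minutes(12_000, 150))              # 30.0
print(network_time_minutes(12_000, 150, workers=50))  # idealized parallel case
```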

When your CI/CD pipeline includes a (very conservative minimum) 30-minute bottleneck, the impact extends far beyond technical metrics. Several things happen:

First, your feedback loops become painfully long. Contributors push changes and then wait more than half an hour to learn if they’ve broken anything. This delays iteration, reduces deployment confidence, and ultimately makes your team more conservative about shipping improvements.

Second, to add insult to injury, the financial impact compounds quickly. In serverless environments like AWS Lambda, compute time directly translates to cost. A process that takes 30 minutes instead of seconds doesn’t just waste time—it multiplies your infrastructure costs dramatically.

Source: Understanding and Controlling AWS Lambda Costs

But the hidden cost is team productivity. When your deployment pipeline has unpredictable bottlenecks, teams start working around them. They try to batch changes into huge PRs instead of making small incremental (and easier to merge) improvements. They skip running full test suites locally. They become hesitant to refactor or make structural improvements that might require multiple deployment cycles to validate.

Identifying and resolving bottlenecks can make the difference between teams that stall at fixing bugs and teams that ship new features fast.

Making Strategic Technology Decisions Under Constraints

This is where engineering leadership gets interesting: balancing competing constraints while making decisions that your team can actually execute on. I had to choose between Python (a comfortable language choice for everyone in the OWASP group) and Go (which offered better concurrency primitives and performance characteristics).

The decision matrix looked like this:

Team familiarity: Python had broad adoption across OWASP contributors
Performance requirements: Go’s goroutines made concurrent programming more straightforward
Maintainability: We needed something contributors could debug and extend
Long-term scalability: The solution needed to handle growing content without constant optimization

I chose to prototype the link checker in both languages. I built a multithreaded Python version that I dubbed Hydra, and a Go version that took advantage of goroutines. This gave us concrete data to inform the decision rather than relying on assumptions. This approach—building multiple solutions to validate architectural choices—is something I’ve found invaluable for critical infrastructure decisions.

Designing Solutions That Scale With Your Team

The good news is that once you identify a bottleneck, you can resolve it. Whether it’s scaling work efficiently across your team, code reviews, incident response, or in our case, link validation, the principle is the same: address the slowest operation.

Think of our single-thread web crawler as if it were one person handling all the work sequentially. The work gets done, but one person doesn’t scale well to thousands of requests. Working in serial, each request has to wait for the previous one to complete, creating an artificial constraint where we’re limited by the slowest individual operation.

Thankfully, link validation is an embarrassingly parallel problem. Each link can be checked independently, which means we could distribute the work across multiple concurrent workers, like having several people split up the work to help it go faster. In computing, one common way to do this is multithreading.

By designing for concurrency from the start and building a multithreaded link checker, we’d have a solution that could scale with different deployment environments, handle varying load patterns, and remain responsive even as OWASP’s content grew.
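As a rough illustration of the idea in Python, the sketch below spreads link checks across a thread pool using the standard library’s concurrent.futures. This is not Hydra’s actual code: check_all and fake_check are hypothetical names, and fake_check sleeps to imitate network latency instead of making real requests.

```python
import time
from concurrent.futures import ThreadPoolExecutor
from typing import Callable


def check_all(
    links: list[str],
    check_link: Callable[[str], bool],
    max_workers: int = 8,
) -> list[str]:
    """Return the links for which check_link reports a failure."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # map() preserves input order, so results line up with links.
        results = list(pool.map(check_link, links))
    return [link for link, ok in zip(links, results) if not ok]


def fake_check(link: str) -> bool:
    """Hypothetical checker: sleeps to imitate latency, no network used."""
    time.sleep(0.05)
    return not link.endswith("/broken")


links = [f"https://example.com/page{i}" for i in range(15)]
links.append("https://example.com/broken")

start = time.perf_counter()
broken = check_all(links, fake_check)
elapsed = time.perf_counter() - start
print(broken, f"{elapsed:.2f}s")
```

Because the threads spend nearly all their time waiting, sixteen simulated requests with eight workers should finish in roughly the time of two sequential requests rather than sixteen.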

To illustrate, here are some snippets from the Go implementation. They incorporate some architectural insights that are relevant for any engineering leader designing concurrent systems.

1. Safe Concurrent Access

type Checker struct {
    startDomain             string
    brokenLinks             []Result
    visitedLinks            map[string]bool
    workerCount, maxWorkers int
    sync.Mutex
}

The sync.Mutex ensures our shared state remains consistent across goroutines, while the visitedLinks map uses O(1) lookup time to avoid creating new bottlenecks as our dataset grows.

When optimizing one constraint like network latency, make sure you’re not inadvertently creating new constraints elsewhere—like O(n) lookup times that degrade performance as your data grows.
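For readers more at home in Python, a loosely equivalent structure might look like the sketch below: a set guarded by a lock. The VisitedLinks class and its add_if_new method are hypothetical names for illustration, not part of Hydra or the Go implementation.

```python
import threading


class VisitedLinks:
    """A set guarded by a lock so that many threads can share it safely."""

    def __init__(self) -> None:
        self._lock = threading.Lock()
        self._seen: set[str] = set()

    def add_if_new(self, url: str) -> bool:
        """Record url and return True only for the first caller to do so."""
        with self._lock:
            if url in self._seen:  # average O(1) membership test
                return False
            self._seen.add(url)
            return True

    def __len__(self) -> int:
        with self._lock:
            return len(self._seen)


visited = VisitedLinks()
claims: list[bool] = []


def worker() -> None:
    # Every thread tries to claim the same 100 URLs.
    for i in range(100):
        claims.append(visited.add_if_new(f"/page/{i}"))


threads = [threading.Thread(target=worker) for _ in range(4)]
for t in threads:
    t.start()
for t in threads:
    t.join()

print(len(visited), sum(claims))  # 100 100
```

Doing the membership check and the insert under the same lock is what guarantees each URL is claimed by exactly one worker.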

2. Throttling

for i := range toProcess {
    wg.Add(1)
    checker.addWorker()
    go worker(i, &checker, &wg, toProcess)
    if checker.workerCount > checker.maxWorkers {
        time.Sleep(1 * time.Second) // throttle down
    }
}
wg.Wait()

Even when you can parallelize work, you need to respect system boundaries. Too many concurrent HTTP requests could overwhelm target servers or trigger rate limiting, so we built in backpressure to ensure our optimization doesn’t create problems for others. This is an effective way to balance between performance and being a good network citizen.
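A similar cap can be sketched in Python. Note that this swaps in a different technique from the Go snippet: rather than sleeping when the worker count runs high, it uses a bounded semaphore so that no more than a fixed number of requests are ever in flight. InFlightLimiter and fake_request are hypothetical names, and the peak counter exists only to show that the cap holds.

```python
import threading
import time
from concurrent.futures import ThreadPoolExecutor


class InFlightLimiter:
    """Caps concurrent calls and records the highest concurrency observed."""

    def __init__(self, limit: int) -> None:
        self._slots = threading.BoundedSemaphore(limit)
        self._lock = threading.Lock()
        self._active = 0
        self.peak = 0

    def run(self, task, *args):
        with self._slots:  # blocks while `limit` calls are already in flight
            with self._lock:
                self._active += 1
                self.peak = max(self.peak, self._active)
            try:
                return task(*args)
            finally:
                with self._lock:
                    self._active -= 1


def fake_request(url: str) -> str:
    """Hypothetical request: sleeps instead of touching the network."""
    time.sleep(0.01)
    return url


limiter = InFlightLimiter(limit=3)
urls = [f"https://example.com/{i}" for i in range(20)]

# Ten threads are available, but the limiter admits only three at a time.
with ThreadPoolExecutor(max_workers=10) as pool:
    done = list(pool.map(lambda u: limiter.run(fake_request, u), urls))

print(len(done), limiter.peak)
```

Keeping the limit separate from the pool size makes it possible to tune politeness toward target servers without changing how the work is scheduled.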

Measuring Impact: The Results That Matter for Engineering Teams

To obtain some concrete data, I compared the numbers between three implementations: a prototype single-thread Python program, the multithreaded Hydra version, and an implementation written in Go. The performance data from our three implementations tells a story about strategic technology choices and their organizational impact. Here’s a comparison run against my website with its few hundred links:

Single-Threaded Python Prototype

time python3 slow-link-check.py https://victoria.dev

real 17m34.084s
user 11m40.761s
sys     0m5.436s

Seventeen minutes for a site much smaller than OWASP.org meant our original approach would have been completely unusable in a CI/CD context.

Hydra: Multithreaded Python Version

time python3 hydra.py https://victoria.dev

real 1m13.358s
user 0m13.161s
sys     0m2.826s

The concurrency improvements brought us down to just over a minute, a roughly 14x improvement that made CI/CD integration viable.

Go Implementation

time ./go-link-check --url=https://victoria.dev

real 0m7.926s
user 0m9.044s
sys     0m0.932s

Eight seconds. This performance improvement fundamentally changed how teams could interact with the tool. With this level of efficiency, link checking could become part of every deployment without friction. Contributors wouldn’t think twice about running it locally. Instead of being a barrier, link checking would be invisible infrastructure.

As fun as it is to simply enjoy the speedups, we can directly relate these results to everything we’ve discussed so far. Consider taking a process that used to soak up seventeen minutes and turning it into an eight-second affair instead. Not only will that give developers a much shorter and more efficient feedback loop, it gives teams the ability to develop faster while costing less. To drive the point home: a process that runs in seventeen-and-a-half minutes instead of eight seconds will also cost over a hundred and thirty times more to run.
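The ratios can be checked directly from the real times reported above. The short Python sketch below parses the output format of time and divides; parse_real_time is a hypothetical helper, and treating cost as proportional to runtime assumes a platform that bills by duration.

```python
import re


def parse_real_time(value: str) -> float:
    """Convert a `time` value such as '17m34.084s' into seconds."""
    match = re.fullmatch(r"(\d+)m(\d+(?:\.\d+)?)s", value)
    if match is None:
        raise ValueError(f"unrecognized time format: {value!r}")
    minutes, seconds = match.groups()
    return int(minutes) * 60 + float(seconds)


# The `real` values from the three runs shown earlier.
serial = parse_real_time("17m34.084s")
hydra = parse_real_time("1m13.358s")
go = parse_real_time("0m7.926s")

print(f"Hydra vs single-threaded: {serial / hydra:.1f}x")
print(f"Go vs single-threaded:    {serial / go:.1f}x")
print(f"Go vs Hydra:              {hydra / go:.1f}x")
```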

These numbers represent more than technical metrics. They show how strategic performance optimization can transform a tool from something teams avoid to something they rely on.

The Leadership Framework: Turning Performance Wins Into Organizational Impact

The 130x performance improvement we achieved demonstrates a leadership approach to identifying and breaking the bottlenecks that affect entire engineering organizations.

When engineering leaders see a 17-minute process become an 8-second process, we should be asking: what other critical workflows are creating similar friction? Where else are teams working around inefficient processes instead of addressing them? How many small compounding delays are preventing our organization from shipping quality software consistently?

The OWASP link checker became a case study for our broader infrastructure strategy. We learned that volunteer contributors were more likely to maintain content quality when the feedback loop was immediate. We discovered that CI/CD performance directly influenced how teams approached incremental improvements versus risky big-batch changes. Most importantly, we proved that strategic performance optimization could transform organizational behavior.

Start with understanding the human and organizational impact, design solutions that respect team constraints, and measure success by adoption and workflow improvement. When you can turn a deployment blocker into invisible infrastructure, you’re optimizing both code and organizational dynamics: removing friction frees your entire team to focus on delivering value rather than fighting with tools.

Command line tricks for managing your messy open source repository

2020-02-17T08:05:06-05:00

Effective collaboration, especially in open source software development, starts with effective organization. To make sure that nothing gets missed, “one issue, one pull request” is a nice rule of thumb.

Instead of opening an issue with a large scope like, “Fix all the broken links in the documentation,” open source projects will have more luck attracting contributors with several smaller and more manageable issues. In the preceding example, you might scope broken links by section or by page. This allows more contributors to jump in and dedicate small windows of their time, rather than waiting for one person to take on a larger and more tedious contribution effort.

Smaller scoped issues also help project maintainers see where work has been completed and where it hasn’t. This reduces the chances that some part of the issue is missed, assumed to be completed, and later leads to bugs or security vulnerabilities.

That’s all well and good; but what if you’ve already opened several massively-scoped issues, some PRs have already been submitted or merged, and you currently have no idea where the work started or stopped?

It’s going to take a little sorting out to get the state of your project back under control. Thankfully, there are a number of command line tools to help you scan, sort, and make sense of a messy repository. Here’s a small selection of ones I use.

Jump to:

Interactive search-and-replace with vim
Find dead links in Markdown files with a node module
List subdirectories with or without a git repository with find
Pull multiple git repositories from a list with xargs
List issues by number with jot
CLI-powered open source organization

Interactive search-and-replace with `vim`

You can open a file in Vim, then interactively search and replace with:

:%s/\<word\>/newword/gc

The % indicates to look in all lines of the current file; s is for substitute; \<word\> matches the whole word; and the g for “global” is for every occurrence. The c at the end will let you view and confirm each change before it’s made. You can run it automatically, and much faster, without c; however, you put yourself at risk of complicating things if you’ve made a pattern-matching error.

Find dead links in Markdown files with a node module

The markdown-link-check node module has a great CLI buddy.

I use this so often I turned it into a Bash alias function. To do the same, add this to your .bashrc:

# Markdown link check in a folder, recursive
function mlc () {
    find "$1" -name \*.md -exec markdown-link-check -p {} \;
}

Then run with mlc .

List subdirectories with or without a git repository with `find`

Print all subdirectories that are git repositories, or in other words, have a .git in them:

find . -maxdepth 1 -type d -exec test -e '{}/.git' ';' -printf "is git repo: %p\n"

To print all subdirectories that are not git repositories, negate the test with !:

find . -maxdepth 1 -type d -exec test '!' -e '{}/.git' ';' -printf "not git repo: %p\n"
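The -printf action is a GNU find extension, so these one-liners may not work with other versions of find. Where that is a problem, a rough Python equivalent is sketched below. split_by_git is a hypothetical helper name, and unlike the find commands it only looks at immediate subdirectories and does not report the starting directory itself.

```python
import tempfile
from pathlib import Path


def split_by_git(parent: Path) -> tuple[list[str], list[str]]:
    """Return (repo names, non-repo names) for immediate subdirectories."""
    repos: list[str] = []
    others: list[str] = []
    for child in sorted(parent.iterdir()):
        if not child.is_dir():
            continue  # skip plain files
        (repos if (child / ".git").exists() else others).append(child.name)
    return repos, others


# Demonstrate on a throwaway directory tree rather than a real workspace.
with tempfile.TemporaryDirectory() as tmp:
    root = Path(tmp)
    (root / "project-a" / ".git").mkdir(parents=True)
    (root / "notes").mkdir()
    print(split_by_git(root))  # (['project-a'], ['notes'])
```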

Pull multiple git repositories from a list with `xargs`

I initially used this as part of automatically re-creating my laptop with Bash scripts, but it’s pretty handy when you’re working with cloud instances or Dockerfiles.

Given a file, repos.txt with a repository’s SSH link on each line (and your SSH keys set up), run:

xargs -n1 git clone < repos.txt

If you want to pull and push many repositories, I previously wrote about how to use a Bash one-liner to manage your repositories.

List issues by number with `jot`

I’m a co-author and maintainer for the OWASP Web Security Testing Guide repository where I recently took one large issue (yup, it was “Fix all the broken links in the documentation” - how’d you guess?) and broke it up into several smaller, more manageable issues. A whole thirty-seven smaller, more manageable issues.

I wanted to enumerate all the issues that the original one became, but the idea of typing out thirty-seven issue numbers (#275 through #312) seemed awfully tedious and time-consuming. So, in natural programmer fashion, I spent the same amount of time I would have used to type out all those numbers and crafted a way to automate it instead.

The jot utility (apt install athena-jot) is a tiny tool that’s a big help when you want to print out some numbers. Just tell it how many you want, and where to start and stop.

# jot [ reps [ begin [ end ] ] ]
jot 37 275 312

This prints 37 numbers from 275 through 312, each on a new line. Since that range spans 38 integers, one of them is skipped (#294 in the output below). To make these into issue number notations that GitHub and many other platforms automatically recognize and turn into links, you can pipe the output to awk.

jot 37 275 312 | awk '{printf "#"$0", "}'

#275, #276, #277, #278, #279, #280, #281, #282, #283, #284, #285, #286, #287, #288, #289, #290, #291, #292, #293, #295, #296, #297, #298, #299, #300, #301, #302, #303, #304, #305, #306, #307, #308, #309, #310, #311, #312
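If jot isn’t available on your system, the same kind of list can be produced with a few lines of Python. This is only a sketch, and issue_refs is a hypothetical helper name. It differs from the pipeline above in two ways: it emits every integer in the range (38 numbers for 275 through 312) and it leaves no trailing separator.

```python
def issue_refs(first: int, last: int) -> str:
    """Return '#first, ..., #last' covering every integer in the range."""
    return ", ".join(f"#{n}" for n in range(first, last + 1))


refs = issue_refs(275, 312)
print(refs)
```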

You can also use jot to generate random or redundant data, mainly for development or testing purposes.

CLI-powered open source organization

A well-organized open source repository is a well-maintained open source project. Save this post for handy reference, and use your newfound CLI superpowers for good! 🚀

Why PixelFed won't save us from Instagram

2020-02-16T19:23:20-05:00

PixelFed is a decentralized photo sharing network based on the ActivityPub protocol, the same one that Mastodon uses. For a lot of people divorced (or wanting to be) from Instagram over mental health concerns and issues like forced consent to post-GDPR terms, a decentralized social network like PixelFed sounds like an exciting and promising alternative.

Personally, I stopped using Instagram once I accepted the fact that its core premise and integral structure of social interaction was encouraging me to form habits that were harmful to my life goals. I’m not alone - studies have shown that people are happier after deleting apps like Facebook. The reasons for this don’t differ greatly from why any social network can be bad for you - they’re just found in much greater intensity on photo sharing sites, and specifically Instagram.

It is still early days for PixelFed. As I write this I have no way to know what kind of network it will become, or even if it will survive at all. I do know, however, that there are many glaring and fundamental problems that a decentralized photo sharing network like PixelFed won’t solve. To elaborate, I’m going to discuss what makes Instagram so poisonous to health, why centralized social networks aren’t likely to ever be healthy, and why decentralized social networks have a very slim chance of being better.

Why Instagram is bad for your health

Let’s start with the basics. Your brain responds very differently to reading text than it does to looking at images.

It doesn’t take more than a quick search to find hundreds of articles and studies about how reading can make you smarter, more empathetic, and stave off cognitive decline by improving brain connectivity. In essence, reading involves a multitude of brain regions including the temporal and frontal lobes. There’s still a lot to be discovered about the human brain, but here’s what we think we know. The frontal lobes control important cognitive skills like emotional expression, problem solving, memory, language, judgement, and sexual behaviors. The temporal lobes handle important functions such as the encoding of memory, and processing emotions and visual perception.

In other words, reading text - even on social media - stimulates your brain and makes you think about the information you’re taking in. To react to words on a page, you first have to read them and form thoughts about them.

Unlike reading, looking at an image has a very different effect on your brain. Here’s an infographic about infographics that covers some of these effects. Basically, millennia of evolution have produced human brains hardwired to respond quickly to visual stimuli - in less than 1/10 of a second. As the infographic will literally show you, almost 50% of the brain is involved in visual processing, and 70% of all our sensory receptors are in our eyes. That’s a lot of resources devoted to quickly processing visuals. Why could this be bad?

Unlike times past, we’re no longer (day-to-day) concerned with spotting a tiger in the bushes about to pounce on us. The near-instant processing time needed to discern if that shivering tree branch is just the wind or impending mortal danger is outdated in our current living arrangements. Our brain, however, doesn’t know that. It hasn’t evolved faster than our technologies or society. The downside to this is that anyone with a little knowledge of this fundamental flaw in the human mind is able to exploit it.

Advertisers call this exploitation, “visual marketing.”

These linked articles are stuffed with the same factoids over and over again. “The brain processes images 60,000 times faster than text.” “90% of the information sent to the brain is visual.” Whether or not these numbers are accurate, it’s clearly provable that visual marketing is on the whole more effective than advertisements without images. There’s a reason for it, and it should scare you.

Unlike reading, which involves regions of your brain responsible for comprehension, decision making, and emotional control, images are processed by different areas of the brain. Visual input travels from our eyes through our optic nerves to the thalamus (or LGN, Lateral Geniculate Nucleus) and the superior colliculus. From the thalamus, it proceeds to the visual cortex at the rear of our brains, where the image is processed. Effectively, viewing images does not make us think in the same way that reading does. In other words, it’s easy to do.

Let me be clear. This difference in the way words and images are processed is not, in itself, bad. A photo-centric social network is not, in itself, bad. Images and words alike have the power to evoke strong emotions, send powerful messages, spark revolutions, and spur progress. This is good… if it’s used for good.

Instagram, a photo-centric network chock full of product placements, paid sponsorships, and outright advertisements, is a social network primarily designed to bypass your cognitive thinking and sell you stuff.

I don’t think Instagram started out with the same motivations it has now. Along with all the photo sharing networks that blossomed when Instagram first got popular, I still believe its initial vision was to make sharing photos with your friends fun and easy.

It just got too popular.

Why centralized networks are bad for your health

In the wake of privacy concerns over the last few years, new uproar over algorithm-driven timelines, and the #DeleteFacebook, #DeleteTwitter, and #DeleteInstagram movements, more people today are aware of how networks that make their money on your data are bad for your health. This is in part due to their centralized nature - one hierarchy of authority makes decisions for the whole system, and at the same time, has to support it. It’s expensive to support millions of users, so it’s no wonder that the network’s main concern (and let’s just consider the most innocent case) is to remain profitable.

What’s a good way to remain profitable?

Take a human desire, preferably one that has been around for a really long time… Identify that desire and use modern technology to take out steps. – Evan Williams, co-founder of Twitter and Blogger

Quoted in Wired article, 2013, “Twitter Founder Reveals Secret Formula for Getting Rich Online”

There’s a book called Hooked: How to Build Habit-Forming Products which, if you’re ever in the mood for a good horror flick, you should curl up in bed with some popcorn and read.

The book details a simple model for a habit-forming product. The model is cyclical, and has the following key points: a trigger, an action, variable reward, and investment. In summary, if a product can get you to think of it, leading to some action that is easier to do than to think about, give you a reward for that action some of the time, and then compel you to commit or invest in it - you’re hooked.

If you’re paying attention, you might notice I’ve described Instagram. And Twitter. And Facebook. And every other social network.

There’s a reason it’s easy to use Instagram, easy to post a tweet, easy to browse Facebook. These products have been designed to make it easy for you to use them. They’ve been designed to alter your behavior to better suit the product’s goals.

This industry employs some of the smartest people, thousands of Ph.D. designers, statisticians, engineers. They go to work every day to get us to do this one thing, to undermine our willpower. – James Williams, co-founder of Time Well Spent

Quoted in Nautilus article, 2017, “Modern Media Is a DoS Attack on Your Free Will”

At the heart of the idea of getting you hooked is the concept of a dopamine feedback loop. Dopamine, an organic chemical neurotransmitter in your brain, is thought to be responsible for allowing us to anticipate the reward to an action. It inspires us to get a glass of water when we’re thirsty, for example, and may help us to feel good when we take actions towards doing so. Where dopamine is so effectively misused is in the practice of providing variable rewards to drive social media addiction.

Unlike getting a glass of water when you’re thirsty, variable rewards are random. It’s as if drinking water sometimes, but not always, cured your thirst. This effectively programs your mind to pursue the action that results in the unpredictable reward. Since getting the reward isn’t guaranteed, you need to make more attempts to achieve success. Social media is designed to make these variable dopamine hits easy to obtain. It’s designed to hijack your intellectual independence in order to keep you on the network.

Especially when the main goal of a centralized social network is to make a profit, that network is exploiting evolutionary flaws in your brain to make that profit from you. You are literally being hacked.

Now combine this information with the knowledge of how a product comprised primarily of images bypasses your cognitive thinking. Not only are you being hacked, but your main defense system is being easily, laughably, circumvented.

Exploiting users is a particularly compelling temptation for any social networks under pressure to make a profit, and this pressure is amplified in organizations with a centralized structure. Not all centralized networks do this, but undoubtedly, the very successful ones do.

Decentralization is by no means a fix for exploitation and greed, but a decentralized social network might have a few things going for it.

Why decentralized networks might be slightly better for your health

The main issues present in Twitter, Facebook, and Instagram as pertains to social media addiction do not go away on decentralized networks. I’m personally, currently, using both Twitter and Mastodon. The former is centralized and the latter is decentralized, but the same motivations that could get me in trouble on one platform apply to both. Decentralization does not fix the problem.

It might help.

Unlike a centralized, single-hierarchy, definitely-for-profit social network, decentralization has one thing going for it: more people. Specifically, more instance owners who are in control of their instances.

Running a Mastodon instance is a responsibility, should you choose to accept it. Besides the server itself, each instance requires its own set of rules and code of conduct, which, like the often-adopted mastodon.social code of conduct, can be collaboratively drafted by the community. Mastodon provides instance owners with moderation tools and provides users with reporting tools, and there’s an expectation that they’ll both be used. As with other decentralized social networks, it is the responsibility of the instance owner to moderate and foster a social environment that serves the best interests of the instance users.

Instances typically run on donations, and in the grand scheme of things, are inexpensive to support. Decentralization means that instance owners individually have to bear smaller costs. There’s no central body being pressured to make a profit in order to run servers that support millions of users. The effect of this many-owners structure is that decisions that concern any particular instance and rules that it might want to adopt are made by that instance’s community, or the instance owner. If a user disagrees with the direction taken, they can communicate directly with the instance owner, or simply move to another instance. There’s no “take it or leave it,” and no forced acceptance of terms. Users always have somewhere else to go.

This, in general, means that over many instances, and via many moderators, more people from diverse backgrounds with a collection of both overlapping and contrasting interests are able to have a voice in how the social network evolves.

If instance owners have their users’ best interests, not addiction, in mind; if moderators act responsibly, and according to their instance rules, moderate for good; and if a wide and varying selection of instances with differing interests, political viewpoints, and topics continue to be available; then decentralized social networks might be better for your health.

All social networks have the potential to do more good than harm, but it is up to those who control them to put in the constant, proactive effort required to make that happen. Twitter has recently been making some steps towards becoming a healthier network, like banning political ads and highlighting manipulated media. I think they’re ahead of the curve. With decentralized social networks, there are at least more chances that instance owners truly want to do more good than harm with their own little piece of the whole.

While photo sharing networks will, by their essential nature, bypass cognitive thinking and have an advantage over their users that way, there are many design considerations that PixelFed can implement in order to make the network healthier. Features such as comments, likes, timelines, and push notifications can be designed to provide utility more than drive addiction, and there are designers more qualified than I who can tell you how.

These networks will have to constantly resist the temptation to take the easy route. They will have to work to avoid success based on the exploitation of their users’ desires to chase the easy dopamine hit. They will have to prioritize the ability of the social network to add real value to the lives of its users - at the expense of its own potential to garner mindless, meaningless popularity.

This is in no way a condemnation of PixelFed or any other decentralized photo sharing network. Personally, I sincerely hope they succeed in giving users a healthy, safe, and free-as-in-freedom network for sharing photos with friends, and with the rest of the federated community. It will require considered design with mental health at the forefront; the active, caring effort of moderators and instance owners; and ongoing collaboration from the federated community at large to work together to build for the greater good.

A photo-sharing social media network that does more good than harm? It’s possible. But it won’t be easy.

The past ten years, or, how to get better at anything

2019-12-31T08:27:31-04:00

If you want to get better at anything:

Solve your own problems,
Write about it,
Teach others.

1. Searching, a decade ago

I was a young graduate with newly-minted freedoms, and I was about to fall in love. I had plenty of imagination, a couple handfuls of tenacity, and no sense of direction at all.

For much of my youth, when I encountered a problem, I just sort of bumped up against it. I tried using whatever was in my head from past experiences or my own imagination to find a solution. For some problems, like managing staff duties at work, my experience was sufficient guidance. For other, more complicated problems, it wasn’t.

When you don’t have a wealth of experience to draw upon, relying on it is a poor strategy. Like many people at my age then, I thought I knew enough. Like many people at my age now, I recognize how insufficient “enough” can be. A lack of self-directed momentum meant being dragged in any direction life’s currents took me. When falling in love turned out to mean falling from a far greater height than I had anticipated, I tumbled on, complacent. When higher-ups at work handed me further responsibilities, I accepted them without considering if I wanted them at all. When, inevitably, life became more and more complicated, I encountered even more problems I didn’t know how to solve. I felt stuck.

Though I was morbidly embarrassed about it at the time, I’m not shy to say it now. At one point, it had to be pointed out to me that I could search the Internet for the solution to any of my problems. Anything I wanted to solve - interactions with people at work, a floundering relationship, or the practicalities of filing taxes - I was lucky enough to have the greatest collection of human knowledge ever assembled at my disposal.

Instead of bumbling along in the flotsam of my own trial and error, I started to take advantage of the collective experiences of all those who have been here before me. They weren’t always right, and I often found information only somewhat similar to my own experience. Still, it always got me moving in the right direction. Eventually, I started to steer.

There’s a learning curve, even when just searching for a problem. Distilling the jumble of confusion in your head to the right search terms is a learned skill. It helped me to understand how search engines like Google work:

We use software known as web crawlers to discover publicly available webpages. Crawlers look at webpages and follow links on those pages, much like you would if you were browsing content on the web. They go from link to link and bring data about those webpages back to Google’s servers…

When crawlers find a webpage, our systems render the content of the page, just as a browser does. We take note of key signals — from keywords to website freshness — and we keep track of it all in the Search index.

Sometimes, I find what I need by using the right keyword. Other times, I discover the keyword by searching for text that might surround it on the content of the page. For software development, I search for the weirdest word or combination of words attached to what I’m trying to learn. I rarely find whole solutions in my search results, but I always find direction for solving the problem myself.

Solving my own problems, even just a few little ones at a time, gave me confidence and built momentum. I began to pursue the experiences I wanted, instead of waiting for experiences to happen to me.

2. Updating the Internet, some years ago

I’d solved myself out of a doomed relationship and stagnant job. I found myself, rather gleefully, country-hopping with just one backpack of possessions. I met, though I didn’t know it at the time, my future husband. I found a new sense of freedom, of having options, that I knew I never wanted to give up. I had to find a means to sustain myself by working remotely.

When I first tried to make a living on the Internet, I felt like a right amateur. Sitting on the bed, hunched over my laptop, I started a crappy WordPress blog with a modified theme that didn’t entirely work. I posted about how I tried and failed to start a dropshipping business. My site was terrible, and I knew it. My first forays into being a “real” developer were to solve my own problems: how to get my blog working, how to set up a custom domain, how to get and use a security certificate. I found some guidance in blogs and answers that others had written, but much of it was outdated, or not entirely correct. Still, it helped me.

I can’t imagine a world in which people did nothing to pass on their knowledge to future generations. Our stories are all we have beyond instinct and determination.

I stopped posting about dropshipping and started writing about the technical problems I was solving. I wrote about what I tried, and ultimately what worked. I started hearing from people who thanked me for explaining the solution they were looking for. Even in posts where all I’d done was link to the correct set of instructions on some other website, people thanked me for leading them to it. I still thought my website was terrible, but I realized I was doing something useful. The more problems I solved, the better I got at solving them, and the more I wrote about it in turn.

One day, someone offered me money for one of my solutions. To my great delight, they weren’t the last to do so.

As I built up my skills, I started taking on more challenging offers to solve problems. I discovered, as others have before me, that especially in software development, not every solution is out there waiting for you. The most frustrating part of working on an unsolved problem is that, at least to your knowledge, there’s no one about to tell you how to solve it. If you’re lucky, you’ve at least got a heading from someone’s cold trail in an old blog post. If you’re lucky and tenacious, you’ll find a working solution.

Don’t leave it scribbled in the corner of a soon-forgotten notepad, never to ease the path of someone who comes along later. Update that old blog post by commenting on it, or sending a note to the author. Put your solution on the Internet, somewhere. Ideally, blog about it yourself in as much detail as you can recall. Some of the people who find your post might have the same problem, and might even be willing to pay you to solve it. And, if my own experience and some scattered stories hold true, one of the people who’ll come along later, looking for that same solution, will be you.

3. Paying it forwards, backwards, and investing; two years ago

Already being familiar with how easy it is to stop steering and start drifting, I sought new ways to challenge myself and my skills. I wanted to do more than just sustain my lifestyle. I wanted to offer something to others; something that mattered.

A strange thing started happening when I decided, deliberately, to write an in-depth technical blog about topics I was only beginning to become familiar with. I started to deeply understand some fundamental computer science topics - and trust me, that was strange enough - but odder than that was that others started to see me as a resource. People asked me questions because they thought I had the answers. I didn’t, at least, not always - but I knew enough now to not let that stop me. I went to find the answers, to test and understand them, and then I wrote about them to teach those who had asked. I hardly noticed, along the way, that I was learning too.

When someone’s outdated blog post leads you to an eventual solution, you can pay them back by posting an update, or blogging about it yourself. When you solve an unsolved problem, you pay it forward by recording that solution for the next person who comes along (sometimes you). In either case, by writing about it - honestly, and with your best effort to be thorough and correct - you end up investing in yourself.

Explaining topics you’re interested in to other people helps you find the missing pieces in your own knowledge. It helps you fill those gaps with learning, and integrate the things you learn into a new, greater understanding. Teaching something to others helps you become better at it yourself. Getting better at something - anything - means you have more to offer.

The past decade, and the next decade

It’s the end of a decade. I went from an aimless drift through life to being captain of my ship. I bettered my environment, learned new skills, made myself a resource, and became a wife to my best friend. I’m pretty happy with all of it.

It’s the end of 2019. Despite a whole lot of life happening just this year, I’ve written one article on this blog for each week since I started in July. That’s 23 articles for 23 weeks, plus one Christmas bonus. I hear from people almost every day who tell me that an article I wrote was helpful to them, and it makes me happy and proud to think that I’ve been doing something that matters. The first week of January will make this blog two years old.

The past several months have seen me change tack, slightly. I’ve become very interested in cybersecurity, and have been lending my skills to the Open Web Application Security Project. I’m now an author and maintainer of the Web Security Testing Guide, version 5. I’m pretty happy with that, too.

Next year, I’ll be posting a little less, though writing even more, as I pursue an old dream of publishing a book, as well as develop my new cybersecurity interests. I aim to get better at quite a few things. Thankfully, I know just how to do it - and now, so do you:

Solve your own problems,
Write about it,
Teach others.

Have a very happy new decade, dear reader.

Three healthy cybersecurity habits

2019-12-26T08:27:31-04:00

In a similar fashion to everyone getting the flu now and again, the risk of catching a cyberattack is a common one. Both a sophisticated social engineering attack and a grammatically-lacking email phishing scam can cause real damage. No one who communicates over the Internet is immune.

Like proper hand washing and getting a flu shot, good habits can lower your risk of inadvertently allowing cybergerms to spread. Since the new year is an inspiring time for beginning new habits, I offer a few suggestions for ways to help protect yourself and those around you.

1. Get a follow-up

Recognizing a delivery method for a cyberattack is getting more difficult. Messages with malicious links do not always come from strangers. They may appear to be routine communications, or seem to originate from someone you know or work with. Attacks use subtle but deeply ingrained cognitive biases to override your common sense. Your natural response ensures you click.

Thankfully, there’s a simple low-tech habit you can use to deter these attacks: before you act, follow up.

You may get an email from a friend that needs help, or from your boss who’s about to get on a plane. It could be as enticing and mysterious as a direct message from an acquaintance who sends a link asking, “Lol. Is this you?” It takes presence of mind to override the panic these attacks prey on, but the deterrent itself is quick and straightforward. Send a text message, pick up the phone and call, or walk down the hall, and ask, “Did you send me this?”

If the message is genuine, there’s no harm in a few extra minutes to double check. If it’s not, you’ll immediately alert the originating party that they may be compromised, and you may have deterred a cyberattack!

2. Use, and encourage others to use, end-to-end encrypted messaging

When individuals in a neighborhood get the flu shot, others in that neighborhood are safer for it. Encryption is similarly beneficial. Encourage your friends, coworkers, and Aunt Matilda to switch to an app like Signal. By doing so, you’ll reduce everyone’s exposure to more exploitable messaging systems.

This doesn’t mean that you must stop using other methods of communication entirely. Instead, think of it as a hierarchy. Use Signal for important messages that should be trusted, like requests for money or making travel arrangements. Use all other methods of messaging, like SMS or social sites, only for “unimportant” communications. Now, if requests or links that seem important come to you through your unimportant methods, you’ll be all the more likely to second-guess them.

3. Don’t put that dirty USB plug into your ***

You wouldn’t brush your teeth with a toothbrush you found on the sidewalk. Why would you plug in a USB device if you don’t know where it’s been?! While we might ascribe putting a random found USB drive in your computer to a clever exploitation of natural human curiosity, we’re far less likely to suspect a public phone-charging station or a USB cable we bought ourselves. Even seemingly-innocuous USB peripherals or rechargeable devices can be a risk.

Unlike email and some file-sharing services that scan and filter files before they reach your computer, plugging in via USB is as direct and unprotected as connection gets. Once this connection is made, the user doesn’t need to do anything else for a whole host of bad things to happen. Through USB connections, problems like malware and ransomware can easily infect your computer or phone.

There’s no need to swear off the convenience of USB connectivity, or to avoid these devices altogether. Instead of engaging in questionable USB behavior, don’t cheap out on USB devices and cables. If it’s going to get plugged into your computer, ensure you’re being extra cautious. Buy it from the manufacturer (like the Apple Store) or from a reputable company or reseller with supply chain control. When juicing up USB-rechargeables, don’t plug them into your computer. Use a wall charger with a USB port instead.

Practice healthy cybersecurity habits

Keeping your devices healthy and happy is a matter of practicing good habits. Like battling the flu, good habits can help protect yourself and those around you. Incorporate some conscientious cybersecurity practices in your new year resolutions - or start them right away.

Have a safe and happy holiday!

Concurrency, parallelism, and the many threads of Santa Claus 🎅

2019-12-23T19:29:01-05:00

Consider the following: Santa brings toys to all the good girls and boys.

There are 7,713,468,100 people in the world in 2019, around 26.3% of which are under 15 years old. This works out to 2,028,642,110 children (persons under 15 years of age) in the world this year.

Santa doesn’t seem to visit children of every religion, so we’ll generalize and only include Christians and non-religious folks. Collectively that makes up approximately 44.72% of the population. If we assume that all kids take after their parents, then 907,208,751.6 children would appear to be Santa-eligible.

What percentage of those children are good? It’s impossible to know; however, we can work on a few assumptions. One is that Santa Claus functions more on optimism than economics and would likely have prepared for the possibility that every child is a good child in any given year. Thus, he would be prepared to give a toy to every child. Let’s assume it’s been a great year and that all 907,208,751.6 children are getting toys.

That’s a lot of presents, and, as we know, they’re all made by Santa’s elves at his North ~~China~~ Pole workshop. Given that there are 365 days in a year and one of them is Christmas, let’s assume that Santa’s elves collectively have 364 days to create and gift wrap 907,208,752 (rounded up) presents. That works out to 2,492,331.74 presents per day.

Almost two-and-a-half million presents per day is a heavy workload for any workshop. Let’s look at two paradigms that Santa might employ to hit this goal: concurrency, and parallelism.
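If you'd like to check the arithmetic so far, here is a small Python sketch. The constants are the figures quoted above; the variable names are my own and purely illustrative.

```python
import math

# Figures quoted in the article.
WORLD_POPULATION = 7_713_468_100
SHARE_UNDER_15 = 0.263          # 26.3% are under 15 years old
SHARE_SANTA_ELIGIBLE = 0.4472   # 44.72% Christian or non-religious
WORKING_DAYS = 364              # every day except Christmas

# Whole children only, as in the article's 2,028,642,110.
children = int(WORLD_POPULATION * SHARE_UNDER_15)

# The article keeps the fraction here (907,208,751.6) for fun.
eligible_children = children * SHARE_SANTA_ELIGIBLE

# Round up: nobody gets 0.6 of a present.
presents = math.ceil(eligible_children)

presents_per_day = presents / WORKING_DAYS

if __name__ == "__main__":
    print(f"children:          {children:,}")
    print(f"eligible children: {eligible_children:,.1f}")
    print(f"presents:          {presents:,}")
    print(f"presents per day:  {presents_per_day:,.2f}")
```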

A sequential process

Suppose that Santa’s workshop is staffed by exactly one, very hard working, very tired elf. The production of one present involves four steps:

Cutting wood
Assembly and glueing
Painting
Gift-wrapping

With a single elf, only one step for one present can be happening at any instant in time. If the elf were to produce one present at a time from beginning to end, that process would be executed sequentially. It’s not the most efficient method for producing two-and-a-half million presents per day; for instance, the elf would have to wait around doing nothing while the glue on the present was drying before moving on to the next step.

Concurrency

In order to be more efficient, the elf works on all presents concurrently.

Instead of completing one present at a time, the elf first cuts all the wood for all the toys, one by one. When everything is cut, the elf assembles and glues the toys together, one after the other. This concurrent processing means that the glue from the first toy has time to dry (without needing more attention from the elf) while the remaining toys are glued together. The same goes for painting, one toy at a time, and finally wrapping.

Since one elf can only do one task at a time, a single elf is using the day as efficiently as possible by concurrently producing presents.
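As a rough sketch of the single concurrent elf, here is one way to model it with Python's `asyncio`. One event loop plays the elf, and `asyncio.sleep` stands in for glue that dries without supervision. The function names and the drying time are made up for illustration.

```python
import asyncio
import time

GLUE_DRYING_SECONDS = 0.2  # made-up duration, for illustration only


async def build_present(name: str) -> str:
    # Cutting and assembly need the elf's attention (instant in this sketch).
    # While the glue dries, the elf is free to pick up another present.
    await asyncio.sleep(GLUE_DRYING_SECONDS)
    # Painting and wrapping happen once the glue is dry.
    return f"{name} wrapped"


async def one_elf_concurrently(names: list[str]) -> list[str]:
    # A single elf (one event loop, one thread) juggles every present.
    return await asyncio.gather(*(build_present(name) for name in names))


def timed_run(names: list[str]) -> tuple[list[str], float]:
    start = time.perf_counter()
    results = asyncio.run(one_elf_concurrently(names))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    results, elapsed = timed_run(["train", "doll", "robot"])
    print(results, f"{elapsed:.2f}s")
```

Three presents finish in roughly the time it takes one batch of glue to dry, rather than three times that, even though there is still only one elf.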

Parallelism

Hopefully, Santa’s workshop has more than just one elf. With more elves, more toys can be built simultaneously over the course of a day. This simultaneous work means that the presents are being produced in parallel. Parallel processing carried out by multiple elves means more work happens at the same time.

Elves working in parallel can also employ concurrency. One elf can still tackle only one task at a time, so it’s most efficient to have multiple elves concurrently producing presents.

Of course, if Santa’s workshop has, say, two-and-a-half million elves, each elf would only need to finish a maximum of one present per day. In this case, working sequentially doesn’t detract from the workshop’s efficiency. There would still be 7,668.26 elves left over to fetch coffee and lunch.
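Here is a minimal sketch of several elves working at once, using `ThreadPoolExecutor` from Python's standard library. Each worker thread stands in for an elf and `time.sleep` stands in for the work itself; both are simplifications I've assumed for the demo. For CPU-heavy work in CPython you would typically reach for processes instead of threads.

```python
import time
from concurrent.futures import ThreadPoolExecutor

BUILD_SECONDS = 0.2  # made-up time for one elf to finish one present


def build_present(name: str) -> str:
    # Stands in for cutting, gluing, painting, and wrapping.
    time.sleep(BUILD_SECONDS)
    return f"{name} wrapped"


def workshop(names: list[str], elves: int) -> tuple[list[str], float]:
    # Each worker in the pool is one elf; more elves, more presents at once.
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=elves) as pool:
        results = list(pool.map(build_present, names))
    return results, time.perf_counter() - start


if __name__ == "__main__":
    toys = ["train", "doll", "robot"]
    _, one_elf = workshop(toys, elves=1)
    _, three_elves = workshop(toys, elves=3)
    print(f"one elf: {one_elf:.2f}s, three elves: {three_elves:.2f}s")
```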

Santa Claus, and threading

After all the elves’ hard work is done, it’s up to Santa Claus to deliver the presents – all 907,208,752 of them.

Santa doesn’t need to make a visit to every kid; just to the one household tree. So how many trees does Santa need to visit? Again with broad generalization, we’ll say that the average number of children per household worldwide is 2.45, based on the year’s predicted fertility rates. That makes 370,289,286.4 houses to visit. Let’s round that up to 370,289,287.

How long does Santa have? The lore says one night, which means one earthly rotation, and thus 24 hours. NORAD confirms.

This means Santa must visit 370,289,287 households in 24 hours (86,400 seconds), at a rate of 4,285.75 households per second, never mind the time it takes to put presents under the tree and grab a cookie.
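For the skeptical, the delivery rate can be reproduced in a few lines of Python. The constants come from the figures above; the names are illustrative.

```python
import math

ELIGIBLE_CHILDREN = 907_208_751.6   # from the earlier estimate
CHILDREN_PER_HOUSEHOLD = 2.45       # the article's worldwide average
SECONDS_PER_NIGHT = 24 * 60 * 60    # one earthly rotation: 86,400 seconds

# Round up: Santa can't visit a fraction of a household.
households = math.ceil(ELIGIBLE_CHILDREN / CHILDREN_PER_HOUSEHOLD)

# About 4,285.756, which the article truncates to 4,285.75.
households_per_second = households / SECONDS_PER_NIGHT

if __name__ == "__main__":
    print(f"households:            {households:,}")
    print(f"households per second: {households_per_second:,.3f}")
```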

Clearly, Santa doesn’t exist in our dimension. This is especially likely given that despite being chubby and plump, he fits down a chimney (with a lit fire, while remaining unhurt) carrying a sack of toys containing presents for all the household’s children. We haven’t even considered the fact that his sleigh carries enough toys for every believing boy and girl around the world, and flies.

Does Santa exist outside our rules of physics? How could one entity manage to travel around the world, delivering packages, in under 24 hours at a rate of 4,285.75 households per second, and still have time for milk and cookies and kissing mommy?

One thing is certain: Santa uses the Internet. No other technology has yet enabled packages to travel quite so far and quite so quickly. Even so, attempting to reach upwards of four thousand households per second is no small task, even with the best gigabit Internet hookup the North Pole has to offer. How might Santa increase his efficiency?

There’s clearly only one logical conclusion to this mystery: Santa Claus is a multithreaded process.

A single thread

Let’s work outward. Think of a thread as one particular task, or the most granular sequence of instructions that Santa might execute. One thread might execute the task, put present under tree. A thread is a component of a process, in this case, Santa’s process of delivering presents.

If Santa Claus is single-threaded, he, as a process, would only be able to accomplish one task at a time. Since he’s old and a bit forgetful, he probably has a set of instructions for delivering presents, as well as a schedule to abide by. These two things guide Santa’s thread until his process is complete.

Single-threaded Santa Claus might work something like this:

Land sleigh at Timmy’s house
Get Timmy’s present from sleigh
Enter house via chimney
Locate Christmas tree
Place Timmy’s present under Christmas tree
Exit house via chimney
Take off in sleigh

Rinse and repeat… another 370,289,286 times.
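The routine above might be sketched like this in Python. It's a toy model: `STEPS`, `deliver`, and `single_threaded_santa` are names I've invented, and the list of children is tiny.

```python
# Santa's instruction set, one step per line, as listed above.
STEPS = [
    "Land sleigh at {child}'s house",
    "Get {child}'s present from sleigh",
    "Enter house via chimney",
    "Locate Christmas tree",
    "Place {child}'s present under Christmas tree",
    "Exit house via chimney",
    "Take off in sleigh",
]


def deliver(child: str) -> list[str]:
    # Execute every step, in order, for one child.
    return [step.format(child=child) for step in STEPS]


def single_threaded_santa(children: list[str]) -> list[str]:
    log = []
    for child in children:  # one household at a time, strictly in order
        log.extend(deliver(child))
    return log


if __name__ == "__main__":
    for line in single_threaded_santa(["Timmy", "Ana"]):
        print(line)
```

Nothing for the second child starts until the sleigh has taken off from the first house, which is exactly the problem.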

Multithreading

Multithreaded Santa Claus, by contrast, is the Doctor Manhattan of the North Pole. There’s still only one Santa Claus in the world; however, he has the amazing ability to multiply his consciousness and accomplish multiple instruction sets of tasks simultaneously. These additional task workers, or worker threads, are created and controlled by the main process of Santa delivering presents.

Each worker thread acts independently to complete its instructions. Since they all belong to Santa’s consciousness, they share Santa’s memory and know everything that Santa knows, including what planet they’re running around on, and where to get the presents from.

With this shared knowledge, each thread is able to execute its set of instructions in parallel with the other threads. This multithreaded parallelism makes the one and only Santa Claus as efficient as possible.

If an average present delivery run takes an hour, each worker thread can make 24 trips over the course of the night, so Santa needs to spawn 15,428,721 worker threads. With each making one delivery trip per hour, Santa will have completed all 370,289,287 trips by the end of the night.

Of course, in theory, Santa could even spawn 370,289,287 worker threads, each flying to one household to deliver presents for all the children in it! That would make Santa’s process extremely efficient, and also explain how he manages to consume all those milk-dunked cookies without getting full. 🥛🍪🍪🍪
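Here is a small sketch of multithreaded Santa, again with Python's `ThreadPoolExecutor`. The `Santa` class and its attributes are hypothetical, and the household count is scaled way down. The part worth noticing is that every worker thread reads and writes the same object, which is the shared memory described above.

```python
import threading
from concurrent.futures import ThreadPoolExecutor


class Santa:
    def __init__(self, households):
        # Shared memory: every worker thread sees these same attributes.
        self.planet = "Earth"
        self.households = list(households)
        self.delivered = []
        self._lock = threading.Lock()

    def _deliver(self, household):
        # Each worker runs the same instructions on a different household.
        with self._lock:  # guard the shared list against simultaneous writes
            self.delivered.append(household)
        return threading.current_thread().name

    def run(self, workers: int) -> set[str]:
        # The main process creates and controls the worker threads.
        with ThreadPoolExecutor(
            max_workers=workers, thread_name_prefix="santa"
        ) as pool:
            return set(pool.map(self._deliver, self.households))


if __name__ == "__main__":
    santa = Santa(range(100))
    worker_names = santa.run(workers=4)
    print(f"{len(santa.delivered)} deliveries by {len(worker_names)} threads")
```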

An efficient and merry multithreaded Christmas

Thanks to modern computing, we now finally understand how Santa Claus manages the seemingly-impossible task of delivering toys to good girls and boys the world over. From my family to yours, I hope you have a wonderful Christmas. Don’t forget to hang up your stockings on the router shelf.

Of course, none of this explains how reindeer manage to fly.

Word bugs in software documentation and how to fix them

2019-12-18T09:01:23-04:00

I’ve been an editor longer than I’ve been a developer, so this topic for me is a real root issue. 🥁 When I see a great project with poorly-written docs, it hits close to /home. Okay, okay, I’m done.

I help the Open Web Application Security Project (OWASP) with their Web Security Testing Guide (WSTG). I was recently tasked with writing a style guide and article template that show how to write technical instruction for testing software applications.

I thought parts of the guide would benefit more people than just OWASP’s contributors, so I’m sharing some here.

Many of the projects I participate in are open source. This is a wonderful way for people to share solutions and to build on each others’ ideas. Unfortunately, it’s also a great way for misused and non-existent words to catch on. Here’s an excerpt of the guide with some mistakes I’ve noticed and how you can fix them in your technical documents.

Use Correct Words

The following are frequently misused words and how to correct them.

and/or

While sometimes used in legal documents, and/or leads to ambiguity and confusion in technical writing. Instead, use or, which in the English language includes and. For example:

Bad: “The code will output an error number and/or description.”
Good: “The code will output an error number or description.”

The latter sentence does not exclude the possibility of having both an error number and description.

If you need to specify all possible outcomes, use a list:

“The code will output an error number, or a description, or both.”

frontend, backend

While it’s true that the English language evolves over time, these are not yet words.

When referring to nouns, use front end and back end. For example:

Security is equally important on the front end as it is on the back end.

As a descriptive adjective, use the hyphenated front-end and back-end.

Both front-end developers and back-end developers are responsible for application security.

whitebox, blackbox, greybox

These are not words.

As nouns, use white box, black box, and grey box. These nouns rarely appear in connection with cybersecurity.

My cat enjoys jumping into that grey box.

As adjectives, use the hyphenated white-box, black-box, and grey-box. Do not use capitalization unless the words are in a title.

While white-box testing involves knowledge of source code, black-box testing does not. A grey-box test is somewhere in between.

ie, eg

These are letters.

The abbreviation i.e. refers to the Latin id est, which means “in other words.” The abbreviation e.g. is for exempli gratia, translating to “for example.” To use these in a sentence:

Write using proper English, i.e. correct spelling and grammar. Use common words over uncommon ones, e.g. “learn” instead of “glean.”

etc

These are also letters.

The Latin phrase et cetera translates to “and the rest.” It is abbreviated etc. and typically placed at the end of a list that seems redundant to complete:

WSTG authors like rainbow colors, such as red, yellow, green, etc.

In technical writing, the use of etc. is problematic. It assumes the reader knows what you’re talking about, and they may not. Violet is one of the colors of the rainbow, but the example above does not explicitly tell you if violet is a color that WSTG authors like.

It is better to be explicit and thorough than to make assumptions of the reader. Only use etc. to avoid completing a list that was given in full earlier in the document.

… (ellipsis)

The ellipsis punctuation mark can indicate that words have been left out of a quote:

Linus Torvalds once said, “Once you realize that documentation should be laughed at… THEN, and only then, have you reached the level where you can safely read it and try to use it to actually implement a driver.”

As long as the omission does not change the meaning of the quote, this is acceptable usage of ellipsis in the WSTG.

All other uses of ellipsis, such as to indicate an unfinished thought, are not.

ex

While this is a word, it is likely not the word you are looking for. The word ex has particular meaning in the fields of finance and commerce, and may refer to a person if you are discussing your past relationships. None of these topics should appear in the WSTG.

The abbreviation ex. may be used to mean “example” by lazy writers. Please don’t be lazy, and write example instead.

Go forth and write docs

If these reminders are helpful, please share them freely and use them when writing your own READMEs and documentation! If there are some I’ve missed, I’d love to know.

And if you’re here for the comments…

There are none on my blog. You can still @ me.

If you’d like to help contribute to the OWASP WSTG, please read the contribution guide. See the full style guide here.

Secure web forms for the front-end developer

2019-12-11T08:27:31-04:00

While cybersecurity is often thought of in terms of databases and architecture, much of a strong security posture relies on elements in the domain of the front-end developer. For certain potentially devastating vulnerabilities like SQL injection and Cross-Site Scripting (XSS), a well-considered user interface is the first line of defense.

Here are a few areas of focus for front-end developers who want to help fight the good fight.

Control user input

A whole whack of crazy things can happen when developers build a form that fails to control user input. To combat vulnerabilities like injection, it’s important to validate or sanitize user input.

Input can be validated by constraining it to known values, such as by using semantic input types or validation-related attributes in forms. Frameworks like Django also help by providing field types for this purpose. Sanitizing data can be done by removing or replacing contextually-dangerous characters, such as by using a whitelist or escaping the input data.

While it may not be intuitive, even data that a user submits to their own area on a site should be validated. One of the fastest viruses to proliferate was the Samy worm on MySpace (yes, I’m old), thanks to code that Samy Kamkar was able to inject into his own profile page. Don’t directly return any input to your site without thorough validation or sanitization.
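To make the distinction concrete, here is a minimal Python sketch of both approaches. It is not front-end code, and the rules in it (the list of allowed colors, the username pattern) are assumptions I've made up for the example. The same ideas apply wherever your form data ends up being handled.

```python
import html
import re

# Hypothetical rules, for illustration only.
ALLOWED_COLORS = {"red", "green", "blue"}
USERNAME_PATTERN = re.compile(r"[A-Za-z0-9_]{3,20}")


def validate_color(value: str) -> str:
    # Validation: constrain the input to a set of known values.
    if value not in ALLOWED_COLORS:
        raise ValueError("unsupported color")
    return value


def validate_username(value: str) -> str:
    # Validation: accept only input that matches the whole pattern.
    if not USERNAME_PATTERN.fullmatch(value):
        raise ValueError("invalid username")
    return value


def sanitize_for_html(value: str) -> str:
    # Sanitization: escape characters that are dangerous in an HTML context.
    return html.escape(value, quote=True)


if __name__ == "__main__":
    print(validate_color("red"))
    print(sanitize_for_html("<script>alert(1)</script>"))
```

Escaping is context-specific. `html.escape` helps when the value is placed into HTML, but it does nothing for a value that ends up inside a SQL query, for example.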

For some further guidance on battling injection attacks, see the OWASP Injection Prevention Cheat Sheet.

Beware hidden fields

Adding type="hidden" is an enticingly convenient way to hide sensitive data in pages and forms, but unfortunately not an effective one. With tools like ZapProxy and even inspection tools in plain ol’ web browsers, users can easily click to reveal tasty bits of invisible information. Hiding checkboxes can be a neat hack for creating CSS-only switches, but hidden fields do little to contribute to security.
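To illustrate how little effort it takes, here is a short Python sketch that pulls hidden fields out of a page using only the standard library's `html.parser`. The page markup, field names, and values are invented for the example.

```python
from html.parser import HTMLParser

# A made-up form, for illustration only.
PAGE = (
    '<form>'
    '<input type="hidden" name="discount_code" value="STAFF50">'
    '<input type="text" name="email">'
    '</form>'
)


class HiddenFieldFinder(HTMLParser):
    def __init__(self):
        super().__init__()
        self.hidden = {}

    def handle_starttag(self, tag, attrs):
        # attrs arrives as a list of (name, value) pairs.
        attributes = dict(attrs)
        if tag == "input" and attributes.get("type") == "hidden":
            self.hidden[attributes.get("name")] = attributes.get("value")


finder = HiddenFieldFinder()
finder.feed(PAGE)

if __name__ == "__main__":
    print(finder.hidden)
```

Anything the browser receives, a user or a script can read, whether or not it is drawn on the screen.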

Carefully consider autofill fields

When a user chooses to give you their Personally Identifiable Information (PII), it should be a conscious choice. Autofill form fields can be convenient - for both users and attackers. Exploits using hidden fields can harvest PII previously captured by an autocomplete field.

Many users aren’t even aware what information their browser’s autofill has stored up. Use these fields sparingly, and disable autofilled forms for particularly sensitive data.

It’s important to also weigh your risk profile against its trade-offs. If your project must be WCAG compliant, disabling autocomplete can break your input for different modalities. For more, see 1.3.5: Identify Input Purpose in WCAG 2.1.

Keep errors generic

While it may seem helpful to let users know whether a piece of data exists, it’s also very helpful to attackers. When dealing with accounts, emails, and PII, it’s most secure to err (🥁) on the side of less. Instead of returning “Your password for this account is incorrect,” try the more ambiguous feedback “Incorrect login information,” and avoid revealing whether the username or email is in the system.

In order to be more helpful, provide a prominent way to contact a human in case an error should arise. Avoid revealing information that isn’t necessary. If nothing else, for heaven’s sake, don’t suggest data that’s a close match to the user input.

Be a bad guy

When considering security, it’s helpful to take a step back, observe the information on display, and ask yourself how a malicious attacker would be able to utilize it. Play devil’s advocate. If a bad guy saw this page, what new information would they gain? Does the view show any PII?

Ask yourself if everything on the page is actually necessary for a genuine user. If not, redact or remove it. Less is safer.

Security starts at the front door

These days, there’s a lot more overlap between coding on the front end and the back end. To create a well-rounded and secure application, it helps to have a general understanding of ways attackers can get their foot in the front door.

The surprisingly difficult task of printing newlines in a terminal

2019-12-04T09:17:35-05:00

Surprisingly, getting computers to give humans readable output is no easy feat. With the introduction of standard streams and specifically standard output, programs gained a way to talk to each other using plain text streams; humanizing and displaying stdout is another matter. Technology throughout the computing age has tried to solve this problem, from the use of ASCII characters in video computer displays to modern shell commands like echo and printf.

These advancements have not been seamless. The job of printing output to a terminal is fraught with quirks for programmers to navigate, as exemplified by the deceptively nontrivial task of expanding an escape sequence to print newlines. The expansion of the placeholder \n can be accomplished in a multitude of ways, each with its own unique history and complications.

Using `echo`

From its appearance in Multics to its modern-day Unix-like system ubiquity, echo remains a familiar tool for getting your terminal to say “Hello world!” Unfortunately, inconsistent implementations across operating systems make its usage tricky. Where echo on some systems will automatically expand escape sequences, others require the -e option to do the same:

echo "the study of European nerves is \neurology"
# the study of European nerves is \neurology

echo -e "the study of European nerves is \neurology"
# the study of European nerves is
# eurology

Because of these inconsistencies in implementations, echo is considered non-portable. Additionally, its usage in conjunction with user input is relatively easy to corrupt through shell injection attack using command substitutions.

In modern systems, it is retained only to provide compatibility with the many programs that still use it. The POSIX specification recommends the use of printf in new programs.

Using `printf`

Since 4th Edition Unix, the portable printf command has essentially been the new and better echo. It allows you to use format specifiers to humanize input. To interpret backslash escape sequences, use %b. The character sequence \n ensures the output ends with a newline:

printf "%b\n" "Many females in Oble are \noblewomen"
# Many females in Oble are
# oblewomen

Though printf has further options that make it a far more powerful replacement of echo, this utility is not foolproof and can be vulnerable to an uncontrolled format string attack. It’s important for programmers to ensure they carefully handle user input.
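A small sketch of the difference, assuming the variable user_input stands in for text that came from someone else. The key is to keep the format string fixed and pass the input as an argument:

```shell
#!/usr/bin/env bash
# Pretend this string came from a user; it contains a format specifier
# and an escape sequence.
user_input='50%s off\n'

# Risky: printf "$user_input" would treat the input itself as the format string.

# Safer: a fixed format string, with the input passed as an argument.
printf '%s\n' "$user_input"
# 50%s off\n
```

With %s, neither the specifier nor the escape sequence in the input is interpreted.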

Putting newlines in variables

In an effort to improve portability amongst compilers, work on the ANSI C Standard began in 1983, and the standard was ratified in 1989. With ANSI-C quoting using $'...', escape sequences are replaced in output according to the standard.

This allows us to store strings with newlines in variables that are printed with the newlines interpreted. You can do this by setting the variable, then calling it with printf using $:

puns=$'\number\narrow\nether\nice'

printf "%b\n" "These words started with n but don't make $puns"

# These words started with n but don't make
# umber
# arrow
# ether
# ice

The expanded variable is single-quoted, which is passed literally to printf. As always, it is important to properly handle the input.
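To see what actually ends up in the variable, here is a short comparison of plain single quotes and ANSI-C quoting. The variable names are made up for illustration:

```shell
#!/usr/bin/env bash
literal='first\nsecond'     # single quotes: backslash and n stay as two characters
ansi=$'first\nsecond'       # ANSI-C quoting: a real newline is stored

printf '%s\n' "${#literal}" "${#ansi}"
# 13
# 12

printf '%s\n' "$literal"    # %s prints the stored text as-is
# first\nsecond

printf '%b\n' "$literal"    # %b expands the escape at print time
# first
# second
```

The one-character difference in length is the escape sequence collapsing into a single newline at assignment time.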

Bonus round: shell parameter expansion

In my article explaining Bash and braces, I covered the magic of shell parameter expansion. We can use one expansion, ${parameter@operator}, to interpret escape sequences, too. We use printf’s %s specifier to print as a string, and the E operator will properly expand the escape sequences in our variable:

printf "%s\n" ${puns@E}

# umber
# arrow
# ether
# ice

The ongoing challenge of humanizing output

String interpolation continues to be a chewy problem for programmers. Besides getting languages and shells to agree on what certain placeholders mean, properly using the correct escape sequences requires an eye for detail.

Poor string interpolation can lead to silly-looking output, as well as introduce security vulnerabilities, such as from injection attacks. Until the next evolution of the terminal has us talking in emojis, we’d best pay attention when printing output for humans.

The care and feeding of an IoT device

2019-11-27T08:59:35-05:00

Giving someone a puppy for Christmas might work really well in a movie, but in real life often comes hitched to a multitude of responsibilities that the giftee may not be fully prepared to take on. The same is true for Internet of Things (IoT) devices, including Amazon’s Alexa-enabled devices, Google Home, and other Internet-connected appliances like cameras, lightbulbs, and toasters. Yes, they have those now.

Like puppies, IoT devices are still young. Many contain known vulnerabilities that remote attackers can use to gain access to device owners’ networks. These attacks are sometimes as laughably simple as using a default username and password that the device owner cannot change.

Does all this mean you shouldn’t give Grandma Mabel a new app-enabled coffee maker or Ring doorbell for Christmas? Probably, although not necessarily. Like puppies, properly-maintained IoT devices are capable of warming your heart without causing too much havoc; but they take a lot of work to care for. Here are a few responsibilities to keep in mind for the care and feeding of an IoT device.

Immature security

Many manufacturers of IoT devices have not made security a priority. There aren’t yet any enforced security requirements for this industry, which leaves the protection of your device and the network it’s connected to in the hands of the manufacturer.

It’s not just obscure no-name toasters, either; malicious third-party apps have snuck onto Amazon’s and Google’s more reputable devices and enabled attackers to eavesdrop on unsuspecting owners.

Until security regulations are put in place and enforced, it’s buyer beware for both devices and third-party applications. To the extent possible, potential owners must do ample research to weed out vulnerable devices and untrustworthy apps.

Protecting your network

If you think hackers aren’t likely to find your device in the vast expanse of the Internet, you might be wrong. These days, obscurity doesn’t provide security. It’s no longer left up to a potential attacker’s fallible human eyes to find your insecure front door camera in a cacophony of wireless traffic; IoT search engines like Shodan will do that for them. Thankfully, these search engines are also used for good, enabling white hat hackers and penetration testers to find and fix insecure devices.

Just like locking your own front door, IoT owners are responsible for locking down access to their devices. This may mean searching through device settings to make sure default credentials are changed, or checking to make sure that a device used on your private home network doesn’t by default have public Internet access.

Where the options are available, HTTPS and multifactor authentication should be enabled. The use of a VPN can also keep your devices from being found.

Keeping them patched

Unlike puppies, many IoT devices are “headless” and have no inherent way of interfacing with a human. An app-controlled lightbulb, for example, may be all but useless without the software that makes it shine. As convenient as it may be to have your 1500K mood lighting come on automatically at dusk, it also means automatically ceding control of the device to its software developers.

When vulnerabilities in your phone’s operating system are discovered and patched, it’s likely that automatic updates are pushed and installed overnight, possibly without you even knowing. Your IoT device, on the other hand, may have no such support. In those cases, it’s completely up to the user to discover that an update is needed, find and download the patch, then correctly update their device. Even for owners with some technical expertise, this process takes significant effort. Many device owners aren’t even aware that their software is dangerously outdated.

In practical terms, this means that users without the time, knowledge, or willingness to keep their devices updated should reconsider owning them. Alternatively, some research can help prospective owners choose devices that receive automatic push updates from their (hopefully responsible) manufacturers over WiFi.

Being responsible

Raising a healthy and happy IoT device is no small task, especially for potential owners with little time or willingness to put in the required effort. With the proper attention and maintenance, your Internet-connected appliance can bring joy and convenience to your life; but without, it introduces a potential security risk and a whole lot of trouble.

Before getting or giving IoT, be sure the potential owner is up to the task of caring for it.

You can learn more about basic cybersecurity for IoT (as a user or maker) by reading NIST’s draft guidelines publication.

Bash and shell expansions: lazy list-making

2019-11-18T07:07:24-05:00

It’s that time of year again! When stores start putting up colourful sparkly lit-up plastic bits, we all begin to feel a little festive, and by festive I mean let’s go shopping. Specifically, holiday gift shopping! (Gifts for yourself are still gifts, technically.)

Just so this doesn’t all go completely madcap, you ought to make some gift lists. Bash can help.

Brace expansion

These are not braces: ()

Neither are these: []

These are braces: {}

Braces tell Bash to do something with the arbitrary string or strings it finds between them. Multiple strings are comma-separated: {a,b,c}. You can also add an optional preamble and postscript to be attached to each expanded result. Mostly, this can save some typing, such as with common file paths and extensions.

Let’s make some lists for each person we want to give stuff to. The following commands are equivalent:

touch /home/me/gift-lists/Amy.txt /home/me/gift-lists/Bryan.txt /home/me/gift-lists/Charlie.txt

touch /home/me/gift-lists/{Amy,Bryan,Charlie}.txt

tree gift-lists

/home/me/gift-lists
├── Amy.txt
├── Bryan.txt
└── Charlie.txt

Oh darn, “Bryan” spells his name with an “i.” I can fix that.

mv -v /home/me/gift-lists/{Bryan,Brian}.txt

renamed '/home/me/gift-lists/Bryan.txt' -> '/home/me/gift-lists/Brian.txt'
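If you’re not sure what a brace expansion will produce, one cautious habit is to put echo in front of the command first. Bash expands the braces before echo ever runs, so this previews the result without touching any files. The list- prefix below is just an example preamble:

```shell
#!/usr/bin/env bash
# Preview what Bash will expand before running a command that changes files.
echo mv /home/me/gift-lists/{Bryan,Brian}.txt
# mv /home/me/gift-lists/Bryan.txt /home/me/gift-lists/Brian.txt

# The preamble and postscript are attached to every item in the braces.
echo list-{Amy,Brian,Charlie}.txt
# list-Amy.txt list-Brian.txt list-Charlie.txt
```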

Shell parameter expansions

Shell parameter expansion allows us to make all sorts of changes to parameters enclosed in braces, like manipulate and substitute text.

There are a few stocking stuffers that all our giftees deserve. Let’s make that a variable:

STUFF=$'socks\nlump of coal\nwhite chocolate'

echo "$STUFF"
socks
lump of coal
white chocolate

Now to add these items to each of our lists with some help from the tee command to get echo and expansions to play nice.

echo "$STUFF" | tee {Amy,Brian,Charlie}.txt

cat {Amy,Brian,Charlie}.txt

socks
lump of coal
white chocolate
socks
lump of coal
white chocolate
socks
lump of coal
white chocolate

Pattern match substitution

On second thought, maybe the lump of coal isn’t such a nice gift. You can replace it with something better using a pattern match substitution in the form of ${parameter/pattern/string}:

echo "${STUFF/lump of coal/candy cane}" | tee {Amy,Brian,Charlie}.txt

cat {Amy,Brian,Charlie}.txt

socks
candy cane
white chocolate
socks
candy cane
white chocolate
socks
candy cane
white chocolate

This replaces the first instance of “lump of coal” with “candy cane.” To replace all instances (if there were multiple), use ${parameter//pattern/string}. This doesn’t change our $STUFF variable, so we can still reuse the original list for someone naughty later.
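Here’s a quick side-by-side of the two forms, using a made-up list with a repeated item:

```shell
#!/usr/bin/env bash
naughty='coal, socks, coal'

echo "${naughty/coal/candy cane}"    # single slash: first match only
# candy cane, socks, coal

echo "${naughty//coal/candy cane}"   # double slash: every match
# candy cane, socks, candy cane

echo "$naughty"                      # the variable itself is unchanged
# coal, socks, coal
```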

Substrings

While we’re improving things, our giftees may not all like white chocolate. We’d better add some regular chocolate to our lists just in case. Since I’m super lazy, I’m just going to hit the up arrow and modify a previous Bash command. Luckily, the last word in the $STUFF variable is “chocolate,” which is nine characters long, so I’ll tell Bash to keep just that part using ${parameter:offset}. I’ll use tee’s -a flag to append to my existing lists:

echo "${STUFF: -9}" | tee -a {Amy,Brian,Charlie}.txt

cat {Amy,Brian,Charlie}.txt

socks
candy cane
white chocolate
chocolate
socks
candy cane
white chocolate
chocolate
socks
candy cane
white chocolate
chocolate

You can also:

Do this	With this
Get substring from n characters onwards	`${parameter:n}`
Get substring for x characters starting at n	`${parameter:n:x}`
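A few of these in action, as a sketch with a throwaway variable:

```shell
#!/usr/bin/env bash
treat='white chocolate'

echo "${treat:6}"      # from offset 6 to the end
# chocolate
echo "${treat:0:5}"    # 5 characters starting at offset 0
# white
echo "${treat: -9:4}"  # a negative offset counts from the end
# choc
```

The space before the minus sign matters. Without it, Bash reads :- as the "use a default value" expansion instead.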

There! Now our base lists are finished. Let’s have some eggnog.

Testing variables

You know, it may be the eggnog, but I think I started a list for Amy yesterday and stored it in a variable that I might have called amy. Let’s see if I did. I’ll use the ${parameter:?word} expansion. It’ll write word to standard error and exit if there’s no amy parameter.

echo "${amy:?no such}"

bash: amy: no such

I guess not. Maybe it was Brian instead?

echo "${brian:?no such}"

Lederhosen

You can also:

Do this	With this
Substitute `word` if `parameter` is unset or null	`${parameter:-word}`
Substitute `word` if `parameter` is not unset or null	`${parameter:+word}`
Assign `word` to `parameter` if `parameter` is unset or null	`${parameter:=word}`
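Here’s how those three behave, sketched with a hypothetical recipient variable that starts out unset:

```shell
#!/usr/bin/env bash
unset recipient

echo "${recipient:-nobody yet}"   # substitutes, but does not assign
# nobody yet
echo "[${recipient:+is set}]"     # expands to nothing while unset
# []

: "${recipient:=Amy}"             # assigns, because recipient is unset
echo "$recipient"
# Amy
echo "[${recipient:+is set}]"
# [is set]
```

The leading colon command does nothing on its own. It just gives the := expansion somewhere to run without trying to execute the result.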

Changing case

That’s right! Brian said he wanted some lederhosen and so I made myself a note. This is pretty important, so I’ll add it to Brian’s list in capital letters with the ${parameter^^pattern} expansion. The pattern part is optional. We’re only writing to Brian’s list, so I’ll just use >> instead of tee -a.

echo "${brian^^}" >> Brian.txt

cat Brian.txt

socks
candy cane
white chocolate
chocolate
LEDERHOSEN

You can also:

Do this	With this
Capitalize the first letter	`${parameter^pattern}`
Lowercase the first letter	`${parameter,pattern}`
Lowercase all letters	`${parameter,,pattern}`
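All four case operators side by side, assuming Bash 4 or newer (older versions, like the Bash 3.2 that ships with macOS, don’t support them):

```shell
#!/usr/bin/env bash
name='brian'
note='LEDERHOSEN'

echo "${name^}"    # Brian
echo "${name^^}"   # BRIAN
echo "${note,}"    # lEDERHOSEN
echo "${note,,}"   # lederhosen
```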

Expanding arrays

You know what, all this gift-listing business is a lot of work. I’m just going to make an array of things I saw at the store:

gifts=(sweater gameboy wagon pillows chestnuts hairbrush)

I can use substring expansion in the form of ${parameter:offset:length} to make this simple. I’ll add the first two to Amy’s list, the middle two to Brian’s, and the last two to Charlie’s. I’ll use printf to help with newlines.

printf '%s\n' "${gifts[@]:0:2}" >> Amy.txt
printf '%s\n' "${gifts[@]:2:2}" >> Brian.txt
printf '%s\n' "${gifts[@]: -2}" >> Charlie.txt

cat Amy.txt

socks
candy cane
white chocolate
chocolate
sweater
gameboy

cat Brian.txt

socks
candy cane
white chocolate
chocolate
LEDERHOSEN
wagon
pillows

cat Charlie.txt

socks
candy cane
white chocolate
chocolate
chestnuts
hairbrush
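If the list of people grows, the three printf lines could be folded into a loop. This is only a sketch: the names array and the summary variable are my own additions, and it collects the result in a string rather than writing to the list files. It works because the offset in an array slice is evaluated as arithmetic:

```shell
#!/usr/bin/env bash
gifts=(sweater gameboy wagon pillows chestnuts hairbrush)
names=(Amy Brian Charlie)

summary=''
for i in "${!names[@]}"; do
  # Take two gifts per person, starting at index i*2.
  pair=("${gifts[@]:i*2:2}")
  summary+="${names[i]}: ${pair[*]}"$'\n'
done

printf '%s' "$summary"
# Amy: sweater gameboy
# Brian: wagon pillows
# Charlie: chestnuts hairbrush
```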

There! Now we’ve got a comprehensive set of super personalized gift lists. Thanks Bash! Too bad it can’t do the shopping for us, too.

A cron job that could save you from a ransomware attack

2019-11-13T08:27:31-04:00

It’s 2019, and ransomware has become a thing.

Systems that interact with the public, like companies, educational institutions, and public services, are most susceptible. While delivery methods for ransomware vary from the physical realm to communication via social sites and email, all methods only require one person to make one mistake in order for ransomware to proliferate.

Ransomware, as you may have heard, is a malicious program that encrypts your files, rendering them unreadable and useless to you. It can include instructions for paying a ransom, usually by sending cryptocurrency, in order to obtain the decryption key. Successful ransomware attacks typically exploit vital, time-sensitive systems. Victims like public services and medical facilities are more likely to have poor or zero recovery processes, leaving governments or insurance providers to reward attackers with ransom payments.

Individuals, especially less-than-tech-savvy ones, are no less at risk. Ransomware can occlude personal documents and family photos that may only exist on one machine.

Thankfully, a fairly low-tech solution exists for rendering ransomware inept: back up your data!

You could achieve this with a straightforward system like plugging in an external hard drive and dragging files over once a day, but this method has a few hurdles. Manually transferring files may be slow or incomplete, and besides, you’ll first have to remember to do it.

In my constant pursuit of automating all the things, there’s one tool I often return to for its simplicity and reliability: cron. Cron does one thing, and does it well: it runs commands on a schedule.

I first used it a few months shy of three years ago (Have I really been blogging that long?!) to create custom desktop notifications on Linux. Using the crontab configuration file, which you can edit by running crontab -e, you can specify a schedule for running any commands you like. Here’s what the scheduling syntax looks like, from the Wikipedia cron page:

# ┌───────────── minute (0 - 59)
# │ ┌───────────── hour (0 - 23)
# │ │ ┌───────────── day of the month (1 - 31)
# │ │ │ ┌───────────── month (1 - 12)
# │ │ │ │ ┌───────────── day of the week (0 - 6)
# │ │ │ │ │
# │ │ │ │ │
# │ │ │ │ │
# * * * * * command to execute

For example, a cron job that runs every day at 00:00 would look like:

0 0 * * *

To run a job every twelve hours, the syntax is:

0 */12 * * *

This great tool can help you wrap your head around the cron scheduling syntax.

What’s a scheduler have to do with backing up? By itself, not much. The simple beauty of cron is that it runs commands - any shell commands, and any scripts that you’d normally run on the command line. As you may have gleaned from my other posts, I’m of the strong opinion that you can do just about anything on the command line, including backing up your files. Options for storage in this area are plentiful, from near-to-free local and cloud options, as well as paid managed services too numerous to list. For CLI tooling, we have utilitarian classics like rsync, and CLI tools for specific cloud providers like AWS.

Backing up with `rsync`

The rsync utility is a classic choice, and can back up your files to an external hard drive or remote server while making intelligent determinations about which files to update. It uses file size and modification times to recognize file changes, and then only transfers changed files, saving time and bandwidth.

The rsync syntax can be a little nuanced; for example, a trailing forward slash on the source path will copy just the contents of the directory, instead of the directory itself. I found examples to be helpful in understanding the usage and syntax.

Here’s one for backing up a local directory to a local destination, such as an external hard drive:

rsync -a /home/user/directory /media/user/destination

The first argument is the source, and the second is the destination. Reversing these in the above example would copy files from the mounted drive to the local home directory.

The a flag for archive mode is one of rsync’s superpowers. Equivalent to flags -rlptgoD, it:

Syncs files recursively through directories (r);
Preserves symlinks (l), permissions (p), modification times (t), groups (g), and owner (o); and
Copies device and special files (D).

Here’s another example, this time for backing up the contents of a local directory to a directory on a remote server using SSH:

rsync -avze ssh /home/user/directory/ user@remote.host.net:home/user/directory

The v flag turns on verbose output, which is helpful if you like realtime feedback on which files are being transferred. During large transfers, however, it can tend to slow things down. The z flag can help with that, as it indicates that files should be compressed during transfer.

The e flag, followed by ssh, tells rsync to use SSH according to the destination instructions provided in the final argument.

Backing up with AWS CLI

Amazon Web Services offers a command line interface tool for doing just about everything with your AWS setup, including a straightforward s3 sync command for recursively copying new and updated files to your S3 storage buckets. As a storage method for backup data, S3 is a stable and inexpensive choice. You can even turn on versioning in your bucket.

The syntax for interacting with directories is fairly straightforward, and you can directly indicate your S3 bucket as an S3Uri argument in the form of s3://mybucket/mykey. To back up a local directory to your S3 bucket, the command is:

aws s3 sync /home/user/directory s3://mybucket

Similar to rsync, reversing the source and destination would download files from the S3 bucket.

The sync command is intuitive by default. It will guess the mime type of uploaded files, as well as include files discovered by following symlinks. A variety of options exist to control these and other defaults, even including flags to specify the server-side encryption to be used.

Setting up your cronjob backup

You can edit your machine’s cron file by running:

crontab -e

Intuitive as it may be, it’s worth mentioning that your backup commands will only run when your computer is turned on and the cron daemon is running. With this in mind, choose a schedule for your cronjob that aligns with times when your machine is powered on, and maybe not overloaded with other work.

To back up to an S3 bucket every day at 8AM, for example, you’d put a line in your crontab that looks like:

0 8 * * * aws s3 sync /home/user/directory s3://mybucket

If you’re curious whether your cron job is currently running, find the PID of cron with:

pstree -ap | grep cron

Then run pstree -ap <PID>, replacing <PID> with the process ID you found.

This rabbit hole goes deeper; a quick search can reveal different ways of organizing and scheduling cronjobs, or help you find different utilities to run cronjobs when your computer is asleep. To protect against the possibility of ransomware-affected files being transferred to your backup, incrementally separated archives are a good idea. In essence, however, this basic setup is all you really need to create a reliable, automatic backup system.
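As one simple interpretation of separated archives, here is a sketch of a script that copies into a new dated directory each day. Everything specific in it is an assumption for illustration: the SOURCE and BACKUP_ROOT paths, the function names, and the --run switch. Note that each dated directory holds a full copy, so it trades disk space for simplicity:

```shell
#!/usr/bin/env bash
# Hypothetical locations; adjust to your own source and backup drive.
SOURCE="${SOURCE:-/home/user/directory/}"
BACKUP_ROOT="${BACKUP_ROOT:-/media/user/destination}"

# Build a dated destination path such as /media/user/destination/2019-11-13.
dated_destination() {
  printf '%s/%s' "$1" "$(date +%F)"
}

# Copy the source into today's dated directory using rsync's archive mode.
run_backup() {
  local dest
  dest=$(dated_destination "$BACKUP_ROOT")
  mkdir -p "$dest"
  rsync -a "$SOURCE" "$dest"
}

# Only copy files when explicitly asked, so the file is safe to source.
if [[ "${1:-}" == "--run" ]]; then
  run_backup
fi
```

Saved somewhere like /home/user/bin/backup.sh (again, a made-up path) and marked executable, it could be scheduled with a crontab line such as 0 8 * * * /home/user/bin/backup.sh --run.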

Don’t feed the trolls

Humans are fallible; that’s why cyberattacks work. The success of a ransomware attack depends on the victim having no choice but to pay up in order to return to business as usual. A highly accessible recent backup undermines attackers who depend on us being unprepared. By blowing away a system and restoring from yesterday’s backup, we may lose a day of progress; ransomers, however, gain nothing at all.

For further resources on ransomware defense for users and organizations, check out CISA’s advice on ransomware.

Publishing GitHub event data with GitHub Actions and Pages

2019-11-04T09:13:23-04:00

Teams who work on GitHub rely on event data to collaborate. The data recorded as issues, pull requests, and comments becomes vital to understanding the project.

With the general availability of GitHub Actions, we have a chance to programmatically access and preserve GitHub event data in our repository. Making the data part of the repository itself is a way of preserving it outside of GitHub, and also gives us the ability to feature the data on a front-facing website, such as with GitHub Pages, through an automated process that’s part of our CI/CD pipeline.

And, if you’re like me, you can turn GitHub issue comments into an awesome 90s guestbook page.

No matter the usage, the principal concepts are the same. We can use Actions to access, preserve, and display GitHub event data - with just one workflow file. To illustrate the process, I’ll take you through the workflow code that makes my guestbook shine on.

For an introductory look at GitHub Actions including how workflows are triggered, see A lightweight, tool-agnostic CI/CD flow with GitHub Actions.

Accessing GitHub event data

An Action workflow runs in an environment with some default environment variables. A lot of convenient information is available here, including event data. The most complete way to access the event data is using the $GITHUB_EVENT_PATH variable, the path of the file with the complete JSON event payload.

The expanded path looks like /home/runner/work/_temp/_github_workflow/event.json and its data corresponds to its webhook event. You can find the documentation for webhook event data in GitHub REST API Event Types and Payloads. To make the JSON data available in the workflow environment, you can use a tool like jq to parse the event data and put it in an environment variable.

Below, I grab the comment ID from an issue comment event:

ID="$(jq '.comment.id' "$GITHUB_EVENT_PATH")"

Most event data is also available via the github.event context variable without needing to parse JSON. The fields are accessed using dot notation, as in the example below where I grab the same comment ID:

ID=${{ github.event.comment.id }}

For my guestbook, I want to display entries with the user’s handle, and the date and time. I can capture this event data like so:

AUTHOR=${{ github.event.comment.user.login }}
DATE=${{ github.event.comment.created_at }}

Shell variables are handy for accessing data; however, they’re ephemeral. The workflow environment is created anew each run, and even shell variables set in one step do not persist to other steps. To persist the captured data, you have two options: use artifacts, or commit it to the repository.

Preserving event data: using artifacts

Using artifacts, you can persist data between workflow jobs without committing it to your repository. This is handy when, for example, you wish to transform or incorporate the data before putting it somewhere more permanent.

Two actions assist with using artifacts: upload-artifact and download-artifact. You can use these actions to make files available to other jobs in the same workflow. For a full example, see passing data between jobs in a workflow.

The upload-artifact action’s action.yml contains an explanation of the keywords. The uploaded files are saved in .zip format. Another job in the same workflow run can use the download-artifact action to utilize the data in another step.

You can also manually download the archive on the workflow run page, under the repository’s Actions tab.

Persisting workflow data between jobs does not make any changes to the repository files, as the artifacts generated live only in the workflow environment. Personally, being comfortable working in a shell environment, I see a narrow use case for artifacts, though I’d have been remiss not to mention them. Besides passing data between jobs, they could be useful for creating .zip format archives of, say, test output data. In the case of my guestbook example, I simply ran all the necessary steps in one job, negating any need for passing data between jobs.

Preserving event data: pushing workflow files to the repository

To preserve data captured in the workflow in the repository itself, it is necessary to add and push this data to the Git repository. You can do this in the workflow by creating new files with the data, or by appending data to existing files, using shell commands.

Creating files in the workflow

To work with the repository files in the workflow, use the checkout action to first get a copy to work with:

- uses: actions/checkout@master
  with:
    fetch-depth: 1

To add comments to my guestbook, I turn the event data captured in shell variables into proper files, using substitutions in shell parameter expansion to sanitize user input and translate newlines to paragraphs. I wrote previously about why user input should be treated carefully.

- name: Turn comment into file
  run: |
    ID=${{ github.event.comment.id }}
    AUTHOR=${{ github.event.comment.user.login }}
    DATE=${{ github.event.comment.created_at }}
    COMMENT=$(echo "${{ github.event.comment.body }}")
    NO_TAGS=${COMMENT//[<>]/\`}
    FOLDER=comments

    printf '%b\n' "<div class=\"comment\"><p>${AUTHOR} says:</p><p>${NO_TAGS//$'\n'/\<\/p\>\<p\>}</p><p>${DATE}</p></div>\r\n" > ${FOLDER}/${ID}.html

By using printf and directing its output with > to a new file, the event data is transformed into an HTML file, named with the comment ID number, that contains the captured event data. Formatted, it looks like:

<div class="comment">
    <p>victoriadrake says:</p>
    <p>This is a comment!</p>
    <p>2019-11-04T00:28:36Z</p>
</div>

When working with comments, one effect of naming files using the comment ID is that a new file with the same ID will overwrite the previous. This is handy for a guestbook, as it allows any edits to a comment to replace the original comment file.
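The two substitutions in that step can be tried outside of a workflow. This is a sketch, not the exact workflow code: the sample comment and the variable names are made up, and the newline and the replacement text are kept in variables to sidestep escaping inside the expansion:

```shell
#!/usr/bin/env bash
comment=$'Nice <b>site</b>!\nSee you soon.'

# Replace angle brackets with backticks so tags are not rendered as HTML.
no_tags=${comment//[<>]/\`}

# Turn each newline into a paragraph break.
newline=$'\n'
break_tag='</p><p>'
html="<p>${no_tags//$newline/$break_tag}</p>"

echo "$html"
# <p>Nice `b`site`/b`!</p><p>See you soon.</p>
```

This neutralizes tags, but it isn’t a complete defense. Characters like the ampersand pass through untouched, so treat it as one layer rather than the whole answer.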

If you’re using a static site generator like Hugo, you could build a Markdown format file, stick it in your content/ folder, and the regular site build will take care of the rest. In the case of my simplistic guestbook, I have an extra step to consolidate the individual comment files into a page. Each time it runs, it overwrites the existing index.html with the header.html portion (>), then finds and appends (>>) all the comment files’ contents in descending order, and lastly appends the footer.html portion to end the page.

- name: Assemble page
  run: |
    cat header.html > index.html
    find comments/ -name "*.html" | sort -r | xargs -I % cat % >> index.html
    cat footer.html >> index.html

Committing changes to the repository

Since the checkout action is not quite the same as cloning the repository, at time of writing, there are some issues still to work around. A couple extra steps are necessary to pull, checkout, and successfully push changes back to the master branch, but this is pretty trivially done in the shell.

Below is the step that adds, commits, and pushes changes made by the workflow back to the repository’s master branch.

- name: Push changes to repo
  run: |
    REMOTE=https://${{ secrets.GITHUB_TOKEN }}@github.com/${{ github.repository }}
    git config user.email "${{ github.actor }}@users.noreply.github.com"
    git config user.name "${{ github.actor }}"

    git pull ${REMOTE}
    git checkout master
    git add .
    git status
    git commit -am "Add new comment"
    git push ${REMOTE} master

The remote (in fact, our own repository) is specified using the github.repository context variable. For our workflow to be allowed to push to master, we give the remote URL using the default secrets.GITHUB_TOKEN variable.

Since the workflow environment is shiny and newborn, we need to configure Git. In the above example, I’ve used the github.actor context variable to input the username of the account initiating the workflow. The email is similarly configured using the default noreply GitHub email address.

Displaying event data

If you’re using GitHub Pages with the default secrets.GITHUB_TOKEN variable and without a site generator, pushing changes to the repository in the workflow will only update the repository files. The GitHub Pages build will fail with an error, “Your site is having problems building: Page build failed.”

To enable Actions to trigger a Pages site build, you’ll need to create a Personal Access Token. This token can be stored as a secret in the repository settings and passed into the workflow in place of the default secrets.GITHUB_TOKEN variable. I wrote more about Actions environment and variables in this post.

With the use of a Personal Access Token, a push initiated by the Actions workflow will also update the Pages site. You can see it for yourself by leaving a comment in my guestbook! The comment creation event triggers the workflow, which then takes around 30 seconds to run and update the guestbook page.

Where a site build is necessary for changes to be published, such as when using Hugo, an Action can do this too. However, in order to avoid creating unintended loops, one Action workflow will not trigger another (see what will). Instead, it’s extremely convenient to handle the process of building the site with a Makefile, which any workflow can then run. Simply add running the Makefile as the final step in your workflow job, with the repository token where necessary:

- name: Run Makefile
  env:
    TOKEN: ${{ secrets.GITHUB_TOKEN }}
  run: make all

This ensures that the final step of your workflow builds and deploys the updated site.

No more event data horizon

GitHub Actions provides a neat way to capture and utilize event data so that it’s not only available within GitHub. The possibilities are only as limited as your imagination! Here are a few ideas for things this lets us create:

A public-facing issues board, where customers without GitHub accounts can view and give feedback on project issues.
An automatically-updating RSS feed of new issues, comments, or PRs for any repository.
A comments system for static sites, utilizing GitHub issue comments as an input method.
An awesome 90s guestbook page.

Did I mention I made a 90s guestbook page? My inner-Geocities-nerd is a little excited.

A lightweight, tool-agnostic CI/CD flow with GitHub Actions

2019-10-28T08:28:52-04:00

Agnostic tooling is the clever notion that you should be able to run your code in various environments. With many continuous integration and continuous delivery (CI/CD) apps available, agnostic tooling gives developers a big advantage: portability.

Of course, having your CI/CD work everywhere is a tall order. Popular CI apps for GitHub repositories alone use a multitude of configuration languages spanning Groovy, YAML, TOML, JSON, and more… all with differing syntax, of course. Porting workflows from one tool to another is more than a one-cup-of-coffee process.

The introduction of GitHub Actions has the potential to add yet another tool to the mix; or, for the right set up, greatly simplify a CI/CD workflow.

Prior to this article, I accomplished my CD flow with several lashed-together apps. I used AWS Lambda to trigger site builds on a schedule. I had Netlify build on push triggers, as well as run image optimization, and then push my site to the public Pages repository. I used Travis CI in the public repository to test the HTML. All this worked in conjunction with GitHub Pages, which actually hosts the site.

I’m now using the GitHub Actions beta to accomplish all the same tasks, with one portable Makefile of build instructions, and without any other CI/CD apps.

Appreciating the shell

What do most CI/CD tools have in common? They run your workflow instructions in a shell environment! This is wonderful, because that means that most CI/CD tools can do anything that you can do in a terminal… and you can do pretty much anything in a terminal.

Especially for a contained use case like building my static site with a generator like Hugo, running it all in a shell is a no-brainer. To tell the magic box what to do, we just need to write instructions.

While a shell script is certainly the most portable option, I use the still-very-portable Make to write my process instructions. This provides me with some advantages over simple shell scripting, like the use of variables and macros, and the modularity of rules.

I got into the nitty-gritty of my Makefile in my last post. Let’s look at how to get GitHub Actions to run it.

Using a Makefile with GitHub Actions

To our point on portability, my magic Makefile is stored right in the repository root. Since it’s included with the code, I can run the Makefile locally on any system where I can clone the repository, provided I set the environment variables. Using GitHub Actions as my CI/CD tool is as straightforward as making Make go worky-worky.
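As a rough sketch of what "provided I set the environment variables" can look like for a local run: the Makefile in question reads TOKEN and GITHUB_WORKSPACE, so a small guard can fail early when either is missing. The require_env helper name is made up for this illustration; it isn't part of the Makefile or of GitHub Actions.

```shell
#!/bin/sh
# Hypothetical helper: return non-zero if any named variable is unset or empty.
require_env() {
    for name in "$@"; do
        # Look up the variable whose name is stored in $name (POSIX-compatible indirection).
        eval "value=\${$name:-}"
        if [ -z "$value" ]; then
            echo "Missing environment variable: $name" >&2
            return 1
        fi
    done
}
```

From the repository root, something like export GITHUB_WORKSPACE="$(pwd)" followed by require_env TOKEN GITHUB_WORKSPACE && make all would then run the same steps locally, assuming TOKEN has already been exported.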

I found the GitHub Actions workflow syntax guide to be pretty straightforward, though also lengthy on options. Here’s the necessary set up for getting the Makefile to run.

The workflow file at .github/workflows/make-master.yml contains the following:

name: make-master

on:
  push:
    branches:
      - master
  schedule:
    - cron: '20 13 * * *'

jobs:
  build:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@master
        with:
          fetch-depth: 1
      - name: Run Makefile
        env:
          TOKEN: ${{ secrets.TOKEN }}
        run: make all

I’ll explain the components that make this work.

Triggering the workflow

Actions support multiple triggers for a workflow. Using the on syntax, I’ve defined two triggers for mine: a push event to the master branch only, and a scheduled cron job.

Once the make-master.yml file is in your repository, either of your triggers will cause Actions to run your Makefile. To see how the last run went, you can also add a fun badge to the README.

One hacky thing

Because the Makefile runs on every push to master, I sometimes would get errors when the site build had no changes. When Git, via my Makefile, attempted to commit to the Pages repository, no changes were detected and the commit would fail annoyingly:

nothing to commit, working tree clean
On branch master
Your branch is up to date with 'origin/master'.

nothing to commit, working tree clean
Makefile:62: recipe for target 'deploy' failed
make: *** [deploy] Error 1
##[error]Process completed with exit code 2.

I came across some solutions that proposed using diff to check if a commit should be made, but this may not work for reasons. As a workaround, I simply added the current UTC time to my index page so that every build would contain a change to be committed.
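For reference, the kind of check those solutions describe might look roughly like the sketch below, which stages everything and commits only when the index differs from the last commit. This is an illustration under assumptions, not what my Makefile does, and commit_if_changed is a hypothetical name.

```shell
#!/bin/sh
# Hypothetical helper: stage all changes and commit only if something is staged.
# `git diff --cached --quiet` exits 0 when the index matches HEAD and 1 when it differs.
commit_if_changed() {
    message=$1
    git add .
    if git diff --cached --quiet; then
        echo "No changes to commit; skipping"
        return 0
    fi
    git commit -q -m "$message"
}
```

Because the function returns 0 when there is nothing to commit, a Make recipe calling it would not stop with an error on an unchanged build.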

Environment and variables

You can define the virtual environment for your workflow to run in using the runs-on syntax. The ~~obvious best choice~~ one I chose is Ubuntu. Using ubuntu-latest gets me the most updated version, whatever that happens to be when you’re reading this.

GitHub sets some default environment variables for workflows. The actions/checkout action with fetch-depth: 1 creates a copy of just the most recent commit of your repository in the directory named by the GITHUB_WORKSPACE variable. This allows the workflow to access the Makefile at GITHUB_WORKSPACE/Makefile. Without using the checkout action, the Makefile won’t be found, and I get an error that looks like this:

make: *** No rule to make target 'all'.  Stop.
Running Makefile
##[error]Process completed with exit code 2.

While there is a default GITHUB_TOKEN secret, this is not the one I used. The default is only locally scoped to the current repository. To be able to push to my separate GitHub Pages repository, I created a personal access token scoped to public_repo and pass it in as the secrets.TOKEN encrypted variable. For a step-by-step, see Creating and using encrypted secrets.

Portable tooling

The nice thing about using a simple Makefile to define the bulk of my CI/CD process is that it’s completely portable. I can run a Makefile anywhere I have access to an environment, which is most CI/CD apps, virtual instances, and, of course, on my local machine.

One of the reasons I like GitHub Actions is that getting my Makefile to run was pretty straightforward. I think the syntax is well done - easy to read, and intuitive when it comes to finding an option you’re looking for. For someone already using GitHub Pages, Actions provides a pretty seamless CD experience; and if that should ever change, I can run my Makefile elsewhere. ¯\_(ツ)_/¯

A portable Makefile for continuous delivery with Hugo and GitHub Pages

2019-10-21T09:09:06-04:00

Fun fact: I first launched this GitHub Pages site 1,018 days ago.

Since then, we’ve grown together. From early cringe-worthy commit messages, through eighty-six versions of Hugo, and up until last week, a less-than-streamlined multi-app continuous integration and deployment (CI/CD) workflow.

If you know me at all, you know I love to automate things. I’ve been using a combination of AWS Lambda, Netlify, and Travis CI to automatically build and publish this site. My workflow for the task includes:

Build with Hugo on push to master, and on a schedule (Netlify and Lambda);
Optimize and resize images (Netlify);
Test with HTMLProofer (Travis CI); and
Deploy to my separate, public, GitHub Pages repository (Netlify).

Thanks to the introduction of GitHub Actions, I’m able to do all the above with just one portable Makefile.

Next week I’ll cover my Actions set up; today, I’ll take you through the nitty-gritty of my Makefile so you can write your own.

Makefile portability

POSIX-standard-flavour Make runs on every Unix-like system out there. Make derivatives, such as GNU Make and several flavours of BSD Make, also run on Unix-like systems, though their particular use requires installing the respective program. To write a truly portable Makefile, mine follows the POSIX standard. (For a more thorough summation of POSIX-compatible Makefiles, I found this article helpful: A Tutorial on Portable Makefiles.) I run Ubuntu, so I’ve tested the portability aspect using the BSD Make programs bmake, pmake, and fmake. Compatibility with non-Unix-like systems is a little more complicated, since shell commands differ. With derivatives such as Nmake, it’s better to write a separate Makefile with appropriate Windows commands.

While much of my particular use case could be achieved with shell scripting, I find Make offers some worthwhile advantages. I enjoy the ease of using variables and macros, and the modularity of rules when it comes to organizing my steps.

The writing of rules mostly comes down to shell commands, which is the main reason Makefiles are as portable as they are. The best part is that you can do pretty much anything in a terminal, and certainly handle all the workflow steps listed above.

My continuous deployment Makefile

Here’s the portable Makefile that handles my workflow. Yes, I put emojis in there. I’m a monster.

.POSIX:
DESTDIR=public
HUGO_VERSION=0.58.3

OPTIMIZE = find $(DESTDIR) -not -path "*/static/*" \( -name '*.png' -o -name '*.jpg' -o -name '*.jpeg' \) -print0 | \
xargs -0 -P8 -n2 mogrify -strip -thumbnail '1000>'

.PHONY: all
all: get_repository clean get build test deploy

.PHONY: get_repository
get_repository:
	@echo "🛎 Getting Pages repository"
	git clone https://github.com/victoriadrake/victoriadrake.github.io.git $(DESTDIR)

.PHONY: clean
clean:
	@echo "🧹 Cleaning old build"
	cd $(DESTDIR) && rm -rf *

.PHONY: get
get:
	@echo "❓ Checking for hugo"
	@if ! [ -x "$$(command -v hugo)" ]; then\
	  echo "🤵 Getting Hugo";\
	  wget -q -P tmp/ https://github.com/gohugoio/hugo/releases/download/v$(HUGO_VERSION)/hugo_extended_$(HUGO_VERSION)_Linux-64bit.tar.gz;\
	  tar xf tmp/hugo_extended_$(HUGO_VERSION)_Linux-64bit.tar.gz -C tmp/;\
	  sudo mv -f tmp/hugo /usr/bin/;\
	  rm -rf tmp/;\
	  hugo version;\
	fi

.PHONY: build
build:
	@echo "🍳 Generating site"
	hugo --gc --minify -d $(DESTDIR)

	@echo "🧂 Optimizing images"
	$(OPTIMIZE)

.PHONY: test
test:
	@echo "🍜 Testing HTML"
	docker run -v $(GITHUB_WORKSPACE)/$(DESTDIR)/:/mnt 18fgsa/html-proofer mnt --disable-external

.PHONY: deploy
deploy:
	@echo "🎁 Preparing commit"
	@cd $(DESTDIR) \
	&& git config user.email "hello@victoria.dev" \
	&& git config user.name "Victoria via GitHub Actions" \
	&& git add . \
	&& git status \
	&& git commit -m "🤖 CD bot is helping" \
	&& git push -f -q https://$(TOKEN)@github.com/victoriadrake/victoriadrake.github.io.git master
	@echo "🚀 Site is deployed!"

Sequentially, this workflow:

Clones the public Pages repository;
Cleans (deletes) the previous build files;
Downloads and installs the specified version of Hugo, if Hugo is not already present;
Builds the site;
Optimizes images;
Tests the built site with HTMLProofer; and
Prepares a new commit and pushes to the public Pages repository.

If you’re familiar with the command line, most of this may look familiar. Here are a couple bits that might warrant a little explanation.

Checking if a program is already installed

I think this bit is pretty tidy:

if ! [ -x "$$(command -v hugo)" ]; then\
...
fi

I use a negated if conditional in conjunction with command -v to check if an executable (-x) called hugo exists. If one is not present, the script gets the specified version of Hugo and installs it. This Stack Overflow answer has a nice summation of why command -v is a more portable choice than which.
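Pulled out of the Makefile, the same check can be sketched as a small shell function. The doubled $$ in the recipe is Make's escape for a literal $, so a plain shell script uses a single $. The is_installed name is hypothetical, chosen just for this example.

```shell
#!/bin/sh
# Hypothetical helper: succeed only if the named command resolves to an executable file.
# `command -v` prints the path of the command, or nothing if it cannot be found.
is_installed() {
    [ -x "$(command -v "$1")" ]
}
```

A script could then write if ! is_installed hugo; then ...; fi to decide whether a download step is needed. As in the Makefile, the -x test expects a file path, so shell builtins and aliases would not count as installed.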

Image optimization

My Makefile uses mogrify to batch resize and compress images in particular folders. It finds them automatically using the file extension, and only modifies images that are larger than the target size of 1000px in any dimension. I wrote more about the batch-processing one-liner in this post.

There are a few different ways to achieve this same task, one of which, theoretically, is to take advantage of Make’s suffix rules to run commands only on image files. I find the shell script to be more readable.
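Before letting mogrify loose on a directory, it can help to preview which files the find expression selects. The sketch below is a hypothetical dry-run helper (list_images is not part of my Makefile) that reuses the same filters and prints the matches instead of piping them to xargs. It uses !, the POSIX spelling of -not.

```shell
#!/bin/sh
# Hypothetical dry-run helper: print the image paths the find expression would select,
# skipping anything under a "static" directory, without modifying any files.
list_images() {
    find "$1" ! -path "*/static/*" \
        \( -name '*.png' -o -name '*.jpg' -o -name '*.jpeg' \) -print
}
```

Running list_images public would list every candidate by extension. Whether a given image is actually resized is still decided later by mogrify's '1000>' geometry, so the list can include images that would be left at their original size.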

Using Dockerized HTMLProofer

HTMLProofer is installed with gem, and uses Ruby and Nokogiri, which adds up to a lot of installation time for a CI workflow. Thankfully, 18F has a Dockerized version that is much faster to implement. Its usage requires starting the container with the built site directory mounted as a data volume, which is easily achieved by appending to the docker run command.

docker run -v /absolute/path/to/site/:/mounted-site 18fgsa/html-proofer /mounted-site

In my Makefile, I specify the absolute site path using the default environment variable GITHUB_WORKSPACE. I’ll dive into this and other GitHub Actions features in the next post.

In the meantime, happy Making!

How to quickly batch resize, compress, and convert images with a Bash one-liner

2019-10-14T08:27:49-04:00

Part of my Hugo site continuous deployment workflow is the processing of 210 images, at time of writing.

Here’s my one-liner:

find public/ -not -path "*/static/*" \( -name '*.png' -o -name '*.jpg' -o -name '*.jpeg' \) -print0 | xargs -0 -P8 -n2 mogrify -strip -thumbnail '1000>' -format jpg

I use find to target only certain image file formats in certain directories. With mogrify, part of ImageMagick, I resize only the images that are larger than a certain dimension, compress them, and strip the metadata. I tack on the format flag to create jpg copies of the images.

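Here’s the one-liner again, broken up and annotated for reading. The interleaved comments interrupt the line continuations, so run the single-line version above rather than pasting this one: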

# Look in the public/ directory
find public/ \
# Ignore directories called "static" regardless of location
-not -path "*/static/*" \
# Print the file paths of all files ending with any of these extensions
\( -name '*.png' -o -name '*.jpg' -o -name '*.jpeg' \) -print0 \
# Pipe the file paths to xargs and use up to 8 parallel workers, each processing 2 file paths at a time
| xargs -0 -P8 -n2 \
# Tell mogrify to strip metadata, and...
mogrify -strip \
# ...compress and resize any images larger than the target size (1000px in either dimension)
-thumbnail '1000>' \
# Convert the files to jpg format
-format jpg

That’s it. That’s the post.

How Engineering Leaders Build Security Culture Through Architecture Decisions

2019-09-30T08:03:12-04:00

Leading engineering teams means constantly balancing several goals: speed to market, feature development, technical debt, and security. When I’ve seen teams struggle with security, it’s rarely because they lack technical knowledge. The real challenge is creating an organizational culture where security decisions are prioritized, even under pressure.

The “we can do security later” mindset creates what I call security debt—technical decisions that make it exponentially harder to secure applications as they scale. Unlike other forms of technical debt, security debt compounds in immediately dangerous ways. A rushed architectural decision made to meet a deadline can become a persistent vulnerability that affects every feature built on top of it.

As engineering leaders, we have a unique opportunity to shape how our teams think about security. The architectural decisions we make and the frameworks we establish don’t just affect our current codebase—they define the security culture that will carry our teams through future challenges.

I’ve found that the most effective approach isn’t to mandate security practices after the fact, but to build security thinking into the fundamental architectural decisions that guide daily development work.

When teams understand why certain architectural patterns prevent entire classes of vulnerabilities, they start making secure choices naturally.

The Leadership Challenge: Building Security Culture Through Architecture

The difference between teams that build secure applications and those that struggle with security incidents comes down to how engineering leaders approach architectural decision-making. Security-conscious teams don’t just follow security checklists—they’ve internalized security principles that guide their architectural choices.

This cultural shift happens when engineering leaders consistently demonstrate how security considerations influence technical decisions. When your team sees you weighing security implications during architecture reviews, evaluating third-party libraries through a security lens, and making trade-offs that prioritize long-term security over short-term convenience, they learn to apply the same thinking to their own work.

The framework I’ve developed focuses on three architectural principles that, when consistently applied, create a foundation for security-conscious engineering culture:

Strategic Separation: Designing systems that isolate different types of data and functionality
Intentional Configuration: Making deliberate choices about system defaults and access patterns
Controlled Access: Building authorization thinking into system design from the start

These are both technical guidelines and leadership tools for building teams whose decisions promote built-in security by default.

Strategic Separation: Teaching Teams to Think in Security Boundaries

The most effective security-conscious engineering teams I’ve worked with share a common trait: they instinctively think in terms of security boundaries. This isn’t something that happens overnight—it’s a cultural shift that engineering leaders must deliberately cultivate through architectural decisions and team education.

When I talk about strategic separation, I mean designing systems that isolate different types of data and functionality based on their security requirements and organizational impact. The goal isn’t just to prevent specific vulnerabilities, but to create architectural patterns that make it obvious to your team when they’re crossing security boundaries.

Consider a common scenario that exposes how teams think about security:

Your team is building a user profile feature that includes photo uploads. The natural instinct is to store user photos alongside other application assets. After all, they’re both images. But this decision reveals whether your team thinks in terms of security boundaries.

A security-conscious team immediately recognizes that user-uploaded content and application assets have fundamentally different security requirements. Application assets are controlled, vetted, and part of your deployment process. User uploads are untrusted input that could contain malicious content or exploit path traversal vulnerabilities to access sensitive configuration files.

The architectural decision here can prevent path traversal attacks, but it also establishes a pattern that helps your team understand the security implications of data boundary decisions.

When you consistently demonstrate that different types of data require different security approaches, your team starts applying this thinking to database design, API endpoints, and service architecture.

This is where engineering leadership becomes crucial. The technical solution is straightforward: separate user-uploaded content from application assets using different storage systems, domains, or security contexts. But the leadership challenge is helping your team understand why this separation matters and how to apply the same thinking to future architectural decisions.

I’ve found that the most effective approach is to make security boundaries visible in your architecture discussions. When reviewing designs, ask questions like: “What happens if this data is compromised or malicious?” and “How would we contain an attack that starts here?” These questions help teams internalize security thinking rather than just following rules.

The goal is creating teams that instinctively separate concerns based on security requirements, not just functional requirements. When your team starts proposing separated architectures without being prompted, you know the culture shift is working.

Intentional Configuration: Building Security-Conscious Deployment Culture

Security misconfiguration represents one of the most persistent challenges in engineering leadership because it reveals gaps in team processes and organizational culture. The problem isn’t that engineers don’t understand security—it’s that deployment processes often prioritize speed over security verification.

I’ve seen engineering teams that had excellent security knowledge but still suffered issues because their deployment culture didn’t include security configuration validation. The issue compounds when teams are under pressure to ship features quickly, making it easy to rationalize skipping security configuration reviews.

The solution isn’t just better checklists or automated scanners, though those help. The real challenge is building a deployment culture where security configuration becomes as automatic as running tests. This requires engineering leaders to demonstrate that security configuration is a fundamental part of professional software deployment, not an optional extra.

When I work with teams on configuration security, I focus on three organizational patterns that prevent security misconfiguration:

Configuration as Code: Teams that treat configuration with the same rigor as application code naturally apply security thinking to deployment settings. When configuration changes require code review, security considerations become part of the discussion.
Default-Secure Patterns: Rather than relying on engineers to remember security settings, establish organizational patterns where secure configuration is the default. This might mean custom deployment templates, infrastructure as code patterns, or automated validation that catches insecure defaults.
Security Configuration Reviews: Just as you wouldn’t deploy code without reviewing it, security configuration should be part of your regular architecture review process. This creates opportunities for knowledge sharing and continuous improvement.

Security configuration problems are usually process problems disguised as technical problems.

When teams consistently deploy with insecure configurations, it’s often because their deployment process doesn’t include security validation points, not because they lack security knowledge.

Engineering leaders can transform this by making security configuration visible in deployment workflows. When your team sees you reviewing security settings during deployment reviews, asking questions about default configurations, and prioritizing security hardening alongside feature development, they learn to apply the same standards to their own work.

The goal is creating teams that instinctively question defaults and validate security configurations, not just when prompted by checklists, but because secure configuration has become part of their professional identity as engineers.

Controlled Access: Designing Authorization into Team Thinking

Access control failures represent a particularly insidious class of security vulnerabilities because they’re often invisible until it’s too late. Unlike other security issues that can be caught by automated tools, access control problems require human understanding of business logic and user relationships. This makes them a perfect example of how security culture directly impacts security outcomes.

The challenge for engineering leaders is that beyond being a technical problem, access control is a design thinking problem. Teams that build secure access controls don’t just implement authorization checks; they also think systematically about user relationships, privilege boundaries, and failure modes during the design phase.

I’ve observed that teams struggling with access control issues often share a common pattern: they build features first and add authorization as an afterthought.

This approach creates security debt that compounds over time, making it harder to reason about who should have access to what functionality.

The solution requires a shift in how teams approach feature development. Instead of thinking about authorization as something you add to features, security-conscious teams think about authorization as a fundamental constraint that shapes feature design.

Consider the difference between these two approaches when building an admin moderation feature:

Traditional Approach: Build the moderation interface, then add permission checks to prevent unauthorized access.

Security-First Approach: Design the moderation feature as a completely separate system with its own authentication context, making unauthorized access architecturally impossible.

The second approach requires more upfront planning, but it creates systems that are secure by design rather than secure by careful (and slower, and more costly) implementation. More importantly, it teaches teams to think about authorization as a design constraint, not just a technical requirement.

This shift in thinking has organizational implications beyond just security. When teams consistently design features with authorization constraints in mind, they develop better intuition about user workflows, system boundaries, and API design. The security thinking improves overall system design.

The goal is creating teams that instinctively design authorization into features rather than retrofitting it after implementation.

As engineering leaders, we can foster this thinking by making authorization design visible in architecture discussions. When reviewing feature proposals, ask questions like: “Who needs access to this?” and “How would we prevent privilege escalation?” These questions help teams internalize authorization thinking as part of their design process.

From Architecture to Culture: The Leadership Impact

The three architectural principles I’ve outlined—strategic separation, intentional configuration, and controlled access—represent more than just technical best practices. They’re tools for building engineering cultures that make security decisions instinctively rather than reactively.

The transformation happens when engineering leaders consistently demonstrate that security considerations are integral to professional software development. When your team sees you making architectural decisions that prioritize security boundaries, questioning configuration defaults, and designing authorization into features from the start, they learn to apply the same thinking to their own work.

This cultural shift has organizational benefits that extend far beyond security. Teams that think systematically about security boundaries also design better APIs, create more maintainable systems, and build more robust software overall. This security thinking improves engineering decision-making across the board.

Security thinking isn’t something you can successfully mandate through policies or checklists, though I’ve seen many organizations try. It emerges from the accumulated architectural decisions your team makes and the frameworks they internalize for thinking about system design. When security considerations become part of how your team naturally approaches technical problems, you’ve created something much more valuable than just secure applications—you’ve built a team that can adapt to new security challenges as they (constantly) emerge.

As engineering leaders, our role isn’t just to ensure our current systems are secure, but to build teams that will continue making security-conscious decisions as they face new challenges, technologies, and organizational pressures. The architectural principles we establish today define the security culture that will guide our teams through future unknowns.

Migrating to the cloud but without screwing it up, or how to move house

2019-09-23T08:03:12-04:00

For an application that’s ready to scale, not using managed cloud architecture these days is like insisting on digging your own well for water. It’s far more labour-intensive, requires buying all your own equipment, takes a lot more time, and there’s a higher chance you’re going to get it wrong because you don’t personally have a whole lot of experience digging wells, anyway.

That said - let’s just get this out of the way first - there is no cloud. It’s just someone else’s computer.

Of course, these days, cloud services go far beyond the utility we’d expect from a single computer. Besides being able to quickly set up and utilize the kind of computing power that previously required a new office lease agreement to house, there are now a multitude of monitoring, management, and analysis tools at our giddy fingertips. While it’s important to understand that the cloud isn’t a better option in every case, for applications that can take advantage of it, we can do more, do it faster, and do it for less money than if we were to insist on building our own on-premises infrastructure.

That’s all great, and easily said; moving to the cloud, however, can look from the outset like a pretty daunting task. How, exactly, do we go about shifting what may be years of on-premises data and built-up systems to someone else’s computer? You know, without being able to see it, touch it, and without completely screwing up our stuff.

While it probably takes less work and money than setting up or maintaining the same architecture on-premises, it does take some work to move to the cloud initially. It’s important that our application is prepared to migrate, and capable of using the benefits of cloud services once it gets there. To accomplish this, and a smooth transition, preparation is key. In fact, it’s a whole lot like moving to a new house.

In this article, we’ll take a high-level look at the general stages of taking an on-premises or self-hosted application and moving it to the cloud. This guide is meant to serve as a starting point for designing the appropriate process for your particular situation, and to enable you to better understand the cloud migration process. While cloud migration may not be the best choice for some applications - such as ones without scalable architecture or where very high computing resources are needed - a majority of modular and modern applications stand to benefit from a move to the cloud.

It’s certainly possible, as I discovered at a recent event put on by Amazon Web Services (AWS) Solutions Architects, to migrate smoothly and efficiently, with near-zero loss of availability to customers. I’ll specifically reference some services provided by AWS; however, similar functionality can be found with other cloud providers. I’ve found the offerings from AWS to be pleasantly modular in scope, which is why I use them myself and why they make good examples for discussing general concepts.

To have our move go as smoothly as possible, here are the things we’ll want to consider:

The type of move we’re making;
The things we’ll take, and the things we’ll clean up;
How to choose the right type and size for the infrastructure we’re moving into; and
How to do test runs to practice for the big day.

The type of move we’re making

While it’s important to understand why we’re moving our application to cloud services, we should also have an idea of what we’d like it to look like when it gets there. There are three main ways to move to the cloud: re-host, re-platform, or re-factor.

Re-host

A re-host scenario is the most straightforward type of move. It involves no change to the way our application is built or how it runs. For example, if we currently have Python code, use PostgreSQL, and serve our application with Apache, a re-host move would mean we use all the same components, combined in just the same way, only now they’re in the cloud. It’s a lot like moving into a new house that has the exact same floor plan as the current one. All the furniture goes into the same room it’s in now, and it’s going to feel pretty familiar when we get there.

The main draw of a re-host move is that it may offer the least amount of complication necessary in order to take advantage of going to the cloud. Scalable applications, for example, can gain the ability to automatically manage necessary application resources.

While re-hosting makes scaling more automatic, it’s important to note that it won’t in itself make an application scalable. If the application infrastructure is not organized in such a way that gives it the ability to scale, a re-factor may be necessary instead.

Re-platform

If a component of our current application setup isn’t working out well for us, we’re probably going to want to re-platform. In this case, we’re making a change to at least one component of our architecture; for example, switching our database from Oracle to MySQL on Amazon Relational Database Service (RDS).

Like moving from a small apartment in Tokyo to an equally small apartment in New York, a re-platform doesn’t change the basic nature of our application, but does change its appearance and environment. In the database change example, we’ll have all the same data, just organized or formatted a little differently. In most cases, we won’t have to make these changes manually. A tool such as Amazon Database Migration Service (DMS) can help to seamlessly shift our data over to the new database.

We might re-platform in order to enable us to better meet a business demand in the future, such as scaling up, integrating with other technological components, or choosing a more modern technology stack.

Re-factor

A move in which we re-factor our application is necessarily more complicated than our other options; however, it may provide the most overall benefit for companies or applications that have reason to make this type of move. As with code, refactoring is done when fundamental changes need to be made in order for our application to meet a business need. The specifics necessarily differ case-by-case, but typically involve changes to architectural components or how those components relate to one another. This type of move may also involve changing application code in order to optimize the application’s performance in a cloud environment. We can think of it like moving out from our parents’ basement in the suburbs and getting a nice townhouse in the city. There’s no way we’re taking that ancient hand-me-down sofa, so we’ll need some new furniture, and for our neighbour’s sake, probably window dressings.

Refactoring may enable us to modernize a dated application, or make it more efficient in general. With greater efficiency, we can better take advantage of services that cloud providers typically offer, like bursting resources or attaining deep analytical insight.

If a re-factor is necessary but time is scarce, it may be better to re-host or re-platform first, then re-factor later. That way, we’ll have a job well done later instead of a hasty, botched migration (and more problems) sooner.

What to take, and what to clean up

Over the years of living in one place, stuff tends to pile up unnoticed in nooks and crannies. When moving house, it’s usually a great opportunity to sort everything out and decide what is useful enough to keep, and what should be discarded or given away. Moving to the cloud is a similarly great opportunity to do the same when it comes to our application.

While cloud storage is inexpensive nowadays, there may be some things that don’t make sense to store any longer, or at least not keep stored with our primary application. If data cannot be discarded due to policy or regulations, we may choose a different storage class to house data that we don’t expect to need anytime soon outside of our main application.

In the case of Amazon’s Simple Storage Service (S3), we can choose to use different storage classes that accomplish this goal. While the data that our business relies on every day can take advantage of the Standard class’s 99.99% availability, data meant for long-term cold storage, such as archival backups, can be put into the Glacier class, which has a longer retrieval time and lower cost.
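As a rough illustration of how a storage class choice can be automated, here is a minimal sketch of an S3 lifecycle rule that moves objects to Glacier after a set number of days. The "backups/" prefix, the 30-day threshold, and the archive_rule helper are assumptions made up for this example, not recommendations.

```python
import json


def archive_rule(prefix: str, days: int) -> dict:
    """Build an S3 lifecycle configuration that transitions objects under
    `prefix` to the Glacier storage class once they are `days` old."""
    return {
        "Rules": [
            {
                "ID": f"archive-{prefix.strip('/')}",  # hypothetical rule name
                "Filter": {"Prefix": prefix},
                "Status": "Enabled",
                "Transitions": [{"Days": days, "StorageClass": "GLACIER"}],
            }
        ]
    }


# Hypothetical values: archival backups live under "backups/".
config = archive_rule("backups/", 30)
print(json.dumps(config, indent=2))

# Assuming boto3 is installed and credentials are configured, a dict of this
# shape could be passed as the LifecycleConfiguration argument of the S3
# client's put_bucket_lifecycle_configuration call for a bucket we own.
```

The sketch only builds and prints the configuration, so it is safe to run without an AWS account.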

The right type and size

Choosing the type and size of cloud infrastructure appropriate for our business is usually the part that can be the most confusing. How should we predict, in a new environment or for a growing company, the computing power we’ll need?

Part of the beauty of not procuring hardware on our own is that we won’t have to make predictions like these. Using cloud storage and instances, expanding or scaling back resources can be done in a matter of minutes, sometimes seconds. With managed services, it can even be done automatically for us. With the proper support for scalability in our application, it’s like having a magical house that instantly generates any type of room and amenity we need at that moment. The ability to continually ensure that we’re using appropriate, cost-effective resources is at our fingertips, and often clearly visualized in charts and dashboards.

For applications new to the cloud, some leeway for experimentation may be necessary. While cloud services enable us to quickly spin up and try out different architectures, there’s no guarantee that all of those setups will work well for our application. For example, running a single instance may be less expensive than going serverless, but we’d be hard pressed to know this until we tried it out.

As a starting point, we simply need enough storage and computing power to support the application as it is currently running, today. For example, in the case of storage, consider the size of the current database - the actual database data, not the total storage capacity of hardware on-premises. For a detailed cost exploration, AWS even offers a Simple Monthly Calculator with use case samples to help guide expectations.

Do test runs before the big day

Running a trial cloud migration may be an odd concept, but it is an essential part of ensuring that the move goes as planned with minimal service interruption. Imagine the time and energy that would be saved in the moving house example if we could automate test runs! Invariably, some box or still-hung picture is forgotten and left out of the main truck, necessitating additional trips in other vehicles. With multiple chances to ensure we’ve got it down pat, we minimize the possibility that our move causes any break in normal day-to-day business.

Generally, to do a test run, we create a duplicate version of our application. The more we can duplicate, the more thorough the test run will be, particularly if our data set is large. Though duplication may seem tedious, working with the actual components we intend to migrate is essential to ensuring the migration goes as planned. After all, if we only did a moving-house test run with one box, it wouldn’t be very representative.

Test runs can help to validate our migration plan against any challenges we may encounter. These challenges might include:

Downtime restrictions;
Encrypting data in transit and immediately when at rest on the target;
Schema conversion to a new target schema (the AWS Schema Conversion Tool can also help);
Access to databases, such as through firewalls or VPNs;
Developing a process to ensure that all the data successfully migrated, such as by using a hash function.
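The last item above, verifying migrated data with a hash function, can be sketched as follows. This is a minimal example that assumes we can fetch the same rows from the source and the target as tuples of comparable values; the sample rows and the table_digest helper are hypothetical. In a re-platform move, values may need to be normalized first (dates and decimals in particular), since different database engines can represent them differently.

```python
import hashlib


def table_digest(rows) -> str:
    """Return a SHA-256 digest of a set of rows that does not depend on
    the order in which the rows were returned."""
    digest = hashlib.sha256()
    # Sort the serialized rows so that differing row order in the source
    # and target does not change the result.
    for line in sorted(repr(tuple(row)) for row in rows):
        digest.update(line.encode("utf-8"))
        digest.update(b"\n")
    return digest.hexdigest()


# Hypothetical rows, standing in for query results from each database.
source_rows = [(1, "alice", "2019-01-01"), (2, "bob", "2019-02-01")]
target_rows = [(2, "bob", "2019-02-01"), (1, "alice", "2019-01-01")]

if table_digest(source_rows) == table_digest(target_rows):
    print("Digests match: the rows migrated intact.")
else:
    print("Digests differ: investigate before cutting over.")
```

For very large tables, comparing digests per chunk (for example, per range of primary keys) makes it easier to locate a mismatch than a single digest over everything.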

Test runs also help to give us a more accurate picture of the overall time that a migration will take, as well as affording us the opportunity to fine-tune it. Factors that may affect the overall speed of a migration include:

The sizes of the source and target instances;
Available bandwidth for moving data;
Schema configurations; and
Transaction pressure on the source, such as changes to the data and the volume of incoming transactions.
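Before a test run gives us real numbers, a back-of-the-envelope estimate based on bandwidth alone can set expectations. The sketch below is only a lower bound under stated assumptions: it ignores schema conversion, instance sizes, and transaction pressure, and the 80% utilization figure is a guess rather than a measured value.

```python
def transfer_hours(data_gb: float, bandwidth_mbps: float,
                   utilization: float = 0.8) -> float:
    """Estimate the hours needed to move `data_gb` gigabytes over a link of
    `bandwidth_mbps` megabits per second, of which only the fraction
    `utilization` is assumed to be usable for the migration."""
    bits_to_move = data_gb * 8 * 1000**3  # decimal gigabytes to bits
    effective_bits_per_second = bandwidth_mbps * 1_000_000 * utilization
    return bits_to_move / effective_bits_per_second / 3600


# Hypothetical example: a 500 GB database over a 100 Mbps connection.
print(f"{transfer_hours(500, 100):.1f} hours")
```

With these example numbers the estimate comes to roughly 13.9 hours, which is a useful reminder that the final sync on the big day should move only the data that changed since the last full copy.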

Once the duplicate application has been migrated via one or more options, we test the heck out of the application that’s now running in the cloud to ensure it performs as expected. Ideally, on the big day, we’d follow this same general process to move up-to-date duplicate data, and then seamlessly point the “real” application or web address to the new location in the cloud. This means that our customers experience near-zero downtime; essentially, only the amount of time that the change in location-pointing would need to propagate to their device.

In the case of very large or complex applications with many components or many teams working together at the same time, a more gradual approach may be more appropriate than the “Big Bang” approach, and may help to mitigate risk of any interruptions. This means migrating in stages, component by component, and running tests between stages to ensure that all parts of the application are communicating with each other as expected.

Preparation is essential to a smooth migration

I hope this article has enabled a more practical understanding of how cloud migration can be achieved. With thorough preparation, it’s possible to take advantage of all the cloud has to offer, with minimal hassle to get there.

My thanks to the AWS Solutions Architects who presented at Pop-Up Loft and shared their knowledge on these topics, in particular: Chandra Kapireddy, Stephen Moon, John Franklin, Michael Alpaugh, and Priyanka Mahankali.

One last nugget of wisdom, courtesy of John: “Friends don’t let friends use DMS to create schema objects.”

How users and applications stay safe on the Internet: it's proxy servers all the way down

2019-09-16T09:35:28-04:00

Both Internet users and Internet-connected applications can benefit from investing in cybersecurity. One core aspect of online privacy is the use of a proxy server, though this basic building block may not be initially visible underneath its more recognizable forms. Proxy servers are a useful thing to know about nowadays, for developers, software product owners, as well as the average dog on the Internet. Let’s explore what makes proxy servers an important piece of cybersecurity support.

“On the Internet, nobody knows you’re a dog.”

When Peter Steiner’s caption was first published in The New Yorker in 1993, it reportedly went largely unnoticed. Only later did the ominous and slightly frightening allusion to online anonymity touch the public consciousness with the icy fingers of the unknown. As Internet usage became more popular, users became concerned that other people could represent themselves online in any manner they chose, without anyone else knowing who they truly were.

This, to make a gross understatement, is no longer the case. Thanks to tracking cookies, browser fingerprinting, Internet Service Providers (ISPs) selling our browsing logs to advertisers, and our own inexplicable inclination to put our names and faces on social networks, online anonymity is out like last year’s LaCroix flavours. While your next-door neighbor may not know how to find you online (well, except for through that location-based secondhand marketplace app you’re using), you can be certain that at least one large advertising company has a series of zeroes and ones somewhere that represent you, the specific details of your market demographic, and all your online habits, including your preferred flavour of LaCroix.

There are ways to add some layers of obscurity, like using a corporate firewall that hides your IP, or using Tor. The underlying mechanism of both these methods is the same. Like being enshrouded in the layers of an onion, we’re using one or more proxy servers to shield our slightly sulfuric selves from third-party tracking.

What’s a proxy server, anyway

A proxy, in the traditional English definition, is the “authority or power to act for another.” (Merriam-Webster) A proxy server, in the computing context, is a server that acts on behalf of another server, or a user’s machine.

By using a proxy to browse the Internet, for example, a user can defer being personally identifiable. All of the user’s Internet traffic appears to come from the proxy server instead of their machine.

Proxy servers are for users

There are a few ways that we, as the client, can use a proxy server to conceal our identity when we go online. It’s important to know that these methods offer differing levels of anonymity, and that no single method will really provide true anonymity; if others are actively seeking to find you on the Internet, for whatever reason, further steps should be taken to make your activity truly difficult to identify. (Those steps are beyond the scope of this article, but you can get started with the Electronic Frontier Foundation’s (EFF) Surveillance Self-Defense resource.) For the average user, however, here is a small menu of options ranging from least to most anonymous.

Use a proxy in your web browser

Certain web browsers, including Firefox and Safari on Mac, allow us to configure them to send our Internet traffic through a proxy server. The proxy server attempts to anonymize our requests by replacing our originating IP address with the proxy server’s own IP. This provides us with some anonymity, as the website we’re trying to reach will not see our originating IP address; however, the proxy server that we choose to use will know exactly who originated the request. This method also doesn’t necessarily encrypt traffic, block cookies, or stop social media and cross-site trackers from following us around; on the upside, it’s the method least likely to prevent websites that use cookies from functioning properly.

Public proxy servers are out there, and deciding whether or not we should use any one of them is on par with deciding whether we should eat a piece of candy handed to us by a smiling stranger. If your academic institution or company provides a proxy server address, it is (hopefully) a private server with some security in place. My preferred method, if we have a little time and a few monthly dollars to invest in our security, is to set up our own virtual instance with a company such as Amazon Web Services or Digital Ocean and use this as our proxy server.

To use a proxy through our browser, we can edit our Connection Settings in Firefox, or set up a proxy server using Safari on Mac.
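The same idea applies outside the browser. As a minimal sketch, here is how a Python script could send its requests through a proxy using only the standard library. The proxy address proxy.example.com:8080 is a placeholder that stands in for a server we control or trust.

```python
import urllib.request

# Hypothetical proxy address; replace it with a proxy you control or trust.
PROXY_URL = "http://proxy.example.com:8080"

# Route both HTTP and HTTPS requests through the proxy.
proxy_handler = urllib.request.ProxyHandler(
    {"http": PROXY_URL, "https": PROXY_URL}
)
opener = urllib.request.build_opener(proxy_handler)


def fetch(url: str, timeout: float = 10.0) -> bytes:
    """Fetch `url` through the proxy. The destination sees the proxy's IP
    address as the origin of the request instead of ours."""
    with opener.open(url, timeout=timeout) as response:
        return response.read()
```

As with the browser setting, this only changes where our requests appear to come from. The proxy itself still sees who we are and what we ask for.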

With regard to choosing a browser, I would happily recommend Firefox to any Internet user who wants to beef up the security of their browsing experience right out of the box. Mozilla has championed a privacy-first approach for as long as I’ve known of them, and recently made some well-received changes to Enhanced Tracking Protection in Firefox Browser, which now blocks social media trackers, cross-site tracking cookies, fingerprinters, and cryptominers by default.

Use a VPN on your device

In order to take advantage of a proxy server for all our Internet usage instead of just through one browser, we can use a Virtual Private Network (VPN). A VPN is a service, usually paid, that sends our Internet traffic through their servers, thus acting as a proxy. A VPN can be used on our laptop as well as phone and tablet devices, and since it encompasses all our Internet traffic, it doesn’t require much extra effort to use other than ensuring our device is connected. Using a VPN is an effective way to keep nosy ISPs from snooping on our requests.

To use a paid, third-party VPN service, we’d usually sign up on their website and download their app. It’s important to keep in mind that whichever provider we choose, we’re entrusting them with our data. VPN providers anonymize our activity from the Internet, but can themselves see all our requests. Providers vary in terms of their privacy policies and the data they choose to log, so a little research may be necessary to determine which, if any, we are comfortable trusting.

We can also roll our own VPN service by using a virtual instance and OpenVPN. OpenVPN is an open source VPN protocol, and can be used with a few virtual instance providers, such as Amazon VPC, Microsoft Azure, Google Cloud, and Digital Ocean Droplets. I previously wrote a tutorial on setting up your own personal VPN service with AWS using an EC2 instance. I’ve been running this solution personally for about a month, and it’s cost me almost $4 USD in total, which is a price I’m quite comfortable paying for some peace of mind.

Use Tor

Tor takes the anonymity offered by a proxy server and compounds it by forwarding our requests through a relay network of other servers, each called a “node.” Our traffic passes through three nodes on its way to a destination: the guard, middle, and exit nodes. At each step, the request is encrypted and anonymized such that the current node only knows where to send it, and nothing more about what the request contains. This separation of knowledge means that, of the options discussed, Tor provides the most complete version of anonymity. (For a more complete explanation, see Robert Heaton’s article on how Tor works, which is so excellently done that I wish I’d written it myself.)
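To make the layering idea concrete, here is a toy model of wrapping a request once per node. It is emphatically not how Tor is implemented: the cipher below is a stand-in built from SHA-256 and XOR that must never be used for real security, the keys are hard-coded where a real circuit would negotiate them, and the names wrap and peel are made up for this sketch. It only shows that each node can remove exactly one layer and learns nothing but the next stop.

```python
import base64
import hashlib
import json


def toy_cipher(key: bytes, data: bytes) -> bytes:
    """XOR `data` with a keystream derived from `key`. Applying it twice
    with the same key returns the original data. Toy only: NOT secure."""
    stream = bytearray()
    counter = 0
    while len(stream) < len(data):
        stream.extend(hashlib.sha256(key + counter.to_bytes(8, "big")).digest())
        counter += 1
    return bytes(a ^ b for a, b in zip(data, stream))


def wrap(request: bytes, destination: str, hops) -> bytes:
    """Wrap `request` in one layer per hop. `hops` is a list of
    (name, key) pairs in travel order: guard, middle, exit."""
    next_stops = [name for name, _ in hops[1:]] + [destination]
    payload = request
    # Build from the inside out, so the guard's layer ends up outermost.
    for (_, key), next_stop in reversed(list(zip(hops, next_stops))):
        layer = json.dumps(
            {"next": next_stop, "payload": base64.b64encode(payload).decode()}
        )
        payload = toy_cipher(key, layer.encode("utf-8"))
    return payload


def peel(key: bytes, blob: bytes):
    """Remove one layer. Returns the next stop and the still-wrapped rest."""
    layer = json.loads(toy_cipher(key, blob))
    return layer["next"], base64.b64decode(layer["payload"])


# Hypothetical circuit with made-up keys.
hops = [("guard", b"key-1"), ("middle", b"key-2"), ("exit", b"key-3")]
blob = wrap(b"GET / HTTP/1.1", "example.com", hops)
for name, key in hops:
    next_stop, blob = peel(key, blob)
    print(f"{name} forwards to {next_stop}")
```

Only after the exit node removes the final layer does the original request appear, and by then it is far removed from the machine that sent it.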

That said, this level of anonymity comes with its own cost. Not monetary, as Tor Browser is free to download and use. It is, however, slower than using a VPN or simple proxy server through a browser, due to the circuitous route our requests take.

Proxy servers are for servers too

We’re now familiar with proxy servers in the context of protecting users as they surf the web, but proxies aren’t just for clients. Websites and Internet-connected applications can use reverse proxy servers for obfuscation too. The “reverse” part just means that the proxy is acting on behalf of the server, instead of the client.

Why would a web server care about anonymity? Generally, they don’t, at least not in the same way some users do. Web servers can benefit from using a proxy for a few different reasons; for example, they typically offer faster service to users by caching or compressing content to optimize delivery. From a cybersecurity perspective, however, a reverse proxy can improve an application’s security posture by obfuscating the underlying infrastructure.

Basically, by placing another web server (the “proxy”) in front of the web server that directly accesses all the files and assets, we make it more difficult for an attacker to pinpoint our “real” web server and mess with our stuff. Like when you want to see the store manager and the clerk you’re talking to says, “I speak for the manager,” and you’re not really sure there even is a manager, anyway, but you successfully exchange the hot pink My Little Pony they sold you for a fuchsia one, thankyouverymuch, so now you’re no longer concerned with who the manager is and whether or not they really exist, and if you passed them on the street you would not be able to stop them and call them out for passing off hot pink as fuchsia, and the manager is just fine with that.

Some common web servers can also act as reverse proxies, often with just a minimal and straightforward configuration change. While the best choice for your particular architecture is unknown to me, I will offer a couple common examples here.

Using NGINX as a reverse proxy

NGINX uses the proxy_pass directive in its configuration file (nginx.conf by default) to turn itself into a reverse proxy server. The setup requires the following lines to be placed in the configuration file:

location /requested/path/ {
    proxy_pass http://www.example.com/target/path/;
}

This specifies that all requests for the path /requested/path/ are forwarded to http://www.example.com/target/path/. The target can be a domain name or an IP address, and either may include a port.
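To illustrate the mapping this configuration performs, here is a small sketch of the path rewriting only. It is a simplified model, not NGINX's implementation: the function name map_request is made up, and real NGINX also normalizes the URI, forwards headers, and handles query strings.

```python
def map_request(path: str,
                location: str = "/requested/path/",
                target: str = "http://www.example.com/target/path/"):
    """Model of proxy_pass with a URI: the part of the request path that
    matches the location prefix is replaced by the target URI."""
    if not path.startswith(location):
        return None  # This location block would not handle the request.
    return target + path[len(location):]


print(map_request("/requested/path/page.html"))
print(map_request("/somewhere/else/"))
```

A request for /requested/path/page.html is passed to http://www.example.com/target/path/page.html, while a request outside the location prefix is left for other rules to handle.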

The full guide to using NGINX as a reverse proxy is part of the NGINX documentation.

Using Apache httpd as a reverse proxy

Apache httpd similarly requires some straightforward configuration to act as a reverse proxy server. In the configuration file, usually httpd.conf, set the following directives:

ProxyPass "/requested/path/"  "http://www.example.com/target/path/"
ProxyPassReverse "/requested/path/"  "http://www.example.com/target/path/"

The ProxyPass directive ensures that all requests for the path /requested/path/ are forwarded to http://www.example.com/target/path/. The ProxyPassReverse directive ensures that the headers sent by the web server are modified to point to the reverse proxy server instead.
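The header rewriting that ProxyPassReverse performs can be modeled in a few lines. This is a simplified sketch rather than Apache's implementation: the public address proxy.example.org and the function name rewrite_location are assumptions for illustration, and it looks only at the Location header of a redirect.

```python
def rewrite_location(
    header_value: str,
    proxy_base: str = "https://proxy.example.org/requested/path/",  # hypothetical
    backend_base: str = "http://www.example.com/target/path/",
) -> str:
    """If a redirect from the backend points at the backend's own address,
    rewrite it to the matching address on the reverse proxy."""
    if header_value.startswith(backend_base):
        return proxy_base + header_value[len(backend_base):]
    return header_value  # Redirects to other sites pass through unchanged.


print(rewrite_location("http://www.example.com/target/path/login"))
```

Without this step, a redirect issued by the backend would send the visitor's browser straight to the underlying server and bypass the proxy.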

The full reverse proxy guide for Apache HTTP server is available in their documentation.

Proxy servers most of the way down

I concede that my title is a little facetious, as cybersecurity best practices aren’t really some eternal infinite-regression mystery (though they may sometimes seem to be). Regardless, I hope this post has helped in your understanding of what proxy servers are, how they contribute to online anonymity for both clients and servers, and that they are an integral building block of cybersecurity practices.

If you’d like to learn more about personal best practices for online security, I highly recommend exploring the articles and resources provided by EFF. For a guide to securing web sites and applications, the OWASP Cheat Sheet Series is a fantastic resource.

Building Data Protection Culture: Why Engineering Leaders Must Address the Human Side of Security

2019-09-09T09:10:11-04:00

The most frustrating security incidents I’ve dealt with as an engineering leader weren’t caused by sophisticated attacks or zero-day vulnerabilities. They were caused by well-intentioned team members who accidentally exposed sensitive data through everyday tools and processes. A developer pasting API keys into a public Slack channel. A support engineer sharing database credentials through an unsecured text-sharing service. A product manager including real customer data in a publicly accessible report.

These incidents reveal a fundamental truth about data protection: it’s not primarily a technical problem—it’s an organizational one. The security of our applications depends as much on how our teams handle sensitive data in their daily workflows as it does on our encryption algorithms or access controls.

The challenge for engineering leaders is that traditional security approaches focus on technical controls while overlooking the human systems that determine how data actually flows through our organizations. When we treat data protection as purely a technical problem, we create a gap between our security policies and the reality of how our teams work.

Teams with the strongest data protection practices don’t rely on security tools alone. They have organizational cultures that make secure data handling feel natural and obvious.

Building this kind of culture requires engineering leaders to think beyond technical solutions and address the underlying organizational patterns that lead to data exposure. The goal isn’t just to prevent specific incidents, but to create teams that instinctively handle sensitive data securely, even under pressure.

The Reality of Data Exposure: It’s Happening Right Now

Before diving into solutions, it’s worth understanding just how pervasive data exposure has become. The reality is that sensitive data from organizations of all sizes is readily discoverable through simple search techniques. A quick search for site:pastebin.com "api_key" or site:github.com "password" reveals thousands of exposed credentials, database connections, and API keys from companies around the world.

This isn’t theoretical—it’s happening to teams just like yours, right now. The Google Hacking Database catalogs thousands of search queries that can expose sensitive data, and security researchers regularly discover new leaked credentials on platforms like Pastebin, GitHub, and even internal Slack channels that have been accidentally made public.

API keys exposed through public paste services—a common data exposure pattern.

The scale of this problem reveals something important: data exposure isn’t just a technical failure, it’s a systematic organizational problem. When thousands of developers across hundreds of companies make the same types of mistakes, it suggests that our industry-wide approach to data protection is fundamentally flawed.

The Leadership Challenge: Beyond Technical Solutions

Most engineering leaders approach data protection through technical controls: better encryption, more restrictive access policies, automated scanning tools. These controls are important, but they don’t address the root cause of most data exposure incidents.

The real challenge is organizational. When team members expose sensitive data, it’s usually because they’re working around limitations in their tools or processes. They’re not necessarily being careless—they’re trying to get their work done efficiently within the constraints of their environment.

Consider these common scenarios:

The Developer’s Dilemma: A developer needs to share a database connection string with a colleague to debug a production issue. The “secure” process involves filing a ticket, waiting for approval, and scheduling a meeting. The expedient process involves pasting it into Slack. Under deadline pressure, which do you think happens more often?

The Support Engineer’s Challenge: A support engineer needs to share customer data with the product team to investigate a bug. The secure process requires anonymizing the data, which takes time they don’t have. The expedient process involves copying real data into a shared document.

The Product Manager’s Bind: A product manager needs to create test data for a demo. The secure process involves generating fake data that matches production patterns. The expedient process involves copying a subset of real customer data.

In each case, the data exposure isn’t caused by malicious intent or even carelessness—it’s caused by organizational friction between security requirements and operational reality.

The most effective approach to data protection isn’t to increase security friction, but to reduce the friction around secure practices while making insecure practices more obviously problematic.

This is where engineering leadership becomes crucial. Technical solutions alone can’t bridge the gap between security policies and operational needs. That requires organizational changes that make secure data handling the easiest and most obvious way to work.

Building Organizational Capabilities for Data Protection

The teams I’ve worked with that have the strongest data protection practices share three organizational capabilities:

1. Secure-by-Default Tooling

Instead of relying on people to remember security practices, these teams build security into their daily tools. This might mean:

Internal paste services that automatically expire and require authentication instead of relying on external tools
Automated credential management through tools like AWS Secrets Manager or HashiCorp Vault that make secure credential sharing easier than insecure sharing
Data anonymization tools that make it trivial to generate realistic test data without using production data
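As one small example of the last item, here is a sketch of a helper that replaces real identifiers with stable fake ones, so that test data keeps its shape without exposing customers. The field names, the secret value, and the function names are all hypothetical, and a real anonymization tool would need to cover every sensitive field, not just these two.

```python
import hashlib
import hmac

# Hypothetical secret; in practice it would come from a secrets manager.
SECRET = b"replace-me-and-keep-me-out-of-source-control"


def pseudonymize_email(email: str, secret: bytes = SECRET) -> str:
    """Map a real email address to a stable fake one. The same input always
    produces the same output, so joins across tables still line up."""
    normalized = email.strip().lower().encode("utf-8")
    token = hmac.new(secret, normalized, hashlib.sha256).hexdigest()[:12]
    return f"user_{token}@example.com"


def anonymize(record: dict) -> dict:
    """Return a copy of a customer record that is safe to use in a demo."""
    clean = dict(record)
    clean["email"] = pseudonymize_email(record["email"])
    clean["name"] = "Test User"
    return clean


# Hypothetical record, standing in for a row of production data.
print(anonymize({"id": 42, "name": "Jane Doe", "email": "Jane@Corp.example"}))
```

Using a keyed hash rather than a plain one means that someone who knows a customer's email address cannot recompute the fake value without also knowing the secret.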

People will use the most convenient option available. If secure practices are more convenient than insecure practices, people will choose secure practices naturally.

2. Visible Security Boundaries

Teams with strong data protection practices make security boundaries obvious in their workflows. This includes:

Clear data classification that helps team members understand what constitutes sensitive data
Workflow integration that flags when someone is about to share sensitive data through insecure channels
Regular security boundary discussions during architectural reviews and design meetings
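The workflow integration idea can start very small. The sketch below checks a piece of text for a few patterns that often indicate a credential before it is posted or committed. The pattern list is illustrative and far from complete, the function name find_secrets is made up, and simple pattern matching like this will produce both false positives and misses.

```python
import re

# A few illustrative patterns; a real tool would need many more.
PATTERNS = {
    "AWS access key ID": re.compile(r"\bAKIA[0-9A-Z]{16}\b"),
    "credential assignment": re.compile(
        r"(?i)\b(api[_-]?key|secret|password)\b\s*[:=]\s*['\"]?[^\s'\"]{8,}"
    ),
    "private key header": re.compile(
        r"-----BEGIN (?:RSA |EC |OPENSSH )?PRIVATE KEY-----"
    ),
}


def find_secrets(text: str) -> list:
    """Return the names of every pattern that appears in `text`."""
    return [name for name, pattern in PATTERNS.items() if pattern.search(text)]


# Hypothetical message that someone is about to paste into a chat channel.
message = "try this: api_key = 'abcd1234efgh5678'"
warnings = find_secrets(message)
if warnings:
    print("Hold on, this looks sensitive:", ", ".join(warnings))
```

A check like this could run as a pre-commit hook or a chat integration. The point is less to catch everything than to make the security boundary visible at the moment it matters.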

When security boundaries are visible and well-understood, team members can make informed decisions about data handling without needing to become security experts.

3. Treating Security Mistakes as Learning Opportunities

Perhaps most importantly, these teams create environments where people feel encouraged to report security mistakes or near-misses. This cultural element is often overlooked, but it’s crucial for continuous improvement.

Teams that punish security mistakes create incentives for people to hide problems rather than fix them. Teams that treat security mistakes as learning opportunities create incentives for people to surface issues early and help improve processes.

The Engineering Leader’s Role: Creating Sustainable Change

Building these organizational capabilities requires engineering leaders to approach data protection as a change management challenge, not just a technical one. This means:

Making the Case for Investment: Security tooling and process improvements often require upfront investment that may not have immediate visible returns. Engineering leaders need to advocate for this investment by helping stakeholders understand the organizational costs of data exposure incidents.
Modeling Secure Behavior: When engineering leaders consistently demonstrate secure data handling practices in their own work, it signals to the team that these practices are valued and expected.
Addressing Process Gaps: When team members work around security processes, it’s often because those processes don’t meet their operational needs. Engineering leaders need to identify and address these gaps rather than simply enforcing compliance.
Celebrating Security Improvements: Teams that recognize and celebrate security improvements create cultures where security work is valued rather than seen as overhead.

The goal isn’t to eliminate all possibility of data exposure—that’s impractical.

The goal is to build organizational capabilities that make data exposure increasingly unlikely and ensure that when incidents do occur, they’re caught and resolved quickly.

From Reactive to Proactive: Building Long-Term Security Culture

The most successful engineering leaders I’ve worked with approach data protection as an ongoing organizational capability rather than a one-time project. This means:

Regular Security Culture Assessment: Periodically evaluating whether your team’s tools and processes support secure data handling or create friction that encourages workarounds.
Continuous Tool Investment: Investing in tools and processes that make secure data handling easier and more convenient than insecure alternatives.
Cross-Functional Security Discussions: Including security considerations in product planning, design reviews, and operational discussions rather than treating security as a separate concern.
Security Skill Development: Helping team members develop the knowledge and judgment needed to make good security decisions in novel situations.

The teams that excel at data protection have built organizational cultures where security is valued as part of the development lifecycle. This cultural shift doesn’t happen overnight, but it creates sustainable security practices that adapt to new challenges and technologies.

As engineering leaders, our role is to build teams that will continue making secure choices as they face new tools, processes, and organizational pressures. The organizational capabilities we build today define how our teams will handle tomorrow’s security challenges.

When data protection becomes embedded in your team’s culture rather than imposed through policies, you’ve created something much more valuable than just better security—you’ve built a team that can adapt to new security challenges while maintaining the operational efficiency needed to build great products.

SQL injection and XSS: what white hat hackers know about trusting user input

2019-09-02T09:01:23-04:00

Software developers have a lot on their minds. There are a myriad of questions to ask when it comes to creating a website or application: What technologies will we use? How will the architecture be set up? What functions do we need? What will the UI look like? Especially in a software market where shipping new apps seems more like a race for reputation than a well-considered process, one of the most important questions often falls to the bottom of the “Urgent” column: how will our product be secured?

If you’re using a robust, open-source framework for building your product (and if one is applicable and available, why wouldn’t you?) then some basic security concerns, like CSRF tokens and password encryption, may already be handled for you. Still, fast-moving developers would be well served to brush up on their knowledge of common threats and pitfalls, if only to avoid some embarrassing rookie mistakes. Usually, the weakest point in the security of your software is you.

I’ve recently become more interested in information security in general, and practicing ethical hacking in particular. An ethical hacker, sometimes called a “white hat” hacker, and sometimes just a “hacker,” is someone who searches for possible security vulnerabilities and responsibly (privately) reports them to project owners. By contrast, a malicious or “black hat” hacker, also called a “cracker,” is someone who exploits these vulnerabilities for amusement or personal gain. Both white hat and black hat hackers might use the same tools and resources, and generally try to get into places they aren’t supposed to be; however, white hats do this with permission, and with the intention of fortifying defences instead of destroying them. Black hats are the bad guys.

When it comes to learning how to find security vulnerabilities, it should come as no surprise that I’ve been devouring whatever information I can get my hands on; this post is a distillation of some key areas that are specifically helpful to developers when handling user input. These lessons have been collectively gleaned from these excellent resources:

The Open Web Application Security Project guides
The Hacker101 playlist from HackerOne’s YouTube channel
Web Hacking 101 by Peter Yaworski
The Computerphile YouTube channel
Videos featuring Jason Haddix (@jhaddix) and Tom Hudson (@tomnomnom) (two accomplished ethical hackers with different, but both effective, methodologies)

You may be familiar with the catchphrase, “sanitize your inputs!” However, as I hope this post demonstrates, developing an application with robust security isn’t quite so straightforward. I suggest an alternate phrase: pay attention to your inputs. Let’s elaborate by examining the most common attacks that take advantage of vulnerabilities in this area: SQL injection and cross site scripting.

SQL injection attacks

If you’re not yet familiar with SQL (Structured Query Language) injection attacks, or SQLi, here is a great explain-like-I’m-five video on SQLi. You may already know of this attack from xkcd’s Little Bobby Tables. Essentially, malicious actors may be able to send SQL commands that affect your application through some input on your site, like a search box that pulls results from your database. Sites coded in PHP can be especially susceptible to these, and a successful SQL attack can be devastating for software that relies on a database (as in, your Users table is now a pot of petunias).

You have no chance to survive make your time.

You can test your own site to see if you’re susceptible to this kind of attack. (Please only test sites that you own, since running SQL injections where you don’t have permission to be doing so is, possibly, illegal in your locality; and definitely, universally, not very funny.) The following payloads can be used to test inputs:

' OR 1='1 evaluates to a constant true, and when successful, returns all rows in the table.
' AND 0='1 evaluates to a constant false, and when successful, returns no rows.

This video demonstrates the above tests, and does a great job of showing how impactful an SQL injection attack can be.
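
To see why payloads like these work, here is a minimal Python sketch of a deliberately vulnerable lookup. The users table, the find_user_unsafe function, and the in-memory SQLite database are all made up for illustration. One caveat: SQLite does not treat the number 1 and the text '1' as equal, so this sketch uses the all-text variants ' OR '1'='1 and ' AND '0'='1 in place of the payloads listed above.

```python
import sqlite3


def find_user_unsafe(conn, name):
    # Deliberately vulnerable: the input is pasted straight into the SQL text.
    query = "SELECT name FROM users WHERE name = '" + name + "'"
    return [row[0] for row in conn.execute(query)]


# Hypothetical in-memory table, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT)")
conn.executemany("INSERT INTO users (name) VALUES (?)", [("alice",), ("bob",)])

# A well-behaved input matches a single row.
normal = find_user_unsafe(conn, "alice")
# The query becomes: ... WHERE name = '' OR '1'='1'  (true for every row)
always_true = find_user_unsafe(conn, "' OR '1'='1")
# The query becomes: ... WHERE name = '' AND '0'='1' (true for no row)
always_false = find_user_unsafe(conn, "' AND '0'='1")

print(normal, always_true, always_false)
```

The quote at the start of each payload closes the string the developer opened, and everything after it is read as SQL rather than as data.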

Thankfully, there are ways to mitigate SQL injection attacks, and they all boil down to one basic concept: don’t trust user input.

SQL injection mitigation

In order to effectively mitigate SQL injections, developers must prevent users from being able to successfully submit raw SQL commands to any part of the site.

Some frameworks will do most of the heavy lifting for you. For example, Django implements the concept of Object-Relational Mapping, or ORM, with its use of QuerySets. We can think of these as wrapper functions that help your application query the database using pre-defined methods that avoid the use of raw SQL.

Being able to use a framework, however, is never a guarantee. When dealing directly with a database, there are other methods we can use to safely abstract our SQL queries from user input, though they vary in efficacy. These are, by order of most to least preferred, and with links to relevant examples:

Prepared statements with variable binding (or parameterized queries),
Stored procedures; and
Whitelisting or escaping user input.

If you want to implement the above techniques, the linked cheatsheets are a great starting point for digging deeper. Suffice to say, the use of these techniques to obtain data instead of using raw SQL queries helps to minimize the chances that SQL will be processed by any part of your application that takes input from users, thus mitigating SQL injection attacks.
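
As a rough illustration of the first and last techniques, here is a minimal Python sketch using the standard library's sqlite3 module. The table, the find_users function, and the ALLOWED_SORT_COLUMNS set are hypothetical; other database drivers use different placeholder styles, so treat this as a sketch rather than a recipe.

```python
import sqlite3

# Hypothetical allow-list of columns a user may sort by.
ALLOWED_SORT_COLUMNS = {"name", "joined"}


def find_users(conn, name, sort_by="name"):
    # Identifiers such as column names cannot be bound as parameters,
    # so they are checked against a fixed allow-list instead.
    if sort_by not in ALLOWED_SORT_COLUMNS:
        raise ValueError("unsupported sort column")
    # The ? placeholder binds name as a value; it is never parsed as SQL.
    query = "SELECT name FROM users WHERE name = ? ORDER BY " + sort_by
    return [row[0] for row in conn.execute(query, (name,))]


# Hypothetical in-memory table, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (name TEXT, joined TEXT)")
conn.executemany(
    "INSERT INTO users (name, joined) VALUES (?, ?)",
    [("alice", "2019-01-01"), ("bob", "2019-02-01")],
)

print(find_users(conn, "alice"))
print(find_users(conn, "' OR '1'='1"))
```

With the bound parameter, an injection attempt is simply compared against the name column as an odd-looking string, so it matches nothing.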

The battle, however, is only half won…

Cross Site Scripting (XSS) attacks

If you’re a malicious coder, JavaScript is pretty much your best friend. The right commands will do anything a legitimate user could do (and even some things they aren’t supposed to be able to) on a web page, sometimes without any interaction on the part of an actual user. Cross Site Scripting attacks, or XSS, occur when JavaScript code is injected into a web page and changes that page’s behavior. Its effects can range from prank nuisance occurrences to more severe authentication bypasses or credential stealing.

The annual DOM dance-off receives an unexpected guest);

XSS can occur on the server or on the client side, and generally comes in three flavors: DOM (Document Object Model) based, stored, and reflected XSS. The differences amount to where the attack payload is injected into the application.

DOM-based XSS

DOM-based XSS occurs when a JavaScript payload affects the structure, behavior, or content of the web page the user has loaded in their browser. These are most commonly executed through modified URLs, such as in phishing.

To see how easy it would be for injected JavaScript to manipulate a page, we can create a working example with an HTML web page. Try creating a file on your local system called xss-test.html (or whatever you like) with the following HTML and JavaScript code:

<html>
    <head>
        <title>My XSS Example</title>
    </head>
    <body>
        <h1 id="greeting">Hello there!</h1>
        <script>
            // A global "var name" is window.name, which stores strings,
            // so a missing parameter ends up as the string 'null'
            var name = new URLSearchParams(document.location.search).get('name');
            if (name !== 'null') {
                document.getElementById('greeting').innerHTML = 'Hello ' + name + '!';
            }
        </script>
    </body>
</html>

This web page will display the title “Hello there!” unless it receives a URL parameter from a query string with a value for name. To see the script work, open the page in a browser with an appended URL parameter, like so:

file:///path/to/file/xss-test.html?name=Victoria

Fun, right? Our insecure (in the safety sense, not the emotional one) page takes the URL parameter value for name and displays it in the DOM. The page is expecting the value to be a nice friendly string, but what if we change it to something else? Since the page is owned by us and only exists on our local system, we can test it all we like. What happens if we change the name parameter to, say, an HTML tag that carries its own JavaScript, such as an image tag whose onerror handler calls alert()?

This is just one example that demonstrates how an XSS attack could be executed. Funny pop-up alerts may be amusing, but JavaScript can do a lot of harm, including helping malicious attackers steal passwords and personal information.

Stored and reflected XSS

Stored XSS occurs when the attack payload is stored on the server, such as in a database. The attack affects a victim whenever that stored data is retrieved and rendered in the browser. For example, instead of using a URL query string, an attacker might update their profile page on a social site to include a hidden script in, say, their “About Me” section. The script, improperly stored on the site’s server, would successfully execute at a later time when another user views the attacker’s profile.

One of the most famous examples of this is the Samy worm that all but took over MySpace in 2005. It propagated by sending HTTP requests that replicated it onto a victim’s profile page whenever an infected profile was viewed. Within just 20 hours, it had spread to over a million users.

Reflected XSS similarly occurs when the injected payload travels to the server; however, the malicious code does not end up stored in a database. It is instead immediately returned to the browser by the web application. An attack like this might be executed by luring the victim to click a malicious link that sends a request to the vulnerable website’s server. The server would then send a response to the attacker as well as the victim, which may result in the attacker being able to obtain passwords, or perpetrate actions that appear to originate from the victim.

XSS attack mitigation

In all of these cases, XSS attacks can be mitigated with two key strategies: validating form fields, and avoiding the direct injection of user input on the web page.

Validating form fields

Frameworks can again help us out when it comes to making sure that user-submitted forms are on the up-and-up. One example is Django’s built-in Field classes, which provide fields that validate to some commonly used types and also specify sane defaults. Django’s EmailField, for instance, uses a set of rules to determine if the input provided is a valid email. If the submitted string has characters in it that are not typically present in email addresses, or if it doesn’t imitate the common format of an email address, then Django won’t consider the field valid and the form will not be submitted.

If relying on a framework isn’t an option, we can implement our own input validation. This can be accomplished with a few different techniques, including type conversion, for example, ensuring that a number is of type int(); checking minimum and maximum range values for numbers and lengths for strings; using a pre-defined array of choices that avoids arbitrary input, for example, months of the year; and checking data against strict regular expressions.
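
Here is a minimal Python sketch of those four techniques. The function names, the quantity range, and the username rule are assumptions made for illustration, not rules from any particular framework.

```python
import re

MONTHS = (
    "January", "February", "March", "April", "May", "June",
    "July", "August", "September", "October", "November", "December",
)
# Hypothetical rule: 3 to 20 letters, digits, or underscores.
USERNAME_PATTERN = re.compile(r"[A-Za-z0-9_]{3,20}")


def parse_quantity(raw, minimum=1, maximum=100):
    # Type conversion: int() raises ValueError for text that is not an integer.
    value = int(raw)
    # Range check on the converted number.
    if not minimum <= value <= maximum:
        raise ValueError("quantity out of range")
    return value


def validate_month(raw):
    # Pre-defined choices: anything outside the tuple is rejected.
    if raw not in MONTHS:
        raise ValueError("unknown month")
    return raw


def validate_username(raw):
    # Strict regular expression: fullmatch must cover the entire string.
    if not USERNAME_PATTERN.fullmatch(raw):
        raise ValueError("invalid username")
    return raw
```

Each function either returns a value of a known shape or raises ValueError, so the rest of the application never has to handle raw input directly.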

Thankfully, we needn’t start from scratch. Open source resources are available to help, such as the OWASP Validation Regex Repository, which provides patterns to match against for some common forms of data. Many programming languages offer validation libraries specific to their syntax, and we can find plenty of these on GitHub. Additionally, the XSS Filter Evasion Cheat Sheet has a couple suggestions for test payloads we can use to test our existing applications.

While it may seem tedious, properly implemented input validation can protect our application from being susceptible to XSS.

Avoiding direct injection

Elements of an application that directly return user input to the browser may not, on a casual inspection, be obvious. We can determine areas of our application that may be at risk by exploring a few questions:

How does data flow through our application?
What does a user expect to happen when they interact with this input?
Where on our page does data appear? Does it become embedded in a string or an attribute?

Here are some sample payloads that we can play with in order to test inputs on our site (again, only our own site!) courtesy of Hacker101. The successful execution of any of these samples can indicate a possible XSS vulnerability due to direct injection.

"><h1>test</h1>
'+alert(1)+'
"onmouseover="alert(1)
http://"onmouseover="alert(1)

As a general rule, if you are able to design around directly injecting input, do so. Alternatively, be sure to completely understand the effect of the methods you choose; for example, using innerText instead of innerHTML in JavaScript will ensure that content will be set as plain text instead of (potentially vulnerable) HTML.
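
The same idea applies when HTML is assembled on the server. As a minimal sketch, assuming a hypothetical render_greeting function that builds the heading from the earlier example page, Python's standard html.escape can turn markup characters into entities before the input reaches the page:

```python
import html


def render_greeting(name):
    # html.escape replaces &, <, > and quote characters with HTML entities,
    # so the browser displays the input as text instead of parsing it as markup.
    return '<h1 id="greeting">Hello ' + html.escape(name, quote=True) + "!</h1>"


print(render_greeting("Victoria"))
# A made-up test payload: the tag arrives in the output as harmless text.
print(render_greeting("<img src=x onerror=alert(1)>"))
```

Escaping like this covers input placed between tags. Input that ends up inside attributes, URLs, or script blocks needs escaping suited to that context, which is one more reason to follow where the data appears on the page.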

Pay attention to your inputs

Software developers are at a marked disadvantage when it comes to competing with black hat, or malicious, hackers. For all the work we do to secure each and every input that could potentially compromise our application, an attacker need only find the one we missed. It’s like installing deadbolts on all the doors, but leaving a window open!

By learning to think along the same lines as an attacker, however, we can better prepare our software to stand up against bad actors. Exciting as it may be to ship features as quickly as possible, we’ll avoid racking up a lot of security debt if we take the time beforehand to think through our application’s flow, follow the data, and pay attention to our inputs.

How to set up OpenVPN on AWS EC2 and fix DNS leaks on Ubuntu 18.04 LTS

2019-08-26T09:01:23-04:00

There’s no better way to strive for maximum privacy than a VPN service you control, configure, and maintain yourself. Here’s a step-by-step tutorial for setting up your own OpenVPN on AWS EC2, and how to check for and fix DNS leaks.

For a VPN that also blocks ads and trackers, you can set up a Pi-hole VPN on an AWS Lightsail instance instead.

Set up OpenVPN on AWS EC2

This post will cover how to set up the OpenVPN Access Server product on AWS Marketplace, running on an Amazon EC2 instance. Then, you’ll look at how to fix a known NetworkManager bug in Ubuntu 18.04 that might cause DNS leaks. The whole process should take about fifteen minutes, so grab a ☕ and let’s be configuration superheroes.

Note: IDs and IP addresses shown for demonstration in this tutorial are invalid.

1. Launch the OpenVPN Access Server on AWS Marketplace

The OpenVPN Access Server is available on AWS Marketplace. The Bring Your Own License (BYOL) model doesn’t actually require a license for up to two connected devices; to connect more clients, you can get bundled billing for five, ten, or twenty-five clients, or purchase a minimum of ten OpenVPN licenses a la carte for $15/device/year. For most of us, the two free connected devices will suffice; and if using an EC2 Micro instance, your setup will be AWS Free Tier eligible as well.

Start by clicking Continue to Subscribe for the OpenVPN Access Server, which will bring you to a page that looks like this:

Click Continue to Configuration.

You may notice that the EC2 instance type in the right side bar (and consequently, the Monthly Estimate) isn’t the one you want - that’s okay, you can change it soon. Just ensure that the Region chosen is where you want the instance to be located. Generally, the closer it is to the physical location of your client (your laptop, in this case), the faster your VPN will be. Click Continue to Launch.

On this page, you’ll change three things:

1. The EC2 Instance type

Different types of EC2 (Elastic Compute Cloud) instances will offer you different levels of computing power. If you plan to use your instance for something more than just this VPN, you may want to choose something with higher memory or storage capacity, depending on how you plan to use it. You can view each instance offering on the Amazon EC2 Instance Types page.

For simple VPN use, the t2.nano or t2.micro instances are likely sufficient. Only the Micro instance is Free Tier eligible.

2. The Security Group settings

A Security Group is a profile, or collection of settings, that Amazon uses to control access to your instance. If you’ve set up other AWS products before, you may already have some groups with their own rules defined. You should be careful to understand the reasons for your Security Group settings, as these define how public or private your instance is, and consequently, who has access to it.

If you click Create New Based on Seller Settings, the OpenVPN server defines some recommended settings for a default Security Group.

The default recommended settings are all 0.0.0.0/0 for TCP ports 22, 943, 443, and 945, and UDP port 1194. OpenVPN offers an explanation of how the ports are used on their website. With the default settings, all these ports are left open to support various features of the OpenVPN server. You may wish to restrict access to these ports to a specific IP address or block of addresses (like that of your own ISP) to increase the security of your instance. However, if your IP address frequently changes (like when you travel and connect to a different WiFi network), restricting the ports may not be as helpful as you hope.

In any case, your instance will require SSH keys to connect to, and the OpenVPN server will be password protected. Unless you have other specific security goals, it’s fine to accept the default settings for now.

Let’s give the Security Group a name and brief description, so you know what it’s for. Then click Save.

3. The Key Pair settings

The aforementioned SSH keys are access credentials that you’ll use to connect to your instance. You can create a key pair in this section, or you can choose a key pair you may already be using with AWS.

To create a new set of access credentials, click Create a key pair in EC2 to open a new window. Then, click the Create Key Pair blue button. Once you give your key pair a name, it will be created and the private key will automatically download to your machine. It’s a file ending with the extension .pem. Store this key in a secure place on your computer. You’ll need to refer to it when you connect to your new EC2 instance.

You can return to the previous window to select the key pair you just created. If it doesn’t show up, hit the little “refresh” icon next to the drop-down. Once it’s selected, hit the shiny yellow Launch button.

You should see a message like this:

Great stuff! Now that your instance exists, let’s make sure you can access it and start up your VPN. For a shortcut to the next step, click on the “EC2 Console” link in the success message.

2. Associate an Elastic IP

Amazon’s Elastic IP addresses provide you with a public IPv4 address controlled by your account, unlike the public IP address tied to your EC2 instance. It’s considered a best practice to create one and associate it with your VPN instance. If anything should go wrong with your instance, or if you want to use a new instance for your VPN in the future, the Elastic IP can be disassociated from the current instance and reassociated with your new one. This makes the transition seamless for your connected clients. Think of the Elastic IP like a web domain address that you register - you can point it at whatever you choose.

We can create a new Elastic IP address on the Amazon EC2 Console. If you clicked the link from the success message above, we’re already there.

If you have more than one instance, take note of the Instance ID of the one you’ve just launched.

In the left sidebar under Network & Security, choose Elastic IPs. Then click the blue Allocate new address button.

Choose Amazon Pool, then click Allocate.

Success! Click Close to return to the Elastic IP console.

Now that you have an Elastic IP, let’s associate it with your instance. Select the IP address, then click Actions, and choose Associate address.

Ensure the Instance option is selected, then click the drop-down menu. You should see your EC2 instance ID there. Select it, then click Associate.

Success! Now that you’ll be able to access your VPN instance, let’s get your VPN service up and running.

3. Initialize OpenVPN on the EC2 server

First, you’ll need to connect to the EC2 instance via your terminal. You’ll use the private key you created earlier.

Open a new terminal window and navigate to the directory containing the private key .pem file. You’ll need to set its permissions with:

sudo chmod 400 <name>.pem

Be sure to substitute <name> with the name of your key.

This sets the file permissions to -r-------- so that it can only be read by the user (you). It may help to protect the private key from read and write operations by other users, but more importantly, it keeps SSH from rejecting the key as too openly readable when you try to connect to your instance.
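
If the octal notation is unfamiliar, this small Python sketch shows how mode 400 maps to that permission string. It uses a throwaway temporary file as a stand-in for the private key, so it is safe to run anywhere; the file itself is an assumption for illustration.

```python
import os
import stat
import tempfile

# Stand-in for a private key: a temporary file, so nothing real is touched.
with tempfile.NamedTemporaryFile(suffix=".pem", delete=False) as handle:
    key_path = handle.name

# Same effect as running chmod 400 on the file:
# 4 = read for the owner, 0 = nothing for the group, 0 = nothing for others.
os.chmod(key_path, 0o400)
mode = os.stat(key_path).st_mode

# Symbolic form, like the first column of ls -l.
print(stat.filemode(mode))
# Octal permission bits.
print(oct(stat.S_IMODE(mode)))

os.remove(key_path)
```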

We can now do just that by running:

ssh -i <name>.pem openvpnas@<elastic IP>

The user openvpnas is set up by the OpenVPN Access Server to allow you to connect to your instance. Replace <elastic IP> with the Elastic IP address you just associated.

We may get a message saying that the authenticity of your host can’t be established. As long as you’ve typed the Elastic IP correctly, go ahead and answer yes to the prompt.

Upon the initial connection to the OpenVPN instance, a setup wizard called Initial Configuration Tool should automatically run. (If, for some reason, it doesn’t, or you panic-mashed a button, you can restart it with sudo ovpn-init --ec2.) You’ll be asked to accept the agreement, then the wizard will help to walk you through some configuration settings for your VPN server.

You may generally accept the default settings; however, there are a couple of questions you may like to answer knowledgeably. They are:

Should client traffic be routed by default through the VPN?

Should client DNS traffic be routed by default through the VPN?

These answers depend on your privacy goals for your VPN.

When asked for your OpenVPN-AS license key, you can leave it blank to use the VPN with up to two clients. If you’ve purchased a key, enter it here.

Once the configuration wizard finishes running, you should see the message “Initial Configuration Complete!” Before you move on, you should set a password for your server’s administration account. To do this, run:

sudo passwd openvpn

Then enter your chosen password twice. Now we’re ready to get connected!

To close the SSH connection, type exit.

4. Connect the client to the VPN

To connect your client (in this case, your laptop) to the VPN and start reaping the benefits, you’ll need to do two things; first, obtain your connection profile; second, install the openvpn daemon.

1. Get your `.ovpn` connection profile

You’ll need to download a connection profile; this is like a personal configuration file with information, including keys, that the VPN server will need to allow your connection. You can do this by logging in with the password you just set at your Elastic IP address, port 943. This looks like:

https://<elastic IP>:943/

The https part is important; without it, the instance won’t send any data.

When you go to this URL, you may see a page warning you that this site’s certificate issuer is unknown or invalid. As long as you’ve typed your Elastic IP correctly, it’s safe to proceed. If you’re using Firefox, click Advanced, and then Accept the Risk and Continue. In Chrome, click Advanced, then Proceed to the elastic IP.

Log in with the username openvpn and the password you just set. You’ll now be presented with a link to download your user-locked connection profile:

When you click the link, a file named client.ovpn will download.

2. Install and start `openvpn` on your Ubuntu 18.04 client

The openvpn daemon will allow your client to connect to your VPN server. It can be installed through the default Ubuntu repositories. Run:

sudo apt install openvpn

In order for OpenVPN to automatically start when you boot up your computer, you’ll need to rename and move the connection profile file. I suggest using a symlink to accomplish this, as it leaves your original file more easily accessible for editing, and allows you to store it in any directory you choose. You can create a symlink by running this command in the directory where your file is located:

sudo ln -s "$(pwd)/client.ovpn" /etc/openvpn/<name>.conf

This creates a symbolic link for the connection profile in the appropriate folder for systemd to find it. The $(pwd) prefix gives the link an absolute target, since a relative target would be resolved from /etc/openvpn rather than from your current directory. The <name> can be anything. When the Linux kernel has booted, systemd is used to initialize the services and daemons that the user has set up to run; one of these will now be OpenVPN. Renaming the file with the extension .conf will let the openvpn daemon know to use it as your connection file.

For now, you can manually start and connect to OpenVPN by running:

sudo openvpn --config client.ovpn

You’ll be asked for a username and password, which will be the same credentials you used before. Once the service finishes starting up, you’ll see “Initialization Sequence Complete.” If you now visit the DNS leak test website, you should see the Elastic IP and the location of your EC2 server. Yay!

If you’re on a later version of Ubuntu, you may check for DNS leaks by clicking on one of the test buttons. If all the ISPs shown are Amazon and none are your own service provider’s, congratulations! No leaks! You can move on to Step 3 in the second section below, after which, you’ll be finished.

If you’re using Ubuntu 18.04 LTS, however, we’re not yet done.

What a DNS leak looks like

Sites like the DNS leak test website can help you check your configuration and see if the Internet knows more about your location than you’d like. On the main page you’ll see a big hello, your IP address, and your location, so far as can be determined.

If you have a DNS leak, you can see what it looks like by clicking on one of the test buttons on the DNS leak test page. When you do, you’ll see not only your Amazon.com IP addresses, but also your own ISP and location.

You can also see the leak by running systemd-resolve --status in your terminal. Your results will contain two lines under different interfaces that both have entries for DNS Servers. It’ll look something like this:

Link 7 (tun0)
      Current Scopes: DNS
       LLMNR setting: yes
MulticastDNS setting: no
      DNSSEC setting: no
    DNSSEC supported: no
         DNS Servers: 172.31.0.2
          DNS Domain: ~.

Link 3 (wlp4s0)
      Current Scopes: none
       LLMNR setting: yes
MulticastDNS setting: no
      DNSSEC setting: no
    DNSSEC supported: no
         DNS Servers: 192.168.0.1
          DNS Domain: ~.

The DNS leak problem in Ubuntu 18.04 stems from Ubuntu’s DNS resolver, systemd-resolved, failing to properly handle your OpenVPN configuration. In order to try and be a good, efficient DNS resolver, systemd-resolved will send DNS lookup requests in parallel to each interface that has a DNS server configuration, and then utilize the fastest response. In your case, you only want to use your VPN’s DNS servers. Sorry, systemd-resolved. You tried.
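
If you’d rather not eyeball that output, here is a rough Python sketch that flags any non-VPN interface with its own DNS server. It parses a shortened copy of the sample output above; the function names and the assumption that the VPN interface name starts with tun are mine, and the parser only reads the first server on each DNS Servers line.

```python
import re

# Shortened copy of the sample systemd-resolve --status output shown above.
SAMPLE_STATUS = """\
Link 7 (tun0)
      Current Scopes: DNS
         DNS Servers: 172.31.0.2
          DNS Domain: ~.

Link 3 (wlp4s0)
      Current Scopes: none
         DNS Servers: 192.168.0.1
          DNS Domain: ~.
"""


def dns_servers_by_link(status_text):
    # Map each interface name to the DNS servers listed under it.
    servers = {}
    current = None
    for line in status_text.splitlines():
        header = re.match(r"\s*Link \d+ \((\S+)\)", line)
        if header:
            current = header.group(1)
            servers[current] = []
            continue
        entry = re.match(r"\s*DNS Servers:\s*(\S+)", line)
        if entry and current is not None:
            servers[current].append(entry.group(1))
    return servers


def possible_leaks(status_text, vpn_prefix="tun"):
    # Any non-VPN interface that still has DNS servers may leak lookups.
    return {
        link: found
        for link, found in dns_servers_by_link(status_text).items()
        if found and not link.startswith(vpn_prefix)
    }


print(possible_leaks(SAMPLE_STATUS))
```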

How to fix OpenVPN DNS leak on Ubuntu 18.04

Luckily, there is a fix that you can implement. You’ll need to install a few helpers from the Ubuntu repositories, update your configuration file, then set up OpenVPN using NetworkManager. Let’s do it!

1. Install some helpers

To properly integrate OpenVPN with systemd-resolved, you’ll need a bit more help. In a terminal, run:

sudo apt install -y openvpn-systemd-resolved network-manager-openvpn network-manager-openvpn-gnome

This will install a helper script that integrates OpenVPN and systemd-resolved, a NetworkManager plugin for OpenVPN, and its GUI counterpart for GNOME desktop environment.

2. Add DNS implementation to your connection profile

You’ll need to edit the connection profile file you downloaded earlier. Since it’s symbolically linked, you can accomplish this by changing the .ovpn file, wherever it’s stored. Run vim <name>.ovpn (client.ovpn, if you kept the downloaded name) to open it in Vim, then add the following lines at the bottom. Explanation in the comments:

# Allow OpenVPN to call user-defined scripts
script-security 2
# Tell systemd-resolved to send all DNS queries over the VPN
dhcp-option DOMAIN-ROUTE .

# Use the update-systemd-resolved script when TUN/TAP device is opened,
# and also run the script on restarts and before the TUN/TAP device is closed
up /etc/openvpn/update-systemd-resolved
up-restart
down /etc/openvpn/update-systemd-resolved
down-pre

For the full list of OpenVPN options, see OpenVPN Scripting and Environment Variables. You may also like more information about TUN/TAP.

3. Set up OpenVPN as NetworkManager system connection

Use the GUI to set up your VPN with NetworkManager. Open up Network Settings, which should look something like this:

Then click the plus sign (+) button. On the window that pops up, counterintuitively, choose Import from file… instead of the OpenVPN option.

Navigate to, and then select, your .ovpn file. You should now see something like this:

Add your username and password for the server (openvpn and the password you set in the first section’s Step 3), and your user key password (the same one again, if you’ve followed this tutorial), then click the “Add” button.

4. Edit your OpenVPN NetworkManager configuration

Nearly there! Now that you’ve added the VPN as a NetworkManager connection, you’ll need to make a quick change to it. You can see a list of NetworkManager connections by running:

ls -la /etc/NetworkManager/system-connections/*

The one for your VPN is probably called openvpn, so let’s edit it by running:

sudo vim /etc/NetworkManager/system-connections/openvpn

Under [ipv4], you’ll need to add the line dns-priority=-42. It should end up looking like this:
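
As a rough sketch of the result, this Python snippet applies the same change to a made-up connection file and prints it. The SAMPLE_KEYFILE contents and the set_dns_priority function are assumptions for illustration; a real NetworkManager connection file has many more keys, and editing it by hand in Vim works just as well.

```python
import configparser
import io

# Made-up, heavily shortened connection file for illustration.
SAMPLE_KEYFILE = """\
[connection]
id=openvpn
type=vpn

[ipv4]
method=auto
"""


def set_dns_priority(keyfile_text, priority=-42):
    parser = configparser.ConfigParser(interpolation=None)
    parser.optionxform = str  # keep key names exactly as written
    parser.read_string(keyfile_text)
    if not parser.has_section("ipv4"):
        parser.add_section("ipv4")
    # A negative value makes this connection's DNS servers take priority.
    parser.set("ipv4", "dns-priority", str(priority))
    out = io.StringIO()
    parser.write(out, space_around_delimiters=False)
    return out.getvalue()


updated = set_dns_priority(SAMPLE_KEYFILE)
print(updated)
```

The part that matters is that dns-priority=-42 ends up under the [ipv4] heading and not in another section.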

Setting a negative number is a workaround that prioritizes this DNS server. The actual number is arbitrary (-1 should also work) but I like 42. ¯\_(ツ)_/¯

5. Restart, connect, profit

In a terminal, run:

sudo service network-manager restart

Then in the Network Settings, click the magic button that turns on the VPN:

Finally, visit the DNS leak test website and click on Extended test to verify the fix. If everything’s working properly, you should now see a list containing only your VPN ISP.

And we’re done! Congratulations on rolling your very own VPN server and stopping DNS leaks with OpenVPN. Enjoy surfing in (relative) privacy. Now your only worry at the local coffeeshop is who’s watching you surf from the seat behind you.


How to do twice as much with half the keystrokes using `.bashrc`

2019-08-21T09:17:02-04:00

In my recent post about setting up Ubuntu with Bash scripts, I briefly alluded to the magic of .bashrc. This didn’t really do it justice, so here’s a quick post that offers a bit more detail about what the Bash configuration file can do.

My current configuration hugely improves my workflow, and saves me well over 50% of the keystrokes I would have to employ without it! Let’s look at some examples of aliases, functions, and prompt configurations that can improve our workflow by helping us be more efficient with fewer key presses.

Bash aliases

A smartly written .bashrc can save a whole lot of keystrokes. You can take advantage of this in the literal sense by using bash aliases, or strings that expand to larger commands. For an indicative example, here is a Bash alias for copying files in the terminal:

# Always copy contents of directories (r)ecursively and explain (v) what was done
alias cp='cp -rv'

The alias command defines the string you’ll type, followed by what that string will expand to. You can override existing commands like cp above. On its own, the cp command will only copy files, not directories, and succeeds silently. With this alias, you need not remember to pass those two flags, nor cd to or ls the location of the copied file to confirm that it’s there! Now, just those two key presses (for c and p) will do all of that for us.
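Because an alias like this overrides the original command, it can be handy to check what it expands to, or to skip it for a single invocation. Here is a minimal sketch using only the shell's own alias, command, and unalias builtins; the scratch file is a placeholder for illustration:

```shell
# Define the alias from the example above
alias cp='cp -rv'

# Print the current definition of the alias
alias cp

# Run the real cp once, without the alias flags, by prefixing it with `command`
scratch="$(mktemp)"            # placeholder file, for illustration only
command cp "$scratch" "${scratch}.copy"
rm -f "$scratch" "${scratch}.copy"

# Remove the alias again for the rest of the session
unalias cp
```

If an override ever gets in the way, `command cp` is a quick escape hatch that leaves your .bashrc untouched.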

Here are a few more .bashrc aliases for passing flags with common commands.

# List contents with colors for file types, (A)lmost all hidden files (without . and ..), in (C)olumns, with class indicators (F)
alias ls='ls --color=auto -ACF'
# List contents with colors for file types, (a)ll hidden entries (including . and ..), use (l)ong listing format, with class indicators (F)
alias ll='ls --color=auto -alF'

# Explain (v) what was done when moving a file
alias mv='mv -v'
# Create any non-existent (p)arent directories and explain (v) what was done
alias mkdir='mkdir -pv'
# Always try to (c)ontinue getting a partially-downloaded file
alias wget='wget -c'

Aliases come in handy when you want to avoid typing long commands, too. Here are a few I use when working with Python environments:

alias pym='python3 manage.py'
alias mkenv='python3 -m venv env'
alias startenv='source env/bin/activate && which python3'
alias stopenv='deactivate'

For further inspiration on ways Bash aliases can save time, I highly recommend the examples in this article.

Bash functions

One downside of the aliases above is that they’re rather static - they’ll always expand to exactly the text declared. For a Bash alias that takes arguments, you’ll need to create a function. You can do this like so:

# Show contents of the directory after changing to it
function cd () {
    # Fall back to $HOME so a bare cd still goes home; only list on success
    builtin cd "${1:-$HOME}" && ls -ACF
}

I can’t begin to tally how many times I’ve typed cd and then ls immediately after to see the contents of the directory I’m now in. With this function set up, it all happens with just those two letters! The function takes the first argument, $1, as the location to change directory to, then prints the contents of that directory in nicely formatted columns with file type indicators. The builtin keyword is necessary so that the function calls Bash’s own cd instead of calling itself recursively.

Bash functions are very useful when it comes to downloading or upgrading software, too.

Bash function for downloading extended Hugo

Thanks to the static site generator Hugo’s excellent ship frequency, I previously spent at least a few minutes every couple weeks downloading the new extended version. With a Bash function, I only need to pass in the version number, and the upgrade happens in a few seconds.

# Hugo install or upgrade
function gethugo () {
    wget -q -P tmp/ https://github.com/gohugoio/hugo/releases/download/v"$@"/hugo_extended_"$@"_Linux-64bit.tar.gz
    tar xf tmp/hugo_extended_"$@"_Linux-64bit.tar.gz -C tmp/
    sudo mv -f tmp/hugo /usr/local/bin/
    rm -rf tmp/
    hugo version
}

The $@ notation expands to all the arguments given, taking its spot in the function; since we pass only the version number, that is what gets substituted. To run the above function and download Hugo version 0.57.2, you use the command gethugo 0.57.2.
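Since "$@" stands for every argument, calling gethugo with more than one argument (or none) would build a malformed URL. One possible guard, sketched here with a hypothetical helper named hugo_url, uses "$1" and checks the argument count first. It only builds and prints the download URL used in the function above:

```shell
# Hypothetical helper: print the extended Hugo download URL for one version
hugo_url () {
    # Require exactly one argument, the version number
    if [ "$#" -ne 1 ]; then
        echo "usage: hugo_url VERSION" >&2
        return 1
    fi
    # "$1" is only the first argument, whereas "$@" would be all of them
    printf 'https://github.com/gohugoio/hugo/releases/download/v%s/hugo_extended_%s_Linux-64bit.tar.gz\n' "$1" "$1"
}

hugo_url 0.57.2
```

Printing the URL first is also a cheap way to confirm the version string is right before anything gets downloaded.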

Bash function for downloading a specific Go version

I’ve got one for Golang, too:

function getgolang () {
    sudo rm -rf /usr/local/go
    wget -q -P tmp/ https://dl.google.com/go/go"$@".linux-amd64.tar.gz
    sudo tar -C /usr/local -xzf tmp/go"$@".linux-amd64.tar.gz
    rm -rf tmp/
    go version
}

Bash function for adding a GitLab remote

Or how about a function that adds a remote origin URL for GitLab to the current repository?

function glab () {
    git remote set-url origin --add git@gitlab.com:"$@"/"${PWD##*/}".git
    git remote -v
}

With glab username, you can create a new origin URL for the current Git repository with our username on GitLab.com. Pushing to a new remote URL automatically creates a new private GitLab repository, so this is a useful shortcut for creating backups!
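The ${PWD##*/} expansion in glab strips everything up to and including the last slash from the current path, leaving only the directory name. Here is a small sketch with a hypothetical helper, glab_url, which takes the path as an optional second argument so the result can be previewed without touching any repository:

```shell
# Hypothetical helper: print the GitLab URL that glab would add
# $1 = GitLab username, $2 = path to the repository (defaults to $PWD)
glab_url () {
    dir="${2:-$PWD}"
    # ${dir##*/} removes the longest prefix ending in "/", i.e. the parent path
    printf 'git@gitlab.com:%s/%s.git\n' "$1" "${dir##*/}"
}

glab_url username ~/github/myrepo
```

This prints git@gitlab.com:username/myrepo.git, which is the URL the function would add as a push target.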

Bash functions are really only limited by the possibilities of scripting, which has, practically, few limits. If there’s anything you do on a frequent basis that requires typing a few lines into a terminal, you can probably create a Bash function for it!

Bash prompt

Besides directory contents, it’s also useful to see the full path of the directory we’re in. The Bash prompt can show us this path, along with other useful information like our current Git branch. To make it more readable, you can define colours for each part of the prompt. Here’s how you can set up our prompt in .bashrc to accomplish this:

# Colour codes are cumbersome, so let's name them
txtcyn='\[\e[0;96m\]' # Cyan
txtpur='\[\e[0;35m\]' # Purple
txtwht='\[\e[0;37m\]' # White
txtrst='\[\e[0m\]'    # Text Reset

# Which (C)olour for what part of the prompt?
pathC="${txtcyn}"
gitC="${txtpur}"
pointerC="${txtwht}"
normalC="${txtrst}"

# Get the name of our branch and put parentheses around it
gitBranch() {
    git branch 2> /dev/null | sed -e '/^[^*]/d' -e 's/* \(.*\)/(\1)/'
}

# Build the prompt
export PS1="${pathC}\w ${gitC}\$(gitBranch) ${pointerC}\$${normalC} "

Result:

~/github/myrepo (master) $

Naming the colours helps to easily identify where one colour starts and stops, and where the next one begins. The prompt that you see in our terminal is defined by the string following export PS1, with each component of the prompt set with an escape sequence. Let’s break that down:

\w displays the current working directory,
\$(gitBranch) calls the gitBranch function defined above, which displays the current Git branch,
\$ will display a “$” if you are a normal user or in normal user mode, and a “#” if you are root.
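To see what the sed expressions in gitBranch do without needing a repository, we can feed them some sample text shaped like git branch output. The branch names here are made up:

```shell
# Sample `git branch` output: the current branch is marked with "*"
sample_branches () {
    printf '%s\n' '  develop' '* master' '  feature/prompt'
}

# The first expression deletes lines that do not start with "*";
# the second wraps the remaining branch name in parentheses
sample_branches | sed -e '/^[^*]/d' -e 's/* \(.*\)/(\1)/'
```

This prints (master), the same text that ends up in the prompt.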

The full list of Bash escape sequences can help us display many more bits of information, including even the time and date! Bash prompts are highly customizable and individual, so feel free to set it up any way you please.

Here are a few options that put information front and centre and can help us to work more efficiently.

For the procrastination-averse

Username and current time with seconds, in 24-hour HH:MM:SS format:

userC="${txtpur}" # Assumed colour; userC is not defined in the snippet above
export PS1="${userC}\u ${normalC}at \t >"

user at 09:35:55 >

For those who always like to know where they stand

Full file path on a separate line, and username:

export PS1="${pathC}\w${normalC}\n\u:"

~/github/myrepo
user:

For the minimalist

export PS1=">"

We can build many practical prompts with just the basic escape sequences; once you start to integrate functions with prompts, as in the Git branch example, things can get really complicated. Whether this amount of complication is an addition or a detriment to your productivity, only you can know for sure!

Many fancy Bash prompts are possible with programs readily available with a quick search. I’ve intentionally not provided samples here because, well, if you tend to get as excited about this stuff as I can, it might be a couple hours before you get back to what you were doing before you started reading this post, and I just can’t have that on my conscience. 🥺

We’ve hopefully struck a nice balance now between time invested and usefulness gained from our Bash configuration file! I hope you use your newly-recovered keystroke capacity for good.

How to set up a fresh Ubuntu desktop using only dotfiles and bash scripts

2019-08-19T07:58:18-04:00

One of my favorite things about open source files on GitHub is the ability to see how others do (what some people might call) mundane things, like set up their .bashrc and other dotfiles. While I’m not as enthusiastic about ricing as I was when I first came to the Linux side, I still get pretty excited when I find a config setting that makes things prettier and faster, and thus, better.

I recently came across a few such things, particularly in Tom Hudson’s dotfiles. Tom seems to like to script things, and some of those things include automatically setting up symlinks, and installing Ubuntu repository applications and other programs. This got me thinking. Could I automate the set up of a new machine to replicate my current one?

Being someone generally inclined to take things apart in order to see how they work, I know I’ve messed up my laptop on occasion. (Usually when I’m away from home, and my backup hard drive isn’t.) In those rare but really inconvenient situations when my computer becomes a shell of its former self (ba-dum-ching), it’d be quite nice to have a fast, simple way of putting Humpty Dumpty back together again, just the way I like.

In contrast to creating a disk image and restoring it later, a collection of bash scripts is easier to create, maintain, and move around. They require no special utilities, only an external transportation method. It’s like passing along the recipe, instead of the whole bundt cake. (Mmm, cake.)

Additionally, functionality like this would be super useful when setting up a virtual machine, or VM, or even just a virtual private server, or VPS. (Both of which, now that I write this, would probably make more forgiving targets for my more destructive experiments… live and learn!)

Well, after some grepping and Googling and digging around, I now have a suite of scripts that can do this:

This is the tail end of a test run of the set up scripts on a fresh Ubuntu desktop, loaded off a bootable USB. It had all my programs and settings restored in under three minutes!

This post will cover how to achieve the automatic set up of a computer running Ubuntu Desktop using bash scripts. This exact process was last used on Ubuntu 19.10; see my dotfiles master branch for the latest configuration. The majority of the information covered is applicable to all the Linux desktop flavours, though some syntax may differ. The bash scripts cover three main areas: linking dotfiles, installing software from Ubuntu and elsewhere, and setting up the desktop environment. We’ll cover each of these areas and go over the important bits so that you can begin to craft your own scripts.

Dotfiles

Dotfiles are what most Linux enthusiasts call configuration files. They typically live in the user’s home directory (denoted in bash scripts with the builtin variable $HOME) and control the appearance and behavior of all kinds of programs. The file names begin with ., which denotes hidden files in Linux (hence “dot” files). Here are some common dotfiles and ways in which they’re useful.

`.bashrc`

The .bashrc file is a list of commands executed at startup by interactive, non-login shells. Interactive vs non-interactive shells can be a little confusing, but we needn’t worry about the distinction here. For our purposes, any time you open a new terminal, see a prompt, and can type commands into it, your .bashrc was executed.

Lines in this file can help improve your workflow by creating aliases that reduce keystrokes, or by displaying a helpful prompt with useful information. It can even run user-created programs, like Eddie. For more ideas, you can have a look at my .bashrc file on GitHub.

`.vimrc`

The .vimrc dotfile configures the champion of all text editors, Vim. (If you haven’t yet wielded the powers of the keyboard shortcuts, I highly recommend a fun game to learn Vim with.)

In .vimrc, we can set editor preferences such as display settings, colours, and custom keyboard shortcuts. You can take a look at my .vimrc on GitHub.

Other dotfiles may be useful depending on the programs you use, such as .gitconfig or .tmux.conf. Exploring dotfiles on GitHub is a great way to get a sense of what’s available and useful to you!

Linking dotfiles

We can use a script to create symbolic links, or symlinks, for all our dotfiles. This allows us to keep all the files in a central repository, where they can easily be managed, while also providing a sort of placeholder in the spot where our programs expect the configuration file to be found. This is typically, but not always, the user home directory. For example, since I store my dotfiles on GitHub, I keep them in a directory with a path like ~/github/dotfiles/ while the files themselves are symlinked, resulting in a path like ~/.vimrc.

To programmatically check for and handle any existing files and symlinks, then create new ones, we can use this elegant shell script. I compliment it only because I blatantly stole the core of it from Tom’s setup script, so I can’t take the credit for how lovely it is.

The symlink.sh script works by attempting to create symlinks for each dotfile in our $HOME. It first checks to see if a symlink already exists, or if a regular file or directory with the same name exists. In the former case, the symlink is removed and remade; in the latter, the file or directory is renamed, then the symlink is made.
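The behaviour described above can be sketched roughly as follows. This is not the actual symlink.sh; the function name link_dotfiles, its arguments, and the .old suffix used for renamed files are all assumptions made for illustration:

```shell
# Sketch only, not the author's symlink.sh
# Usage: link_dotfiles SOURCE_DIR TARGET_DIR NAME...
link_dotfiles () {
    src="$1"
    dest="$2"
    shift 2
    for name in "$@"; do
        if [ -L "${dest}/${name}" ]; then
            # A symlink already exists: remove it so it can be remade
            rm "${dest}/${name}"
        elif [ -e "${dest}/${name}" ]; then
            # A regular file or directory exists: rename it out of the way
            # (the .old suffix is an assumption)
            mv "${dest}/${name}" "${dest}/${name}.old"
        fi
        # Create the symlink pointing back at the central copy
        ln -s "${src}/${name}" "${dest}/${name}"
    done
}

# Example call (not run here):
# link_dotfiles "$HOME/github/dotfiles" "$HOME" .bashrc .vimrc
```

Checking for a symlink first matters, because a test for existence alone would miss a symlink whose target has gone missing.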

Installing software

One of the beautiful things about exploring shell scripts is discovering how much can be achieved using only the command line. As someone whose first exposure to computers was through a graphical operating system, I find working in the terminal to be refreshingly fast.

With Ubuntu, most programs we likely require are available through the default Ubuntu software repositories. We typically search for these with the command apt search <package-name> and install them with sudo apt install <package-name>. Some software we’d like may not be in the default repositories, or may not be offered there in the most current version. In these cases, we can still install these programs in Ubuntu using a PPA, or Personal Package Archive. We’ll just have to be careful that the PPAs we choose are from the official sources.

If a program we’d like doesn’t appear in the default repositories or doesn’t seem to have a PPA, we may still be able to install it via command line. A quick search for “<program name> installation command line” should get some answers.

Since bash scripts are just a collection of commands that we could run individually in the terminal, creating a script to install all our desired programs is as straightforward as putting all the commands into a script file. I chose to organize my installation scripts between the default repositories, which are installed by my aptinstall.sh script, and programs that involve external sources, handled with my programs.sh script.

Setting up the desktop environment

On the recent occasions when I’ve gotten a fresh desktop (intentionally or otherwise) I always seem to forget how long it takes to remember, find, and then change all the desktop environment settings. Keyboard shortcuts, workspaces, sound settings, night mode… it adds up!

Thankfully, all these settings have to be stored somewhere in a non-graphical format, which means that if we can discover how that’s done, we can likely find a way to easily manipulate the settings with a bash script. Lo and behold the terminal command, gsettings list-recursively.

There are a heck of a lot of settings for the GNOME desktop environment. We can make the list easier to scroll through (if, like me, you’re sometimes the type of person to say “Just let me look at everything and figure out what I want!”) by piping to less: gsettings list-recursively | less. Alternatively, if we have an inkling as to what we might be looking for, we can use grep: gsettings list-recursively | grep 'keyboard'.

We can manipulate our settings with the gsettings set command. It can sometimes be difficult to find the syntax for the setting we want, so when we’re first building our script, I recommend using the GUI to make the changes, then finding the gsettings line we changed and recording its value.
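One way to find the line that changed, sketched here as an assumption rather than the method used in desktop.sh, is to save the output of gsettings list-recursively before and after making a change in the GUI, then print only the lines that are new. The helper name changed_settings and the snapshot file names are hypothetical:

```shell
# Hypothetical helper: print lines in the "after" snapshot missing from "before"
# -F fixed strings, -x whole-line matches, -v invert, -f read patterns from file
changed_settings () {
    grep -Fxv -f "$1" "$2"
}

# Intended use (not run here):
# gsettings list-recursively > before.txt
# ...change a setting in the GUI...
# gsettings list-recursively > after.txt
# changed_settings before.txt after.txt
```

Each line it prints holds a schema, a key, and the new value, which are the three arguments gsettings set expects.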

For some inspiration, you can view my desktop.sh settings script on GitHub.

Putting it all together

Having modular scripts (one for symlinks, two for installing programs, another for desktop settings) is useful for both keeping things organized and for being able to run some but not all of the automated set up. For instance, if I were to set up a VPS in which I only use the command line, I wouldn’t need to bother with installing graphical programs or desktop settings.

In cases where I do want to run all the scripts, however, doing so one-by-one is a little tedious. Thankfully, since bash scripts can themselves be run by terminal commands, we can simply write another master script to run them all!

Here’s my master script to handle the set up of a new Ubuntu desktop machine:

#!/bin/bash

./symlink.sh
./aptinstall.sh
./programs.sh
./desktop.sh

## Get all upgrades
sudo apt upgrade -y

## See our bash changes
source ~/.bashrc

## Fun hello
figlet "... and we're back!" | lolcat

I threw in the upgrade line for good measure. It will make sure that the programs installed on our fresh desktop have the latest updates. Now a simple, single bash command will take care of everything!

You may have noticed that, while our desktop now looks and runs familiarly, these scripts don’t cover one very important area: our files. Hopefully, you have a backup method for those that involves some form of reliable external hardware. If not, and if you tend to put your work in external repository hosts like GitHub or GitLab, I do have a way to automatically clone and back up your GitHub repositories with bash one-liners.

Relying on external repository hosts doesn’t offer 100% coverage, however. Files that you wouldn’t put in an externally hosted repository (private or otherwise) consequently can’t be pulled. Git-ignored objects that can’t be generated from included files, like private keys and secrets, will not be recreated. Those files, however, are likely small enough that you could fit a whole bunch on a couple of encrypted USB flash drives (and if you don’t have private key backups, maybe you ought to do that first?).

That said, I hope this post has given you at least some inspiration as to how dotfiles and bash scripts can help to automate setting up a fresh desktop. If you come up with some settings you find useful, please help others discover them by sharing your dotfiles, too!

How to write Bash one-liners for cloning and managing GitHub and GitLab repositories

2019-08-06T10:55:19-04:00

Few things are more satisfying to me than one elegant line of Bash that automates hours of tedious work. As part of some recent explorations into automatically re-creating my laptop with Bash scripts, I wanted to find a way to easily clone my GitHub-hosted repositories to a new machine. After a bit of digging around, I wrote a one-liner that did just that. Then, in the spirit of not putting all our eggs in the same basket, I wrote another one-liner to automatically create and push to GitLab-hosted backups as well. Here they are.

A Bash one-liner to clone all your GitHub repositories

Caveat: you’ll need a list of the GitHub repositories you want to clone. The good thing about that is it gives you full agency to choose just the repositories you want on your machine, instead of going in whole-hog.

You can easily clone GitHub repositories without entering your password each time by using HTTPS with your 15-minute cached credentials or, my preferred method, by connecting to GitHub with SSH. For brevity I’ll assume we’re going with the latter, and our SSH keys are set up.

Given a list of GitHub URLs in the file gh-repos.txt, like this:

git@github.com:username/first-repository.git
git@github.com:username/second-repository.git
git@github.com:username/third-repository.git

We run:

xargs -n1 git clone < gh-repos.txt

This clones all the repositories on the list into the current folder. This same one-liner works for GitLab repositories as well, if you substitute the appropriate URLs.

What’s going on here

There are two halves to this one-liner: the input, counterintuitively on the right side, and the part that makes stuff happen, on the left. We could make the order of these parts more intuitive (maybe?) by writing the same command like this:

<gh-repos.txt xargs -n1 git clone

To run a command for each line of our input, gh-repos.txt, we use xargs -n1. The tool xargs reads items from standard input and runs the command we give it with those items as arguments (it will echo them if we don’t give it a command). By default, it treats items as separated by whitespace; newlines also work and make our list easier to read. The flag -n1 tells xargs to use at most one argument, or in our case, one line, per command. We build our command with git clone, which xargs then executes for each line. Ta-da.
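Before cloning anything, it can be reassuring to preview the commands xargs would run. A minimal sketch: putting echo in front of git clone prints each command instead of executing it. The helper name preview_clones is made up for illustration:

```shell
# Hypothetical helper: print the git clone command for each line of a list file
preview_clones () {
    xargs -n1 echo git clone < "$1"
}

# Example call (not run here):
# preview_clones gh-repos.txt
```

Once the printed commands look right, dropping the echo runs them for real.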

A Bash one-liner to create and push many repositories on GitLab

GitLab, unlike GitHub, lets us do this nifty thing where we don’t have to use the website to make a new repository first. We can create a new GitLab repository from our terminal. The newly created repository defaults to being set as Private, so if we want to make it Public on GitLab, we’ll have to do that manually later.

The GitLab docs tell us to push to create a new project using git push --set-upstream, but I don’t find this to be very convenient for using GitLab as a backup. As I work with my repositories in the future, I’d like to run one command that pushes to both GitHub and GitLab without additional effort on my part.

To make this Bash one-liner work, we’ll also need a list of repository URLs for GitLab (ones that don’t exist yet). We can easily do this by copying our GitHub repository list, opening it up with Vim, and doing a search-and-replace:

cp gh-repos.txt gl-repos.txt
vim gl-repos.txt
:%s/\<github\>/gitlab/g
:wq

This produces gl-repos.txt, which looks like:

git@gitlab.com:username/first-repository.git
git@gitlab.com:username/second-repository.git
git@gitlab.com:username/third-repository.git
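If you’d rather not open an editor, the same substitution can be done non-interactively. Here is a sketch using sed, with a hypothetical helper named to_gitlab_urls; it assumes every line uses the git@github.com: SSH format shown above:

```shell
# Hypothetical helper: print a GitHub SSH URL list with the host swapped to GitLab
to_gitlab_urls () {
    sed 's/^git@github\.com:/git@gitlab.com:/' "$1"
}

# Example call (not run here):
# to_gitlab_urls gh-repos.txt > gl-repos.txt
```

Anchoring the pattern to the start of the line keeps a repository that happens to have “github” in its name from being renamed by accident.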

We can create these repositories on GitLab, add the URLs as remotes, and push our code to the new repositories by running:

awk -F'\/|(\.git)' '{system("cd ~/FULL/PATH/" $2 " && git remote set-url origin --add " $0 " && git push")}' gl-repos.txt

Hang tight and I’ll explain it; for now, take note that ~/FULL/PATH/ should be the full path to the directory containing our GitHub repositories.

We do have to make note of a couple assumptions:

The name of the directory on your local machine that contains the repository is the same as the name of the repository in the URL (this will be the case if it was cloned with the one-liner above);
Each repository is currently checked out to the branch you want pushed, i.e. master.

The one-liner could be expanded to handle these assumptions, but it is the humble opinion of the author that at that point, we really ought to be writing a Bash script.

What’s going on here

Our Bash one-liner uses each line (or URL) in the gl-repos.txt file as input. With awk, it splits off the name of the directory containing the repository on our local machine, and uses these pieces of information to build our larger command. If we were to print the output of awk, we’d see:

cd ~/FULL/PATH/first-repository && git remote set-url origin --add git@gitlab.com:username/first-repository.git && git push
cd ~/FULL/PATH/second-repository && git remote set-url origin --add git@gitlab.com:username/second-repository.git && git push
cd ~/FULL/PATH/third-repository && git remote set-url origin --add git@gitlab.com:username/third-repository.git && git push

Let’s look at how we build this command.

Splitting strings with `awk`

The tool awk can split input based on field separators. The default separator is a whitespace character, but we can change this by passing the -F flag. Besides single characters, we can also use a regular expression field separator. Since our repository URLs have a set format, we can grab the repository names by asking for the substring between the slash character / and the end of the URL, .git.

One way to accomplish this is with our regex \/|(\.git):

\/ is an escaped / character;
| means “or”, telling awk to match either expression;
(\.git) is the group at the end of our URL that matches “.git”, with an escaped . character. This is a bit of a cheat, as “.git” isn’t strictly splitting anything (there’s nothing on the other side) but it’s an easy way for us to take this bit off.

Once we’ve told awk where to split, we can grab the right substring with the field operator. We refer to our fields with a $ character, then by the field’s column number. In our example, we want the second field, $2. Here’s what all the substrings look like:

1: git@gitlab.com:username
2: first-repository

To use the whole string, or in our case, the whole URL, we use the field operator $0. To write the command, we just substitute the field operators for the repository name and URL. Running this with print as we’re building it can help to make sure we’ve got all the spaces right.

awk -F'\/|(\.git)' '{print "cd ~/FULL/PATH/" $2 " && git remote set-url origin --add " $0 " && git push"}' gl-repos.txt
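To check the field splitting on its own, we can print just the second field for a single URL. In this sketch, the hypothetical helper repo_name writes the separator with a bracket expression, [.]git, rather than backslash escapes. That way it doesn’t depend on how a particular awk handles escapes in the -F string, and for URLs in this format the fields come out the same:

```shell
# Hypothetical helper: print the repository name from an SSH-style Git URL
repo_name () {
    # Split on "/" or ".git" and print the second field
    printf '%s\n' "$1" | awk -F'/|[.]git' '{print $2}'
}

repo_name 'git@gitlab.com:username/first-repository.git'
```

This prints first-repository, the value that gets substituted for $2 in the larger command.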

Running the command

We build our command inside the parentheses of system(). By using this as the output of awk, each command will run as soon as it is built and output. The system() function creates a child process that executes our command, then returns once the command is completed. In plain English, this lets us perform the Git commands on each repository, one-by-one, without breaking from our main process in which awk is doing things with our input file. Here’s our final command again, all put together.

awk -F'\/|(\.git)' '{system("cd ~/FULL/PATH/" $2 " && git remote set-url origin --add " $0 " && git push")}' gl-repos.txt

Using our backups

By adding the GitLab URLs as remotes, we’ve simplified the process of pushing to both externally hosted repositories. If we run git remote -v in one of our repository directories, we’ll see:

origin  git@github.com:username/first-repository.git (fetch)
origin  git@github.com:username/first-repository.git (push)
origin  git@gitlab.com:username/first-repository.git (push)

Now, simply running git push without arguments will push the current branch to both remote repositories.

We should also note that git pull will generally only try to pull from the remote repository you originally cloned from (the URL marked (fetch) in our example above). Pulling from multiple Git repositories at the same time is possible, but complicated, and beyond the scope of this post. Here’s an explanation of pushing and pulling to multiple remotes to help get you started, if you’re curious. The Git documentation on remotes may also be helpful.

To elaborate on the succinctness of Bash one-liners

Bash one-liners, when understood, can be fun and handy shortcuts. At the very least, being aware of tools like xargs and awk can help to automate and alleviate a lot of tediousness in our work. However, there are some downsides.

In terms of an easy-to-understand, maintainable, and approachable tool, Bash one-liners suck. They’re usually more complicated to write than a Bash script using if or while loops, and certainly more complicated to read. It’s likely that when we write them, we’ll miss a single quote or closing parenthesis somewhere; and as I hope this post demonstrates, they can take quite a bit of explaining, too. So why use them?

Imagine reading a recipe for baking a cake, step by step. You understand the methods and ingredients, and gather your supplies. Then, as you think about it, you begin to realize that if you just throw all the ingredients at the oven in precisely the right order, a cake will instantly materialize. You try it, and it works!

That would be pretty satisfying, wouldn’t it?

A quick guide to changing your GitHub username

2019-07-28T15:19:13-04:00

This being the 2,38947234th and probably last time I’ll change my username, (marriage is permanent, right?) I thought I’d better write a quick post on how this transition can be achieved as smoothly as possible. You can read official instructions on how to change your GitHub username here, and they will tell you how to do it and what happens. The following is a quick guide to some things to consider afterwards.

Where to make changes

Change username in GitHub account settings.
If using GitHub Pages, change name of your “username.github.io” repository.
If using other services that point to your “username.github.io” repository address, update them.
If using Netlify, you may want to sign in and reconnect your repositories. (Mine still worked, but due to a possibly unrelated issue, I’m not positive.)
Sign in to Travis CI and other integrations (find them in your repository Settings tab -> Integrations & services). This will update your username there.
Update your local files and repository links with very carefully executed find and sed commands, and push back changes to GitHub.
Redeploy any websites you may have with your updated GitHub link.
Fix any links around the web to your profile, your repositories, or Gists you may have shared.

Local file updates

Here are some suggestions for strings to search and replace your username in.

github.com/username (References to your GitHub page in READMEs or in website copy)
username.github.io (Links to your GitHub Page)
git@github.com:username (Git config remote ssh urls)
travis-ci.com/username (Travis badges in READMEs)
shields.io/github/.../username (Shields badges in READMEs, types include contributors, stars, tags, and more)

You can quickly identify where the above strings are located using this command for each string:

grep -rnw -e 'foobar'

This will recursively (r) search all files under the current directory for whole-word (w) matches of the pattern (e) provided, and prefix results with the line numbers (n) so you can easily find them.

Using find and sed can make these changes much faster. See this article on search and replace.
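As a rough sketch of that kind of bulk replacement, the helper below (replace_string is a made-up name) lists the files containing a string and rewrites them in place with sed. It picks the files with grep -l rather than find, so only files that actually contain the string are touched. It assumes GNU grep and GNU sed (for --exclude-dir and -i), and that neither string contains a | character. Try it on a copy or a clean Git working tree first:

```shell
# Hypothetical helper: replace one string with another in every file under a directory
# $1 = old string, $2 = new string, $3 = directory to search
replace_string () {
    old="$1"
    new="$2"
    dir="$3"
    # List files containing the old string, skipping Git's internal files
    grep -rlF --exclude-dir=.git -e "$old" "$dir" | while IFS= read -r file; do
        # Note: sed treats "." in the old string as "any character"
        sed -i "s|${old}|${new}|g" "$file"
    done
}

# Example call (not run here):
# replace_string 'github.com/oldname' 'github.com/newname' ~/github/myrepo
```

Because .git is skipped, remote URLs still need updating separately, for example with git remote set-url.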

Enjoy your new handle! (I hope it sticks.)

Two ways to deploy a public GitHub Pages site from a private Hugo repository

2019-04-22T10:05:15-04:00

Tools like Travis CI and Netlify offer some pretty nifty features, like seamlessly deploying your GitHub Pages site when changes are pushed to its repository. Along with a static site generator like Hugo, keeping a blog up to date is pretty painless.

I’ve used Hugo to build my site for years, but until this past week I’d never hooked up my Pages repository to any deployment service. Why? Because using a tool that built my site before deploying it seemed to require having the whole recipe in one place - and if you’re using GitHub Pages with the free version of GitHub, that place is public. That means that all my three-in-the-morning bright ideas and messy unfinished (and unfunny) drafts would be publicly available - and no amount of continuous convenience was going to convince me to do that.

So I kept things separated, with Hugo’s messy behind-the-scenes stuff in a local Git repository, and the generated public/ folder pushing to my GitHub Pages remote repository. Each time I wanted to deploy my site, I’d have to get on my laptop and hugo to build my site, then cd public/ && git add . && git commit… etc etc. And all was well, except for the nagging feeling that there was a better way to do this.

I wrote another article a little while back about using GitHub and Working Copy to make changes to my repositories on my iPad whenever I’m out and about. It seemed off to me that I could do everything except deploy my site from my iPad, so I set out to change that.

A couple three-in-the-morning bright ideas and a revoked access token later (oops), I now have not one but two ways to deploy to my public GitHub Pages repository from an entirely separated, private GitHub repository. In this post, I’ll take you through achieving this with Travis CI or using Netlify and Make.

There’s nothing hackish about it - my public GitHub Pages repository still looks the same as it does when I pushed to it locally from my terminal. Only now, I’m able to take advantage of a couple great deployment tools to have the site update whenever I push to my private repo, whether I’m on my laptop or out and about with my iPad.

#YouDidNotPushFromThere

This article assumes you have working knowledge of Git and GitHub Pages. If not, you may like to spin off some browser tabs from my articles on using GitHub and Working Copy and building a site with Hugo and GitHub Pages first.

Let’s do it!

Private-to-public GitHub Pages deployment with Travis CI

Travis CI has the built-in ability (♪) to deploy to GitHub Pages following a successful build. They do a decent job in the docs of explaining how to add this feature, especially if you’ve used Travis CI before… which I haven’t. Don’t worry, I did the bulk of the figuring-things-out for you.

Travis CI gets all its instructions from a configuration file in the root of your repository called .travis.yml.
You need to provide a GitHub personal access token as a secure encrypted variable, which you can generate using travis on the command line.
Once your script successfully finishes doing what you’ve told it to do (not necessarily what you want it to do but that’s a whole other blog post), Travis will deploy your build directory to a repository you can specify with the repo configuration variable.

Setting up the Travis configuration file

Create a new configuration file for Travis with the filename .travis.yml (note the leading “.”). These scripts are very customizable and I struggled to find a relevant example to use as a starting point - luckily, you don’t have that problem!

Here’s my basic .travis.yml:

git:
  depth: false

env:
  global:
    - HUGO_VERSION="0.54.0"
  matrix:
    - YOUR_ENCRYPTED_VARIABLE

install:
  - wget -q https://github.com/gohugoio/hugo/releases/download/v${HUGO_VERSION}/hugo_${HUGO_VERSION}_Linux-64bit.tar.gz
  - tar xf hugo_${HUGO_VERSION}_Linux-64bit.tar.gz
  - mv hugo ~/bin/

script:
  - hugo --gc --minify

deploy:
  provider: pages
  skip-cleanup: true
  github-token: $GITHUB_TOKEN
  keep-history: true
  local-dir: public
  repo: gh-username/gh-username.github.io
  target-branch: master
  verbose: true
  on:
    branch: master

This script downloads and installs Hugo, builds the site with the garbage collection and minify flags, then deploys the public/ directory to the specified repo - in this example, your public GitHub Pages repository. You can read about each of the deploy configuration options here.

To add the GitHub personal access token as an encrypted variable, you don’t need to manually edit your .travis.yml. The travis gem commands below will encrypt and add the variable for you when you run them in your repository directory.

First, install travis with sudo gem install travis.

Then generate your GitHub personal access token, copy it (it only shows up once!) and run the commands below in your repository root, substituting your token for the kisses:

travis login --pro --github-token xxxxxxxxxxxxxxxxxxxxxxxxxxx
travis encrypt GITHUB_TOKEN=xxxxxxxxxxxxxxxxxxxxxxxxxxx --add env.matrix

Your encrypted token magically appears in the file. Once you’ve committed .travis.yml to your private Hugo repository, Travis CI will run the script and if the build succeeds, will deploy your site to your public GitHub Pages repo. Magic!

Travis will always run a build each time you push to your private repository. If you don’t want to trigger this behavior with a particular commit, add [skip ci] to your commit message.

Yo that’s cool but I like Netlify.

Okay fine.

Deploying to a separate repository with Netlify and Make

We can get Netlify to do our bidding by using a Makefile, which we’ll run with Netlify’s build command.

Here’s what our Makefile looks like:

SHELL:=/bin/bash
BASEDIR=$(CURDIR)
OUTPUTDIR=public

.PHONY: all
all: clean get_repository build deploy

.PHONY: clean
clean:
	@echo "Removing public directory"
	rm -rf $(BASEDIR)/$(OUTPUTDIR)

.PHONY: get_repository
get_repository:
	@echo "Getting public repository"
	git clone https://github.com/gh-username/gh-username.github.io.git public

.PHONY: build
build:
	@echo "Generating site"
	hugo --gc --minify

.PHONY: deploy
deploy:
	@echo "Preparing commit"
	@cd $(OUTPUTDIR) \
	&& git config user.email "you@youremail.com" \
	&& git config user.name "Your Name" \
	&& git add . \
	&& git status \
	&& git commit -m "Deploy via Makefile" \
	&& git push -f -q https://$(GITHUB_TOKEN)@github.com/gh-username/gh-username.github.io.git master

	@echo "Pushed to remote"

To preserve the Git history of our separate GitHub Pages repository, we’ll first clone it, build our new Hugo site to it, and then push it back to the Pages repository. This script first removes any existing public/ folder that might contain files or a Git history. It then clones our Pages repository to public/, builds our Hugo site (essentially updating the files in public/), then takes care of committing the new site to the Pages repository.

In the deploy section, you’ll notice lines starting with &&. These are chained commands. Since Make invokes a new sub-shell for each line, it starts over with every new line from our root directory. To get our cd to stick and avoid running our Git commands in the project root directory, we’re chaining the commands and using the backslash character to break long lines for readability.

By chaining our commands, we’re able to configure our Git identity, add all our updated files, and create a commit for our Pages repository.
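The sub-shell behavior is easy to reproduce outside of Make. This is only a rough sketch and not part of the Makefile: each sh -c call stands in for one recipe line, and the temporary directory with its public folder is a placeholder made up for the demo.

```shell
# Throwaway directory standing in for the project root (placeholder).
project_root=$(mktemp -d)
mkdir "$project_root/public"
cd "$project_root"

# Two separate shells, like two separate recipe lines:
# the cd happens in the first shell and is forgotten by the second.
sh -c 'cd public'
separate_dir=$(sh -c 'basename "$(pwd)"')

# One shell running a chained command, like lines joined with && and \:
# the cd sticks for everything that follows it.
chained_dir=$(sh -c 'cd public && basename "$(pwd)"')

echo "separate lines end up in: $separate_dir"
echo "chained line ends up in:  $chained_dir"
```

Only the chained version reports public, which is why the Git commands in the deploy target are joined into a single logical line.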

Similarly to using Travis CI, we’ll need to pass in a GitHub personal access token to push to our public GitHub Pages repository - only Netlify doesn’t provide a straightforward way to encrypt the token in our Makefile.

Instead, we’ll use Netlify’s Build Environment Variables, which live safely in our site settings in the Netlify app. We can then call our token variable in the Makefile. We use it to push (quietly, to avoid printing the token in logs) to our Pages repository by passing it in the remote URL.

To avoid printing the token in Netlify’s logs, we suppress recipe echoing for that line with the leading @ character.

With your Makefile in the root of your private GitHub repository, you can set up Netlify to run it for you.

Setting up Netlify

Getting set up with Netlify via the web UI is straightforward. Once you sign in with GitHub, choose the private GitHub repository where your Hugo site lives. The next page Netlify takes you to lets you enter deploy settings:

You can specify the build command that will run your Makefile (make all for this example). The branch to deploy and the publish directory don’t matter too much in our specific case, since we’re only concerned with pushing to a separate repository. You can enter the typical master deploy branch and public publish directory.

Under “Advanced build settings” click “New variable” to add your GitHub personal access token as a Build Environment Variable. In our example, the variable name is GITHUB_TOKEN. Click “Deploy site” to make the magic happen.

If you’ve already set up your repository with Netlify, find the settings for Continuous Deployment under Settings > Build & deploy.

Netlify will build your site each time you push to the private repository. If you don’t want a particular commit to trigger a build, add [skip ci] in your Git commit message.

Same same but different

One effect of using Netlify this way is that your site will be built in two places: one is the separate, public GitHub Pages repository that the Makefile pushes to, and the other is your Netlify site that deploys on their CDN from your linked private GitHub repository. The latter is useful if you’re going to play with Deploy Previews and other Netlify features, but those are outside the scope of this post.

The main point is that your GitHub Pages site is now updated in your public repo. Yay!

Go forth and deploy fearlessly

I hope the effect of this new information is that you feel more able to update your sites, wherever you happen to be. The possibilities are endless - at home on your couch with your laptop, out cafe-hopping with your iPad, or in the middle of a first date on your phone. Endless!

Don’t do stuff on your phone when you’re on a date. Not if you want a second one, anyway.

A remote sync solution for iOS and Linux: Git and Working Copy

2019-03-15T11:55:28-04:00

I’m always looking for pockets of time in which I can be productive. If you add up the minutes you spend in limbo while waiting in line, commuting, or waiting for food delivery (just me?), you may just find an extra hour or two in your day.

To take full advantage of these bits of time, I needed a solution that let me pick up work on my Git repositories wherever I happen to be. That means a remote sync solution that bridges my iOS devices (iPad and iPhone) and my Linux machine.

After a lot of trial and error, I’ve found one that works really well. With synced Git repositories on iOS, I can seamlessly pick up work for any of my repositories on the go.

Components

Working Copy app ($15.99 one-time pro-unlock and well worth it)
iA Writer app ($8.99 one-time purchase for iOS, also available on Mac, Windows, and Android)
GitHub repositories

Get set up

Here are the setup steps I’ll walk you through in this article.

Create your remote repository
Clone repository to iOS with Working Copy
Open and edit files with iA Writer
Push changes back to remote
Pull changes from repository on your computer

This system is straightforward to set up whether you’re a command line whiz or just getting into Git. Let’s do it!

Create your remote repository

Create a public or private repository on GitHub.

If you’re creating a new repository, you can follow GitHub’s instructions to push some files to it from your computer, or you can add files later from your iOS device.

Clone repository to iOS with Working Copy

Download Working Copy from the App Store. It’s a fantastic app. Developer Anders Borum has a steady track record of frequent updates and incorporating the latest features for iOS apps, like drag and drop on iPad. I think he’s fairly priced his product in light of the work he puts into maintaining and enhancing it.

In Working Copy, find the gear icon in the top left corner and touch to open Settings.

Tap on SSH Keys, and you’ll see this screen:

SSH keys, or Secure Shell keys, are access credentials used in the SSH protocol. Your key is a password that your device will use to securely connect with your remote repository host - GitHub, in this example. Since anyone with your SSH keys can potentially pretend to be you and gain access to your files, it’s important not to share them accidentally, like in a screenshot on a blog post.

Tap on the second line that looks like WorkingCopy@iPad-xxxxxxxx to get this screen:

Working Copy supports easy connection to GitHub. Tap Connect With GitHub to bring up some familiar sign-in screens that will authorize Working Copy to access your account(s).

Once connected, tap the + symbol in the top right of the side bar to add a new repository. Choose Clone repository to bring up this screen:

Here, you can either manually input the remote URL, or simply choose from the list of repositories that Working Copy fetches from your connected account. When you make your choice, the app clones the repository to your device and it will show up in the sidebar. You’re connected!

Open and edit files with iA Writer

One of the (many) reasons I adore iA Writer is its ability to select your freshly cloned remote repository as a Library Location. To enable this, first open your Files app. On the Browse screen, tap the overflow menu (three dots) in the top right and choose Edit.

Turn on Working Copy as a location option:

Then in the iA Writer app:

From the main Library list, in the top right of the sidebar, tap Edit.
Tap Add Location….
A helpful popup appears. Tap OK.
From the Working Copy location, tap Select in the top right, then choose the repository folder.
Tap Open, then Done.

Your remote repository now appears as a Location in the sidebar. Tap on it to work within this directory.

While inside this location, new files you create (by tapping the pencil-and-paper icon in the top right corner) will be saved to this folder locally. As you work, iA Writer automatically saves your progress. Next, we’ll look at pushing those files and changes back to your remote.

Push changes back to remote

Once you’ve made changes to your files, open Working Copy again. You should see a yellow dot on your changed repository.

Tap on your repository name, then on Repository Status and Configuration at the top of the sidebar. Your changed files will be indicated by yellow dots or green + symbols. These mean that you’ve modified or added files, respectively.

Working Copy is a sweet iOS Git client, and you can tap on your files to see additional information including a comparison of changes (“diff”) as well as status and Git history. You can even edit files right within the app, with syntax highlighting for its many supported languages. For now, we’ll look at how to push your changed work to your remote repository.

On the Repository Status and Configuration page, you’ll see right at the top that there are changes to be committed. If you’re new to Git, this is like “saving your changes” to your Git history, something typically done with the terminal command git commit. You can think of this as saving the files that we’ll want to send to the GitHub repository. Tap Commit changes.

Enter your commit message, and select the files you want to add. Toggle the Push switch to send everything to your remote repository when you commit the files. Then tap Commit.

You’ll see a progress bar as your files are uploaded, and then a confirmation message on the status screen.

Congratulations! Your changes are now present in your remote repository on GitHub. You’ve successfully synced your files remotely!

Pull changes from repository on your computer

To bring your updated files full circle to your computer, you pull them from the GitHub repository. I prefer to use the terminal for this as it’s quick and easy, but GitHub also offers a graphical client if terminal commands seem a little alien for now.

If you started with the GitHub repository, you can clone it to a folder on your computer by following these instructions.
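For the terminal route, the round trip looks roughly like the sketch below. To keep it self-contained, a local bare repository stands in for GitHub and a second clone stands in for the iOS device; the folder names, file name, and identity are all placeholders. With a real repository you would only need the git clone and git pull lines, using the clone URL that GitHub shows you.

```shell
# A local bare repository stands in for the GitHub remote (demo assumption).
sandbox=$(mktemp -d)
git init -q --bare "$sandbox/remote.git"

# Stand-in for the iOS device: push a first file to the remote.
git clone -q "$sandbox/remote.git" "$sandbox/ipad" 2>/dev/null
cd "$sandbox/ipad"
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"
echo "First draft" > notes.md
git add notes.md
git commit -q -m "Add notes"
git push -q origin HEAD

# On the computer: clone the repository to a folder once...
git clone -q "$sandbox/remote.git" "$sandbox/computer"

# ...then, after more changes arrive on the remote (still from the "iPad")...
echo "Edited on the go" >> notes.md
git commit -q -a -m "Update notes"
git push -q origin HEAD

# ...pull them down to bring the computer's copy up to date.
cd "$sandbox/computer"
git pull -q
cat notes.md
```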

Staying in sync

When you update your work on your computer, you’ll use Git to push your changes to the remote repository. To do this, you can use GitHub’s graphical client, or follow these instructions.

On your iOS device, Working Copy makes pulling and pushing as simple as a single tap. On the Repository Status and Configuration page, tap on the remote name under Remotes.

Then tap Synchronize. Working Copy will take care of the details of pushing your committed changes and/or pulling any new changes it finds from the remote repository.

Work anywhere

For a Git-based developer and work-anywhere-aholic like me, this setup couldn’t be more convenient. Working Copy really makes staying in sync with my remote repositories seamless, never mind the ability to work with any of my GitHub repos on the go.

I most recently used this setup to get some writing done while hanging out in the atrium of Washington DC’s National Portrait Gallery, which is pleasantly photogenic.

Happy working!

On Doing Great Things

2019-03-08T18:36:15-05:00

It’s International Women’s Day, and I’m thinking about Grace Hopper.

Grace Hopper was an amazing lady who did great things. She envisioned and helped create programming languages that translate English terms into machine code. She persevered in her intention to join the US Navy from the time she was rejected at 34 years old, to being sworn in to the US Navy Reserve three years later, to retiring with the rank of commander at age 60… then was recalled (twice) and promoted to the rank of captain at the age of 67. She advocated for distributed networks and developed computer testing standards we use today, among other achievements too numerous to list here.

By my read, throughout her life, she kept her focus on her work. She did great things because she could do them, and felt some duty to do them. Her work speaks for itself.

I recently came across a sizeable rock denoting a rather small, quiet park. It looks like this:

When I first saw this park, I thought it in no way did this great lady justice. But upon some reflection, its lack of assumption and grandeur grew on me. And today, it drew to the forefront something that’s been on my mind.

I try and contribute regularly to the wide world of technology, usually through building things, writing, and mentorship. I sometimes get asked to participate in female-focused tech events. I hear things like, “too few developers are women,” or “we need more women in blockchain,” or “we need more female coders.”

For some time I haven’t been sure how to respond, because while my answer isn’t “yes,” it’s not exactly “no,” either. It’s really, “no, because…” and it’s because I’m afraid. I’m afraid of misrepresenting myself, my values, and my goals.

Discrimination and racism are real things. They exist in the minds and attitudes of a very small percentage of very loud people, as they always will. These people aren’t, however, the majority. They are small.

I think that on the infrequent occasions when we encounter these people, we should do our best to lead by example. We should have open minds, tell our stories, listen to theirs. Try and learn something. That’s all.

When I present myself, I don’t point out that I’m a woman. I don’t align myself with “women in tech” or seek to represent them. I don’t go to women-only meetings or support organizations that discriminate against men, or anyone at all. It’s not because I’m insecure as a woman, or ashamed that I’m a woman, or some other inflammatory adjective that lately shows up in conjunction with being female. It’s because I’ve no reason to point out my gender, any more than needing to point out that my hair is black, or that I’m short. It’s obvious and simultaneously irrelevant.

When I identify with a group, I talk about the go-getters who wake up at 0500 every day and go work out—no matter the weather, or whether they feel like it. I tell stories about the people I’ve met in different countries around the world, who left home, struck out on their own, and had an adventure, because they saw value in the experience. I identify with people who constantly build things, try things, design and make things, and then share those things with the world, because they love to do so. This is how I see myself. This is what matters to me.

Like the unassuming park named after an amazing woman, when truly great things are done, they are done relatively quietly. Not done for the fanfare of announcing them to the world, but for the love of the thing itself. So go do great things, please. The world still needs them.

Building Code Quality Culture Through Commit Standards

2018-08-06T08:54:56-04:00

When I first started leading engineering teams, I thought high-quality code was about efficient algorithms and architecture. I was wrong. The biggest indicator of a team’s engineering maturity shows up in their commit history.

A clean commit history reveals a team that thinks about maintainability, communicates context effectively, and takes pride in their craft. Messy commits signal the opposite: rushed work, poor communication habits, and a culture that prioritizes shipping over sustainability (guaranteed to make the descent harder than the climb). As an engineering leader, establishing commit standards builds the foundation for everything else you want to achieve.

The Cost of Poor Commit Culture

I’ve seen this common pattern especially often in the post-startup phase. Issues that should have been a 30-minute investigation stretch into hours due to useless commit messages like “fix stuff,” “updates,” or “refactor”—messages that make it impossible to understand the intent behind each change. One of the least thoughtful comments I’ve ever heard on the subject went along the lines of, “Capable engineers should just be able to read the code and understand the change, we don’t need good commit messages.” Right. Have fun explaining to the director that a straightforward bug fix requires days of reading lines of code because the team allows lazy commits.

It’s arguable that the real cost isn’t even the lost revenue—it’s the erosion of trust. Teams with lazy commits start questioning each other’s work quality, code reviews become adversarial, and velocity plummets as a result.

Useful commit standards create a culture with:

Context preservation - Future team members (including your future self) can understand not just what changed, but why
Accountability - Engineers take ownership of their changes and think through the impact
Knowledge transfer - Institutional knowledge doesn’t walk out the door when someone leaves
Debugging efficiency - When things break, you can quickly trace the source and reasoning

Poor commit habits compound over time. What starts as a small productivity tax becomes a massive technical debt that slows down everything your team tries to accomplish.

Making Standards Stick: The Template Approach

The biggest challenge with commit standards isn’t defining them—it’s getting your team to actually follow them consistently. I’ve seen too many teams create detailed commit guidelines that gather dust in a README file while engineers continue writing “fixed stuff” messages.

The solution is to make good practices easier than bad ones. Instead of expecting people to remember complex guidelines under deadline pressure, embed the standards directly into the workflow.

Here’s the team commit template I’ve successfully rolled out across multiple organizations:

## If applied, this commit will...
## [Add/Fix/Remove/Update/Refactor/Document] [issue #id] [summary]


## Why is it necessary? (Bug fix, feature, improvements?)
-
## How does the change address the issue?
-
## What side effects does this change have?
-

To implement this across your team, save the template as ~/.gitmessage and add this command to your onboarding process:

git config --global commit.template ~/.gitmessage
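As a minimal sketch of the full setup, the commands below write the template to a file and point Git at it. To avoid touching anyone's real configuration, this version uses a throwaway repository and a repository-local setting; the temporary path is a placeholder, and in practice you would use ~/.gitmessage with --global as shown above.

```shell
# Throwaway repository so the sketch does not touch your global Git config.
repo=$(mktemp -d)
cd "$repo"
git init -q

# Save the template to a file. When the message is edited in an editor,
# Git strips lines starting with # from the final message by default.
cat > "$repo/.gitmessage" <<'EOF'
## If applied, this commit will...
## [Add/Fix/Remove/Update/Refactor/Document] [issue #id] [summary]


## Why is it necessary? (Bug fix, feature, improvements?)
-
## How does the change address the issue?
-
## What side effects does this change have?
-
EOF

# Point this repository at the template (use --global to cover every repository).
git config commit.template "$repo/.gitmessage"

# Confirm which template Git will load when the commit editor opens.
git config --get commit.template
```

After this, running git commit without -m opens the editor pre-filled with the template.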

The template serves multiple purposes beyond just formatting. It forces engineers to think through the “why” behind their changes, which often reveals edge cases or better approaches before the code ever reaches review. I’ve watched junior engineers discover design flaws simply by trying to articulate their commit message.

More importantly, it creates consistency without feeling like micromanagement. Engineers appreciate having a framework with a short feedback loop rather than being told their commit messages are “wrong” after the fact.

Connecting Work to Business Impact

One pattern I’ve noticed across high-performing teams is how they connect individual commits to larger business objectives. Beyond linking to issue numbers, this creates traceability from business requirements to implementation details.

When commits reference issues consistently, several things happen:

Product managers can track feature progress without constantly asking for updates
Support teams can quickly identify which changes might relate to customer issues
Security audits become straightforward when you need to trace the history of sensitive code
Technical debt discussions become data-driven when you can quantify how much time is spent on maintenance vs. features

Teams can use this traceability to make compelling cases for technical investments. When every bug fix commit links back to customer-reported issues, the cost of poor code quality becomes visible to leadership in a way that resonates.
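When issue references are consistent, plain Git is enough to pull this kind of data out of the history. The sketch below assumes subjects that follow the template above; the repository, issue numbers, and commit messages are invented for the demo.

```shell
# Throwaway repository with a few template-style commits (all made up).
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"

# Empty commits keep the demo short; only the messages matter here.
git commit -q --allow-empty -m "Fix #42 login timeout on slow connections"
git commit -q --allow-empty -m "Add #57 CSV export for reports"
git commit -q --allow-empty -m "Fix #42 retry logic for login requests"

# Every commit that references issue 42.
git log --oneline --grep='#42'

# A rough maintenance-vs-feature split based on the leading verb.
fix_count=$(git rev-list --count --grep='^Fix ' HEAD)
add_count=$(git rev-list --count --grep='^Add ' HEAD)
echo "fixes: $fix_count, additions: $add_count"
```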

Removing Friction Through Tooling

The most effective way to improve team habits is to make the desired behavior the easiest behavior. Beyond templates, consider how your development environment can reinforce good practices.

For teams using VS Code, I recommend setting up workspace configurations that include spell check and line wrapping for commit messages. This prevents the common problem of commit messages that are impossible to read in terminal displays.

More importantly, consider integrating commit quality into your CI/CD pipeline. Tools like commitlint can automatically validate commit message format, while pre-commit hooks can catch obvious issues before they reach the remote repository.

The goal is to provide immediate feedback when standards aren’t met, rather than discovering problems during code review when fixing them is more disruptive to workflow.
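To make the hook idea concrete, here is a minimal sketch of a commit-msg hook in plain shell. It is not commitlint, and the rule it enforces (a subject that starts with one of the verbs from the template above) is only an assumed example of a team convention. The repository, file, and messages are placeholders.

```shell
# Throwaway repository for the demonstration.
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"

# Git runs .git/hooks/commit-msg with the path to the message file as $1
# and aborts the commit if the hook exits non-zero.
mkdir -p .git/hooks
cat > .git/hooks/commit-msg <<'EOF'
#!/bin/sh
subject=$(head -n 1 "$1")
case "$subject" in
  Add\ *|Fix\ *|Remove\ *|Update\ *|Refactor\ *|Document\ *) exit 0 ;;
esac
echo "Commit subject must start with Add, Fix, Remove, Update, Refactor, or Document." >&2
exit 1
EOF
chmod +x .git/hooks/commit-msg

echo "hello" > README.md
git add README.md

# A vague message is rejected by the hook...
if git commit -q -m "fix stuff" 2>/dev/null; then
  rejected="no"
else
  rejected="yes"
fi

# ...while a message that follows the convention is accepted.
git commit -q -m "Add README with project overview"
echo "vague message rejected: $rejected"
```

Hooks in .git/hooks are local to each clone, so a team would still need some way to distribute a hook like this, or a CI check that applies the same rule.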

Teaching Atomic Commits Through Code Review

One of the most valuable lessons I learned as an engineering manager is that teaching atomic commits—one logical change per commit—dramatically improves code review quality and team collaboration.

When engineers make atomic commits, several things happen naturally:

Code reviews become faster because each commit tells a clear story
Debugging becomes surgical because you can isolate exactly which logical change introduced a problem
Feature rollbacks become safe when you can revert a specific piece of functionality without touching unrelated code
Knowledge transfer improves because the commit history becomes a tutorial of how the system evolved

The challenge is that atomic commits require more upfront thinking. Engineers need to plan their approach before writing code, which feels slower initially but pays massive dividends in team velocity over time.

I’ve found the most effective way to teach this is through code review feedback that focuses on commit structure, not just code quality. When I see a pull request with one massive commit containing three different features, I ask the engineer to break it down and explain the reasoning for each piece.
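Breaking down an oversized commit can look something like the sketch below. It assumes the commit has not been shared yet and that the unrelated changes live in separate files; the repository, file names, and messages are invented for the demo.

```shell
# Throwaway repository with one oversized commit (all names are made up).
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"

echo "# Project" > README.md
git add README.md
git commit -q -m "Add README"

# One commit containing two unrelated changes.
echo "export feature" > export.txt
echo "login fix" > login.txt
git add export.txt login.txt
git commit -q -m "updates"

# Undo the commit but keep its changes in the working tree...
git reset -q HEAD~1

# ...then commit each logical change on its own.
git add export.txt
git commit -q -m "Add CSV export"
git add login.txt
git commit -q -m "Fix login timeout"

git log --oneline
```

When unrelated changes share a file, git add -p lets you stage individual hunks interactively instead of whole files.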

Setting Team Expectations for Commit Cleanup

Here’s where leadership philosophy matters more than technical mechanics. Some teams insist on pristine linear history, while others prefer to preserve the full context of how work actually happened, including false starts and iterations.

I’ve found the most success with a middle path: require clean, atomic commits for the main branch, but allow messy work-in-progress commits on feature branches. This gives engineers the freedom to commit frequently while working (which improves backup and collaboration) while ensuring the permanent history tells a clear story.

The key is establishing this expectation early and consistently. During code reviews, I focus on commit structure as much as code quality. A well-structured commit history often indicates clear thinking about the problem space.

For teams new to this practice, I recommend starting with a simple squash on the feature branch:

git reset --soft origin/master
git commit

This approach takes multiple messy commits and combines them into one clean commit before merging to main. It’s forgiving for engineers still learning atomic commit habits while maintaining clean project history.
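One caveat with the reset above: if origin/master has moved on since the branch was created, the squashed commit keeps the branch's files exactly as they are, which would undo the newer upstream changes. A variation that avoids this resets to the point where the branch forked instead. The sketch below assumes a throwaway repository, with a local branch named trunk standing in for origin/master; those names and the commit messages are made up for the demo.

```shell
# Throwaway repository; "trunk" stands in for origin/master (demo assumption).
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"

echo "v1" > app.txt
git add app.txt
git commit -q -m "Add app skeleton"
git branch trunk

# A feature branch with a few messy work-in-progress commits.
git checkout -q -b feature
for step in 1 2 3; do
  echo "wip $step" >> app.txt
  git commit -q -a -m "wip $step"
done

# Find the commit where the feature branch left the trunk,
# move the branch back to it, and keep all changes staged.
base=$(git merge-base HEAD trunk)
git reset --soft "$base"

# Record everything as one clean commit (hypothetical message).
git commit -q -m "Add feature X"

git log --oneline
```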

Building Confidence Through Practice

One concern I often hear from engineering managers is that requiring clean commits will slow down their team. In my experience, the opposite is true—but only after an initial learning period where engineers build confidence with git operations.

The most effective approach I’ve found is pairing experienced engineers with those still learning git hygiene. When someone sees a colleague quickly reorganize commits using interactive rebase, it demystifies the process and builds confidence.

For selective commit cleanup, I teach this pattern:

git reset --soft HEAD~5
git commit -m "New message for the combined commit"

This approach lets engineers combine the last few commits while preserving earlier work that was already well-structured. It’s particularly useful for cleaning up the “fix typo” and “address code review feedback” commits that naturally accumulate during development.

The key is making this feel like a normal part of the development process, not a burdensome extra step. I’ve found success by incorporating commit cleanup time into sprint planning and explicitly discussing it during retrospectives.

When to Invest in Advanced Git Skills

Interactive rebase is where engineering teams often get stuck. It’s powerful enough to completely reorganize commit history, but complex enough that many engineers avoid it entirely. As a leader, you need to decide whether this level of git sophistication is worth the investment for your team.

I’ve found that teams working on critical infrastructure or open source projects benefit significantly from advanced git skills. The ability to craft a well-structured commit history pays dividends when you’re debugging production issues or when external contributors need to understand your codebase.

For most product teams, however, I recommend focusing on simpler patterns that achieve 80% of the benefit with 20% of the complexity. Interactive rebase can be intimidating, and I’d rather have consistent, good-enough commits than inconsistent attempts at perfection.

That said, having at least one team member comfortable with complex git operations is valuable. They become the “git expert” who can help others when commits get tangled, and they can teach advanced techniques during pair programming sessions.

The key is matching your git standards to your team’s maturity and project needs. A startup moving fast might prioritize different things than a team maintaining financial systems.

Encouraging Experimentation Through Safety Nets

One of the biggest barriers to adopting better git practices is fear of making mistakes. Engineers worry that attempting to clean up their commits will result in lost work or broken history. Git stash becomes invaluable as both a technical tool and a confidence builder.

I encourage teams to use git stash liberally when learning new git techniques. It creates a safety net that makes experimentation feel safe:

git stash  # Save current work
# Try some git operations
git stash pop  # Restore work if needed

This pattern is particularly useful when teaching engineers to clean up commits before submitting pull requests. They can stash their current work, experiment with interactive rebase or commit squashing, and easily recover if something goes wrong.
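It helps to be clear about what each safety net covers: git stash protects uncommitted changes, while a temporary branch bookmarks the commits themselves so a squash or rebase can be undone. The sketch below combines the two. It is an illustration only; the repository, the branch name backup/before-cleanup, and the messages are invented.

```shell
# Throwaway repository with some commits and some uncommitted work.
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"

echo "first" > notes.txt
git add notes.txt
git commit -q -m "Add notes"
echo "second" >> notes.txt
git commit -q -a -m "Update notes with second line"
echo "third" >> notes.txt
git commit -q -a -m "fix typo"
echo "uncommitted idea" >> notes.txt

git stash push -q                    # Save uncommitted work
git branch backup/before-cleanup     # Bookmark the commits as they are now
original_tip=$(git rev-parse HEAD)

# Experiment: squash the last two commits into one.
git reset -q --soft HEAD~2
git commit -q -m "Update notes"

# Not happy with the result? Put the branch back exactly where it was...
git reset -q --hard backup/before-cleanup
# ...and restore the uncommitted work.
git stash pop -q

# Remove the bookmark once it is no longer needed.
git branch -D backup/before-cleanup
```

If the experiment turns out well, skip the hard reset and just pop the stash and delete the backup branch.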

Beyond the technical benefits, stash encourages a more exploratory mindset around git. Engineers who feel comfortable experimenting with different approaches often develop better intuition for structuring their commits in the first place.

I’ve also found that teams with good stash habits tend to have fewer “work in progress” commits cluttering their history. When engineers know they can easily save and restore work, they’re more likely to commit only when they’ve reached a logical checkpoint.

Creating Accountability Through Release Markers

Tags serve a purpose beyond marking releases—they create natural checkpoints for reflecting on code quality and team practices. When teams establish a regular tagging cadence, it forces conversations about what constitutes a release-worthy state.

I’ve found that teams with good tagging habits naturally develop better commit discipline. Knowing that commits will be part of a tagged release creates a sense of permanence that encourages more thoughtful commit messages and cleaner history.

The process of creating a release tag often reveals quality issues that might otherwise slip through:

git tag -a v1.2.0 -m "Release: Enhanced user authentication"
git push --follow-tags

When someone has to write a release message that summarizes the changes since the last tag, poorly structured commits become obvious. This creates a feedback loop that naturally improves commit quality over time.
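A quick way to gather the raw material for that release message is to list the commit subjects since the previous tag. The sketch below is a minimal example in a throwaway repository; the earlier tag v1.1.0 and the commit messages are invented for the demo.

```shell
# Throwaway repository with one earlier release and two newer commits.
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.email "you@example.com"   # placeholder identity
git config user.name "Your Name"

# Empty commits keep the demo short; only the messages matter here.
git commit -q --allow-empty -m "Add login form"
git tag -a v1.1.0 -m "Release: Login form"
git commit -q --allow-empty -m "Add password reset flow"
git commit -q --allow-empty -m "Fix session expiry check"

# The most recent tag reachable from the current commit.
last_tag=$(git describe --tags --abbrev=0)

# Commit subjects since that tag, formatted as a bullet list.
git log --format='- %s' "$last_tag"..HEAD

# Tag the new release once the summary is written.
git tag -a v1.2.0 -m "Release: Enhanced user authentication"
```

If the resulting list reads like "wip", "updates", "fix", that is the signal the paragraph above describes.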

Tags also enable powerful debugging workflows. When production issues arise, being able to quickly identify which release introduced a problem can dramatically reduce time to resolution. This capability becomes especially valuable as teams scale and the commit volume increases.

More importantly, tags create opportunities for celebration. Teams that regularly tag releases can look back at their progress and feel genuine accomplishment. This positive reinforcement helps sustain good commit habits even when deadlines pressure teams to cut corners.

Building Lasting Culture Change

Establishing commit standards is ultimately about building a culture that values craftsmanship and communication. The technical practices matter, but the underlying mindset matters more.

The most successful transformations I’ve led started with making the case for why commit quality matters to the team’s goals. When engineers understand that better commits lead to faster debugging, easier code reviews, and more effective knowledge transfer, they become invested in improvement rather than resistant to new rules.

Implementation should be gradual and supportive rather than punitive. Start with commit message templates and basic guidelines. Celebrate improvements publicly during retrospectives. Use code review as a teaching opportunity rather than a barrier mechanism.

Most importantly, lead by example. When team members see you taking time to craft thoughtful commit messages and clean up your own commit history, it signals that these practices are genuinely valued rather than just bureaucratic overhead.

The payoff extends far beyond git hygiene. Teams that develop discipline around commit quality often improve in other areas too: code review thoroughness, documentation habits, and general attention to craft. These practices compound over time to create engineering cultures that can scale effectively and maintain high quality even under pressure.

Building this kind of culture takes patience and consistency, but the investment pays dividends in team velocity, code quality, and job satisfaction for years to come.

An automatic interactive pre-commit checklist, in the style of infomercials

2018-07-23T09:38:09-04:00

What’s that, you say? You’ve become tired of regular old boring paper checklists? Well, my friend, today is your lucky day! You, yes, you, can become the proud owner of a brand-spanking-new automatic interactive pre-commit hook checklist! You’re gonna love this! Your life will be so much easier! Just wait until your friends see you.

What’s a pre-commit hook

Did you know that nearly 1 out of 5 coders are too embarrassed to ask this question? Don’t worry, it’s perfectly normal. In the next 60 seconds we’ll tell you all you need to know to pre-commit with confidence.

Git hooks are a feature of Git that trigger custom scripts at useful moments. They can be used for all kinds of reasons to help you automate your work, and best of all, you already have them! In every repository that you initialize with git init, you’ll have a set of example scripts living in .git/hooks. They all end with .sample and activating them is as easy as renaming the file to remove the .sample part.

Git hooks are not copied when a repository is cloned, so you can make them as personal as you like.

The useful moment in particular that we’ll talk about today is the pre-commit. This hook is run after you do git commit, and before you write a commit message. Exiting this hook with a non-zero status will abort the commit, which makes it extremely useful for last-minute quality checks. Or, a bit of fun. Why not both!

How do I get a pre-commit checklist

I only want the best for my family and my commits, and that’s why I choose an interactive pre-commit checklist. Not only is it fun to use, it helps to keep my projects safe from unexpected off-spec mistakes!

It’s so easy! I just write a bash script that can read user input, and plop it into .git/hooks as a file named pre-commit. Then I do chmod +x .git/hooks/pre-commit to make it executable, and I’m done!

Oh look, here comes an example bash script now!

#!/bin/bash

echo "Would you like to play a game?"

# Read user input, assign stdin to keyboard
exec < /dev/tty

while read -p "Have you double checked that only relevant files were added? (Y/n) " yn; do
    case $yn in
        [Yy] ) break;;
        [Nn] ) echo "Please ensure the right files were added!"; exit 1;;
        * ) echo "Please answer y (yes) or n (no):" && continue;
    esac
done
while read -p "Has the documentation been updated? (Y/n) " yn; do
    case $yn in
        [Yy] ) break;;
        [Nn] ) echo "Please add or update the docs!"; exit 1;;
        * ) echo "Please answer y (yes) or n (no):" && continue;
    esac
done
while read -p "Do you know which issue or PR numbers to reference? (Y/n) " yn; do
    case $yn in
        [Yy] ) break;;
        [Nn] ) echo "Better go check those tracking numbers!"; exit 1;;
        * ) echo "Please answer y (yes) or n (no):" && continue;
    esac
done

exec <&-

Take my money

Don’t delay! Take advantage right now of this generous one-time offer! An interactive pre-commit hook checklist can be yours, today, for the low, low price of… free? Wait, who wrote this script?

Building High-Performance Engineering Teams Through Feedback Loops

2018-07-02T10:08:41-04:00

The highest-performing engineering teams share one critical characteristic: they’ve mastered rapid feedback loops. While many organizations talk about continuous improvement, few implement the systematic feedback mechanisms that make it possible.

The difference between teams that ship quality software consistently and those that struggle with technical debt and missed deadlines comes down to how quickly they can observe, learn, and adjust their approach. As an engineering leader, your job isn’t to be the source of all feedback—it’s to build systems that enable your team to continuously improve themselves.

The Engineering Leadership OODA Loop

United States Air Force Colonel John Boyd developed the concept of the OODA loop, OODA being an acronym for observe, orient, decide, act. While originally designed for military strategy, this framework translates perfectly to engineering team leadership:

Observe: Gather data about team performance, code quality, delivery metrics, and team dynamics
Orient: Analyze this information in the context of team goals, organizational constraints, and previous experience
Decide: Choose specific interventions or changes to improve team performance
Act: Implement these changes and measure their impact

The power of the OODA loop for engineering leaders is in its emphasis on speed. Teams that can observe problems, orient around solutions, decide on actions, and act quickly will consistently outperform teams with slower feedback cycles. I’ve seen engineering teams transform their delivery speed and quality by implementing systematic OODA loops at multiple levels: individual developer growth, code review processes, sprint retrospectives, and quarterly team health assessments.

High-Performance Team Feedback Systems

The most effective engineering teams I’ve led implement feedback loops at multiple time scales. Here’s what a comprehensive feedback system looks like:

Daily feedback (hours):

Morning standup with updates on blockers and progress
Real-time pair programming and code review
Continuous integration feedback from automated tests
End-of-day team sync on tomorrow’s priorities

Weekly feedback (days):

Sprint planning and backlog refinement
Code quality metrics review
Technical debt assessment
Team velocity and burndown analysis

Monthly feedback (weeks):

Sprint retrospectives with actions for improvements
Team health and satisfaction surveys
Architecture and technical direction discussions
Individual growth and career development conversations

Quarterly feedback (months):

Team performance against organizational goals
Process effectiveness and tooling evaluation
Long-term technical strategy adjustments
Team composition and skill gap analysis

Each feedback loop serves a different purpose and operates at a different time scale. Your job as a leader is to ensure all these loops are functioning and feeding information up and down the hierarchy.

Building Team Feedback Culture

Implementing effective feedback loops requires intentional leadership and a systematic approach. Here’s the framework I use to build high-performance engineering teams:

Define clear, measurable team objectives
Create transparent planning and prioritization processes
Implement automation that provides rapid feedback
Build code review culture that accelerates learning
Set up regular process retrospectives
Close the loop: act on feedback systematically

Each of these components reinforces the others, creating a self-improving system where the team becomes increasingly effective at identifying problems and implementing solutions.

1. Define Clear, Measurable Team Objectives

Effective feedback loops require clear success criteria. Without concrete objectives, your team will struggle to know whether their improvements are actually working. As an engineering leader, you need to translate business goals into specific, measurable engineering outcomes.

Technical objectives: Delivery commitments with specific scope and timelines, quality metrics (bug rates, test coverage, performance benchmarks), and technical debt reduction goals with measurable impact
Process objectives: Sprint velocity and predictability targets, code review turnaround time improvements, and deployment frequency and reliability goals
Team health objectives: Individual skill development milestones, team satisfaction and engagement metrics, and knowledge sharing and documentation goals

Make these objectives visible and regularly review progress using dashboards, team ceremonies, and one-on-one conversations. Treat objectives as hypotheses to test, not contracts to fulfill at all costs. When feedback indicates an objective is no longer relevant or achievable, adjust it.

2. Create Transparent Planning and Prioritization Processes

High-performance teams excel at breaking down complex objectives into manageable, measurable work streams. This decomposition serves two purposes: it makes work achievable and it creates multiple feedback points where the team can course-correct.

Epic level (quarterly goals): Large initiatives that deliver significant business value, typically spanning 2-3 sprints. Example: “Implement real-time collaboration features”
Story level (sprint goals): Deliverable features that can be completed within a sprint. Example: “Users can see live cursor positions of other editors”
Task level (daily progress): Specific implementation work that can be completed in 1-2 days. Example: “Implement WebSocket connection handling for cursor events”

Create feedback loops at each level:

Daily standups surface task-level blockers and progress
Sprint reviews evaluate story completion and quality
Quarterly planning assesses epic success and organizational alignment

Teams perform best when planning is collaborative, transparent, and regularly revisited. Use tools like story mapping sessions, planning poker, and retrospective-driven backlog refinement to ensure the whole team understands and contributes to prioritization decisions. Treat plans as living documents that adjust quickly when feedback indicates priorities should shift.

3. Implement Automation That Provides Rapid Feedback

Automation is critical for high-performance teams because it accelerates feedback loops and eliminates sources of inconsistency and error. Automation creates systems that provide immediate, reliable information about code quality and system health.

Immediate feedback (seconds to minutes): Pre-commit hooks that run tests, IDE integrations, and linting tools that enforce consistent standards
Short-term feedback (minutes to hours): Continuous integration pipelines, automated security scanning, performance regression testing, and automated deployment to staging environments
Medium-term feedback (hours to days): Automated monitoring and alerting for production systems, code quality metrics tracking, and performance monitoring alerts

The key principle is “shift left”: catch problems as early as possible in the development cycle when they’re cheaper and easier to fix. Start by documenting manual processes that the team repeats regularly, then prioritize automation based on frequency of use and the consequences of human error. The automation itself becomes a team learning exercise and creates shared ownership of the development process.

4. Build Code Review Culture That Accelerates Learning

Code review is one of the most powerful feedback mechanisms available to engineering teams, but only when implemented as a learning and collaboration tool. High-performance teams use code review to accelerate knowledge transfer, maintain quality standards, and continuously improve their collective skills.

Establish clear expectations: Code review is required for all changes, should focus on code quality (not personal preferences), and both author and reviewer are responsible for the final quality
Optimize for speed and quality: Target 24-hour turnaround time for initial review feedback, use automated tools to catch style issues, and provide specific, actionable feedback with examples
Make reviews educational: Encourage questions and explanations in review comments, share alternative approaches and best practices, and rotate reviewers to spread knowledge across the team
Measure and improve: Track review turnaround time and iteration cycles, monitor review feedback patterns to identify training opportunities, and regularly discuss review process effectiveness in retrospectives

Create a culture where developers look forward to code review because they know they’ll learn something and improve the overall codebase quality. When done well, code review becomes one of your most effective tools for maintaining technical standards and building team expertise.

Team Code Review Standards

Here’s the code review checklist I use with engineering teams. Adapt it collaboratively with your team to ensure buy-in and relevance to your specific context:

# Team Code Review Standards

**Functionality & Requirements**

- [ ] Implementation matches acceptance criteria and specifications
- [ ] Edge cases and error conditions are properly handled
- [ ] Changes are complete and don't break existing functionality
- [ ] Performance impact has been considered and tested

**Code Quality & Maintainability**

- [ ] Code is readable and well-structured
- [ ] Variable and function names clearly express intent
- [ ] Complex logic is documented with comments
- [ ] Code follows team style guidelines and patterns
- [ ] No duplicate code or overly complex functions

**Testing & Reliability**

- [ ] Appropriate tests are included and pass
- [ ] Test coverage meets team standards
- [ ] Manual testing has been performed where applicable
- [ ] Changes don't introduce security vulnerabilities

**Team Collaboration**

- [ ] Pull request description clearly explains the change
- [ ] Related documentation has been updated
- [ ] Breaking changes are clearly communicated
- [ ] Knowledge sharing opportunities have been identified

Make this checklist a living document that evolves based on team retrospectives and lessons learned. Regularly review and update the standards based on what issues you’re catching (or missing) in production.

5. Set Up Regular Process Retrospectives

Process retrospectives are where teams close the feedback loop by systematically improving how they work. High-performance teams treat retrospectives as their most important ceremony because it’s where all other improvements originate. The most effective retrospectives happen at multiple cadences:

Sprint retrospectives (every 1-2 weeks): Focus on immediate process improvements and team dynamics using formats like “Start, Stop, Continue”
Quarterly team health reviews: Deeper dive into team effectiveness, skill development, and strategic alignment with quantitative analysis of delivery metrics
Post-incident reviews: Blameless analysis of production issues focusing on system improvements rather than individual accountability
Process optimization sessions: Dedicated time to review and improve specific workflows like deployment, testing, or code review processes

Effective Retrospective Framework

Here are the key questions I use to guide productive team retrospectives:

Team performance review: How did we perform against our objectives? What factors contributed to successes? What blockers slowed us down? How effectively did our processes support our goals?
Process effectiveness analysis: Which practices are working well? What processes are creating friction or waste? Where are we spending time on work that doesn’t create value? What automation would have the biggest impact?
Team health and growth: How well are we collaborating and communicating? What skills or knowledge gaps are limiting our effectiveness? Are team members feeling challenged and supported in their growth?
Forward-looking improvements: What are the top 2-3 experiments we want to try next period? How will we measure success of these changes? What obstacles do we anticipate and how can we prepare for them?

Make retrospectives action-oriented. Every retrospective should end with specific commitments about what the team will try differently, who will own those changes, and how success will be measured.

6. Close the Loop: Act on Feedback Systematically

The most critical step in building high-performance teams is ensuring that feedback actually drives change. Many teams collect feedback but fail to systematically implement improvements. This is where engineering leadership makes the biggest difference.

Make changes visible and trackable: Document all process experiments and improvements in a shared space, track metrics before and after implementing changes, and celebrate successful improvements publicly to reinforce the feedback culture
Create accountability for implementation: Assign owners for each improvement initiative, set specific timelines and success criteria, and review progress on improvements in regular team meetings
Build improvement into regular workflow: Allocate dedicated time for process improvement work, include improvement tasks in sprint planning, and make process improvement a regular topic in one-on-one conversations
Scale successful practices: Share effective improvements with other teams in the organization, document successful patterns for future reference, and build successful practices into onboarding for new team members

Create a self-reinforcing cycle where the team becomes increasingly effective at identifying problems, implementing solutions, and measuring results. Teams that master this cycle become engines of continuous improvement that consistently outperform their peers.

Building high-performance engineering teams through feedback loops requires patience, consistency, and commitment from leadership. The investment pays enormous dividends in team velocity, code quality, job satisfaction, and organizational impact.

Adorable bookmarklets want to help delete your social media data

2018-06-14T13:12:02-04:00

A little while ago I wrote about a Lambda function I called ephemeral for deleting my old tweets. While it’s a great project for someone familiar with or wanting to learn to use Lambda, it isn’t simple for a non-technical person to set up. There are services out there that will delete your tweets for you, but require your access credentials. There didn’t seem to be anything that provided convenience without also requiring authentication.

So, I went oldschool and created the ephemeral bookmarklet.

If that didn’t make you instantly nostalgic, a bookmarklet is a little application that lives as a bookmark in your web browser. You “install” it by dragging the link to your bookmarks toolbar, or right-clicking on the link and choosing “Bookmark this link” (Firefox). You click it to execute the program on the current page.

Here’s what the ephemeral bookmarklet will do:

The ephemeral bookmarklet is part of a new suite of tools for personal data management that I’m co-creating with Adam Drake. You can get all the bookmarklets on this page, and they’re also open source on GitHub.

There are currently bookmarklets for managing your data on LinkedIn and Twitter. We’re looking for testers and contributors to help make this a comprehensive toolset for your social media data management. If you write code, I invite you to contribute and help this toolset grow.

∩{｡◕‿◕｡}∩ – Bookmarklet says hi!

A coffee-break introduction to time complexity of algorithms

2018-05-30T14:08:28-04:00

Just like writing your very first for loop, understanding time complexity is an integral milestone to learning how to write efficient complex programs. Think of it as having a superpower that allows you to know exactly what type of program might be the most efficient in a particular situation - before even running a single line of code.

The fundamental concepts of complexity analysis are well worth studying. You’ll be able to better understand how the code you’re writing will interact with the program’s input, and as a result, you’ll waste a lot less time writing slow and problematic code. It won’t take long to go over all you need to know in order to start writing more efficient programs - in fact, we can do it in about fifteen minutes. You can go grab a coffee right now (or tea, if that’s your thing) and I’ll take you through it before your coffee break is over. Go ahead, I’ll wait.

All set? Let’s do it!

What is “time complexity” anyway

The time complexity of an algorithm is an approximation of how long that algorithm will take to process some input. It describes the efficiency of the algorithm by the magnitude of its operations. This is different than the number of times an operation repeats; I’ll expand on that later. Generally, the fewer operations the algorithm has, the faster it will be.

We write about time complexity using Big O notation, which looks something like O(n). There’s rather a lot of math involved in its formal definition, but informally we can say that Big O notation gives us our algorithm’s approximate run time in the worst case, or in other words, its upper bound.^[2] It is inherently relative and comparative.^[3] We’re describing the algorithm’s efficiency relative to the increasing size of its input data, n. If the input is a string, then n is the length of the string. If it’s a list of integers, n is the length of the list.

It’s easiest to picture what Big O notation represents with a graph:

Lines made with the very excellent Desmos graph calculator. You can play with this graph here.

Here are the main important points to remember as you read the rest of this article:

Time complexity is an approximation
An algorithm’s time complexity approximates its worst case run time

Determining time complexity

There are different classes of complexity that we can use to quickly understand an algorithm. I’ll illustrate some of these classes using nested loops and other examples.

Polynomial time complexity

A polynomial, from the Greek poly meaning “many,” and Latin nomen meaning “name,” describes an expression made up of constants and variables combined using addition, multiplication, and exponentiation to a non-negative integer power.^[4] That’s a super math-y way to say that it contains variables usually denoted by letters and symbols that look like these:

The below classes describe polynomial algorithms. Some have food examples.

Constant

A constant time algorithm doesn’t change its running time in response to the input data. No matter the size of the data it receives, the algorithm takes the same amount of time to run. We denote this as a time complexity of O(1).

Here’s one example of a constant algorithm that takes the first item in a slice.

func takeCupcake(cupcakes []int) int {
    return cupcakes[0]
}

Choice of flavours are: vanilla cupcake, strawberry cupcake, mint chocolate cupcake, lemon cupcake, and wibbly wobbly, timey wimey cupcake.

With this constant-time algorithm, no matter how many cupcakes are on offer, you just get the first one. Oh well. Flavours are overrated anyway.

Linear

The running duration of a linear algorithm grows in direct proportion to the size of its input. It will process the input in n number of operations. This is often the best possible (most efficient) case for time complexity where all the data must be examined.

Here’s an example of code with time complexity of O(n):

func eatChips(bowlOfChips int) {
 for chip := 0; chip <= bowlOfChips; chip++ {
  // dip chip
 }
}

Here’s another example of code with time complexity of O(n):

func eatChips(bowlOfChips int) {
 for chip := 0; chip <= bowlOfChips; chip++ {
  // double dip chip
 }
}

It doesn’t matter whether the code inside the loop executes once, twice, or any other fixed number of times per chip. Both these loops process the input by a constant factor of n, and thus can be described as linear.

Don’t double dip in a shared bowl.
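To see why the constant factor doesn’t change the class, here’s a minimal sketch that counts operations instead of eating chips. The countDips function and its dipsPerChip parameter are hypothetical names I’m introducing for illustration, and the loop visits exactly n chips.

```go
package main

import "fmt"

// countDips is a hypothetical helper: it returns how many dip operations
// happen when each of n chips is dipped dipsPerChip times.
func countDips(n, dipsPerChip int) int {
	ops := 0
	for chip := 0; chip < n; chip++ {
		for dip := 0; dip < dipsPerChip; dip++ {
			ops++ // one dip
		}
	}
	return ops
}

func main() {
	for _, n := range []int{10, 100, 1000} {
		// Single dipping costs n operations, double dipping costs 2n.
		fmt.Println(n, countDips(n, 1), countDips(n, 2))
	}
}
```

Multiplying the input by ten multiplies both counts by ten. The double dipper always does twice the work, but both grow in a straight line with n.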

Quadratic

Now here’s an example of code with time complexity of O(n²):

func pizzaDelivery(pizzas int) {
 for pizza := 0; pizza <= pizzas; pizza++ {
  // slice pizza
  for slice := 0; slice <= pizza; slice++ {
   // eat slice of pizza
  }
 }
}

Because there are two nested loops, or nested linear operations, the algorithm processes the input on the order of n² times.
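You might notice that the inner loop in pizzaDelivery only runs up to the current pizza, not all the way to n. Here’s a rough sketch that counts the inner iterations to show the total still grows with n². The countSlices function is a hypothetical counting version of the loops above, not part of the original example.

```go
package main

import "fmt"

// countSlices is a hypothetical helper that mirrors the loop structure of
// pizzaDelivery and counts how many times the innermost body runs.
func countSlices(pizzas int) int {
	ops := 0
	for pizza := 0; pizza <= pizzas; pizza++ {
		for slice := 0; slice <= pizza; slice++ {
			ops++ // eat slice of pizza
		}
	}
	return ops
}

func main() {
	for _, n := range []int{10, 100, 1000} {
		ops := countSlices(n)
		// The ratio ops / n² settles near 0.5 as n grows.
		fmt.Println(n, ops, float64(ops)/float64(n*n))
	}
}
```

The count works out to (n+1)(n+2)/2, which is roughly half of n². Since time complexity ignores constant factors like that one half, the class is still O(n²).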

Cubic

Extending on the previous example, this code with three nested loops has time complexity of O(n³):

func pizzaDelivery(boxesDelivered int) {
 for pizzaBox := 0; pizzaBox <= boxesDelivered; pizzaBox++ {
  // open box
  for pizza := 0; pizza <= pizzaBox; pizza++ {
   // slice pizza
   for slice := 0; slice <= pizza; slice++ {
    // eat slice of pizza
   }
  }
 }
}

Seriously though, who delivers unsliced pizza??

Logarithmic

A logarithmic algorithm is one that reduces the size of the input by a constant fraction, typically half, at every step. We denote this time complexity as O(log n), where log, the logarithm function, is this shape:

One example of this is a binary search algorithm that finds the position of an element within a sorted array. Here’s how it would work, assuming we’re trying to find the element x:

1. If x matches the middle element m of the array, return the position of m
2. If x doesn’t match m, see if m is larger or smaller than x
   - If larger, discard all array items greater than m
   - If smaller, discard all array items smaller than m
3. Continue by repeating steps 1 and 2 on the remaining array until x is found
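The steps above can be sketched in Go roughly like this. It assumes a slice of integers sorted in ascending order, and the -1 return for a missing element is my own addition, since the steps only describe the case where x is eventually found.

```go
package main

import "fmt"

// binarySearch returns the index of x in the sorted slice items,
// or -1 if x is not present (an assumption added for this sketch).
func binarySearch(items []int, x int) int {
	low, high := 0, len(items)-1
	for low <= high {
		mid := low + (high-low)/2
		switch {
		case items[mid] == x:
			return mid // x matches the middle element
		case items[mid] > x:
			high = mid - 1 // discard the middle element and everything greater
		default:
			low = mid + 1 // discard the middle element and everything smaller
		}
	}
	return -1
}

func main() {
	shelf := []int{2, 3, 5, 7, 11, 13, 17, 19}
	fmt.Println(binarySearch(shelf, 13)) // 5
	fmt.Println(binarySearch(shelf, 4))  // -1
}
```

Each pass through the loop halves the remaining range, so a shelf of eight items needs at most four looks, and doubling the shelf adds only one more.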

I find the clearest analogy for understanding binary search is imagining the process of locating a book in a bookstore aisle. If the books are organized by author’s last name and you want to find “Terry Pratchett,” you know you need to look for the “P” section.

You can approach the shelf at any point along the aisle and look at the author’s last name there. If you’re looking at a book by Neil Gaiman, you know you can ignore all the rest of the books to your left, since no letters that come before “G” in the alphabet happen to be “P.” You would then move down the aisle to the right any amount, and repeat this process until you’ve found the Terry Pratchett section, which should be rather sizable if you’re at any decent bookstore because wow did he write a lot of books.

Quasilinear

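Often seen with sorting algorithms, the time complexity O(n log n) can describe, for example, n operations performed on a data structure where each operation takes O(log n) time. One example of this is quick sort, a divide-and-conquer algorithm that runs in O(n log n) time on average, although its worst case is O(n²).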

Quick sort works by dividing up an unsorted array into smaller chunks that are easier to process. It sorts the sub-arrays, and thus the whole array. Think about it like trying to put a deck of cards in order. It’s faster if you split up the cards and get five friends to help you.
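Here’s a simplified sketch of the idea. Production implementations usually partition the array in place. This version builds new slices instead because I find it easier to read, and the choice of the middle element as the pivot is just one common option.

```go
package main

import "fmt"

// quickSort returns the elements of cards in ascending order.
// The input slice is left unmodified.
func quickSort(cards []int) []int {
	if len(cards) <= 1 {
		return cards // nothing left to split
	}
	pivot := cards[len(cards)/2]
	var smaller, equal, larger []int
	for _, c := range cards {
		switch {
		case c < pivot:
			smaller = append(smaller, c)
		case c > pivot:
			larger = append(larger, c)
		default:
			equal = append(equal, c)
		}
	}
	// Sort each pile separately, then stack them back together.
	sorted := append(quickSort(smaller), equal...)
	return append(sorted, quickSort(larger)...)
}

func main() {
	fmt.Println(quickSort([]int{5, 2, 9, 1, 5, 6})) // [1 2 5 5 6 9]
}
```

When the pivot splits the cards into piles of roughly equal size, there are about log n levels of splitting with n cards handled at each level, which is where the average O(n log n) comes from.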

Non-polynomial time complexity

The below classes of algorithms are non-polynomial.

Factorial

An algorithm with time complexity O(n!) often iterates through all permutations of the input elements. One common example is a brute-force search seen in the travelling salesman problem. It tries to find the least costly path between a number of points by enumerating all possible permutations and finding the ones with the lowest cost.
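As a rough sketch of that brute-force approach, the following tries every ordering of the points and keeps the cheapest round trip. The function names, the distance matrix, and the choice to start and end at point 0 are all assumptions made for illustration, and the matrix must contain at least one point.

```go
package main

import "fmt"

// permutations calls visit once for every ordering of items[k:],
// swapping elements in place and restoring them afterwards.
func permutations(items []int, k int, visit func([]int)) {
	if k == len(items) {
		visit(items)
		return
	}
	for i := k; i < len(items); i++ {
		items[k], items[i] = items[i], items[k]
		permutations(items, k+1, visit)
		items[k], items[i] = items[i], items[k]
	}
}

// cheapestTour returns the lowest total cost of a round trip that starts
// and ends at point 0 and visits every other point exactly once.
func cheapestTour(dist [][]int) int {
	rest := make([]int, 0, len(dist)-1)
	for p := 1; p < len(dist); p++ {
		rest = append(rest, p)
	}
	best := -1
	permutations(rest, 0, func(order []int) {
		cost, prev := 0, 0
		for _, p := range order {
			cost += dist[prev][p]
			prev = p
		}
		cost += dist[prev][0] // return to the start
		if best == -1 || cost < best {
			best = cost
		}
	})
	return best
}

func main() {
	// Hypothetical symmetric distances between four points.
	dist := [][]int{
		{0, 10, 15, 20},
		{10, 0, 35, 25},
		{15, 35, 0, 30},
		{20, 25, 30, 0},
	}
	fmt.Println(cheapestTour(dist)) // 80
}
```

Fixing the starting point means this checks (n-1)! orderings rather than n!, but that’s still factorial growth: four points need 6 checks, while eleven points already need more than 3.6 million.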

Exponential

An exponential algorithm often also iterates through all subsets of the input elements. It is denoted O(2ⁿ) and is often seen in brute-force algorithms. It is similar to factorial time except in its rate of growth, which as you may not be surprised to hear, is exponential. The larger the data set, the steeper the curve becomes.

In cryptography, a brute-force attack may systematically check all possible elements of a password by iterating through subsets. Using an exponential algorithm to do this, it becomes incredibly resource-expensive to brute-force crack a long password versus a shorter one. This is one reason that a long password is considered more secure than a shorter one.
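To make “all subsets” concrete, here’s a small sketch that enumerates them with a bitmask, where each bit decides whether the matching element is in or out. The subsets function is a hypothetical helper for illustration, not a password cracker.

```go
package main

import "fmt"

// subsets returns every subset of items. There are 2^n of them,
// one for each bit pattern between 0 and 2^n - 1.
func subsets(items []string) [][]string {
	n := len(items)
	all := make([][]string, 0, 1<<n)
	for mask := 0; mask < 1<<n; mask++ {
		var subset []string
		for i := 0; i < n; i++ {
			if mask&(1<<i) != 0 { // bit i set: include items[i]
				subset = append(subset, items[i])
			}
		}
		all = append(all, subset)
	}
	return all
}

func main() {
	fmt.Println(len(subsets([]string{"a", "b", "c"}))) // 8
}
```

Every extra element doubles the number of subsets, so three elements give 8, ten give 1,024, and thirty give over a billion.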

There are further time complexity classes less commonly seen that I won’t cover here, but you can read about these and find examples in this handy table.

Recursion time complexity

As I described in my article explaining recursion using apple pie, a recursive function calls itself under specified conditions. Its time complexity depends on how many times the function is called and the time complexity of a single function call. In other words, it’s the product of the number of times the function runs and a single execution’s time complexity.

Here’s a recursive function that eats pies until no pies are left:

func eatPies(pies int) int {
 if pies == 0 {
  return pies
 }
 return eatPies(pies - 1)
}

The time complexity of a single execution is constant. No matter how many pies are input, the program will do the same thing: check to see if the input is 0. If so, return, and if not, call itself with one fewer pie.

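The initial number of pies could be any number, and we need to process all of them, so we can describe the input as n. The function is called once per pie, and each call does a constant amount of work, so the time complexity of this recursive function is the product of n calls and O(1) per call: O(n).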

This function’s return value is zero, plus some indigestion.
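If you’d like to check the number of calls for yourself, here’s a sketch of the same function with a counter bolted on. The name eatPiesCounting and the calls parameter are hypothetical additions for illustration.

```go
package main

import "fmt"

// eatPiesCounting behaves like eatPies but also increments *calls
// every time the function is entered.
func eatPiesCounting(pies int, calls *int) int {
	*calls++
	if pies == 0 {
		return pies
	}
	return eatPiesCounting(pies-1, calls)
}

func main() {
	for _, n := range []int{3, 30, 300} {
		calls := 0
		eatPiesCounting(n, &calls)
		fmt.Println(n, calls) // calls is always n + 1
	}
}
```

The extra call is the final check that finds no pies left. Since n + 1 grows in step with n, the function stays linear.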

Worst case time complexity

So far, we’ve talked about the time complexity of a few nested loops and some code examples. Most algorithms, however, are built from many combinations of these. How do we determine the time complexity of an algorithm containing many of these elements strung together?

Easy. We can describe the total time complexity of the algorithm by finding the largest complexity among all of its parts. This is because the slowest part of the code is the bottleneck, and time complexity is concerned with describing the worst case for the algorithm’s run time.

Say we have a program for an office party. If our program looks like this:

package main

import "fmt"

func takeCupcake(cupcakes []int) int {
 fmt.Println("Have cupcake number",cupcakes[0])
 return cupcakes[0]
}

func eatChips(bowlOfChips int) {
 fmt.Println("Have some chips!")
 for chip := 0; chip <= bowlOfChips; chip++ {
  // dip chip
 }
 fmt.Println("No more chips.")
}

func pizzaDelivery(boxesDelivered int) {
 fmt.Println("Pizza is here!")
 for pizzaBox := 0; pizzaBox <= boxesDelivered; pizzaBox++ {
  // open box
  for pizza := 0; pizza <= pizzaBox; pizza++ {
   // slice pizza
   for slice := 0; slice <= pizza; slice++ {
    // eat slice of pizza
   }
  }
 }
 fmt.Println("Pizza is gone.")
}

func eatPies(pies int) int {
 if pies == 0 {
  fmt.Println("Someone ate all the pies!")
  return pies
 }
 fmt.Println("Eating pie...")
 return eatPies(pies - 1)
}

func main() {
 takeCupcake([]int{1, 2, 3})
 eatChips(23)
 pizzaDelivery(3)
 eatPies(3)
 fmt.Println("Food gone. Back to work!")
}

We can describe the time complexity of all the code by the complexity of its most complex part. This program is made up of functions we’ve already seen, with the following time complexity classes:

Function	Class	Big O
`takeCupcake`	constant	O(1)
`eatChips`	linear	O(n)
`pizzaDelivery`	cubic	O(n³)
`eatPies`	linear (recursive)	O(n)

To describe the time complexity of the entire office party program, we choose the worst case. This program would have the time complexity O(n³).

Here’s the office party soundtrack, just for fun.

Have cupcake number 1
Have some chips!
No more chips.
Pizza is here!
Pizza is gone.
Eating pie...
Eating pie...
Eating pie...
Someone ate all the pies!
Food gone. Back to work!

P vs NP, NP-complete, and NP-hard

You may come across these terms in your explorations of time complexity. Informally, P (for Polynomial time) is a class of problems that can be solved quickly, that is, in polynomial time. NP, for Nondeterministic Polynomial time, is a class of problems whose answers can be verified in polynomial time. NP contains P, and it also contains the NP-complete problems, the hardest problems in NP, for which no polynomial-time solution is known.^[5] NP-hard is a broader class of problems that are at least as hard as the NP-complete ones; it includes the NP-complete problems but also extends outside of NP, and no one has found a polynomial-time algorithm for any of them.^[6]

P vs NP Euler diagram, by Behnam Esfahbod, CC BY-SA 3.0

P versus NP is an unsolved, open question in computer science.

Anyway, you don’t generally need to know about NP and NP-hard problems to begin taking advantage of understanding time complexity. They’re a whole other Pandora’s box.

Approximate the efficiency of an algorithm before you write the code

So far, we’ve identified some different time complexity classes and how we might determine which one an algorithm falls into. So how does this help us before we’ve written any code to evaluate?

By combining a little knowledge of time complexity with an awareness of the size of our input data, we can take a guess at an efficient algorithm for processing our data within a given time constraint. We can base our estimation on the fact that a modern computer can perform some hundreds of millions of operations in a second.^[1] The following table from the Competitive Programmer’s Handbook offers some estimates on required time complexity to process the respective input size in a time limit of one second.

Input size	Required time complexity for 1s processing time
n ≤ 10	O(n!)
n ≤ 20	O(2ⁿ)
n ≤ 500	O(n³)
n ≤ 5000	O(n²)
n ≤ 10⁶	O(n log n) or O(n)
n is large	O(1) or O(log n)
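As a rough illustration of how the table can be used, the sketch below estimates the number of operations for a given input size and compares it against a budget. The budget of 10⁸ operations per second is an assumption picked as a conservative round number, and `estimateOps` and `fitsBudget` are hypothetical helper names.

```go
package main

import (
	"fmt"
	"math"
)

// opsPerSecond is an assumed budget, not a measured figure.
const opsPerSecond = 1e8

// estimateOps returns a rough operation count for input size n
// under the named complexity class. Unknown names return +Inf.
func estimateOps(class string, n float64) float64 {
	switch class {
	case "1":
		return 1
	case "log n":
		return math.Log2(n)
	case "n":
		return n
	case "n log n":
		return n * math.Log2(n)
	case "n^2":
		return n * n
	case "n^3":
		return n * n * n
	case "2^n":
		return math.Pow(2, n)
	default:
		return math.Inf(1)
	}
}

// fitsBudget reports whether the estimate stays within one second's budget.
func fitsBudget(class string, n float64) bool {
	return estimateOps(class, n) <= opsPerSecond
}

func main() {
	fmt.Println(fitsBudget("n^2", 5000))    // 2.5e7 operations: true
	fmt.Println(fitsBudget("n^2", 1e6))     // 1e12 operations: false
	fmt.Println(fitsBudget("n log n", 1e6)) // about 2e7 operations: true
}
```

Because the constant factors are unknown, a check like this is only good for ruling designs out, not for predicting actual run times.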

Keep in mind that time complexity is an approximation, and not a guarantee. We can save a lot of time and effort by immediately ruling out algorithm designs that are unlikely to suit our constraints, but we must also consider that Big O notation doesn’t account for constant factors. Here’s some code to illustrate.

The following two algorithms both have O(n) time complexity.

func makeCoffee(scoops int) {
 for scoop := 0; scoop <= scoops; scoop++ {
  // add instant coffee
 }
}

func makeStrongCoffee(scoops int) {
 for scoop := 0; scoop <= 3*scoops; scoop++ {
  // add instant coffee
 }
}

The first function makes a cup of coffee with the number of scoops we ask for. The second function also makes a cup of coffee, but it triples the number of scoops we ask for. To see an illustrative example, let’s ask both these functions for a cup of coffee with a million scoops.

Here’s the output of the Go test:

Benchmark_makeCoffee-4          1000000000               0.29 ns/op
Benchmark_makeStrongCoffee-4    1000000000               0.86 ns/op

Our first function, makeCoffee, completed in an average of 0.29 nanoseconds. Our second function, makeStrongCoffee, completed in an average of 0.86 nanoseconds. While those may both seem like pretty small numbers, consider that the stronger coffee took nearly three times longer to make. This should make sense intuitively, since we asked it to triple the scoops. Big O notation alone wouldn’t tell you this, since the constant factor of the tripled scoops isn’t accounted for.

Improve time complexity of existing code

Becoming familiar with time complexity gives us the opportunity to write code, or refactor code, to be more efficient. To illustrate, I’ll give a concrete example of one way we can refactor a bit of code to improve its time complexity.

Let’s say a bunch of people at the office want some pie. Some people want pie more than others. How much each person wants pie is represented by an int > 0:

diners := []int{2, 88, 87, 16, 42, 10, 34, 1, 43, 56}

Unfortunately, we’re bootstrapped and there are only three forks to go around. Since we’re a cooperative bunch, the three people who want pie the most will receive the forks to eat it with. Even though they’ve all agreed on this, no one seems to want to sort themselves out and line up in an orderly fashion, so we’ll have to make do with everybody jumbled about.

Without sorting the list of diners, return the three largest integers in the slice.

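Here’s a function that solves this problem with a nested loop. Its running time grows with the number of forks multiplied by the number of diners, which approaches O(n²) time complexity as the number of forks approaches the number of diners: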

func giveForks(diners []int) []int {
 // make a slice to store diners who will receive forks
 var withForks []int
 // loop over three forks
 for i := 1; i <= 3; i++ {
  // variables to keep track of the highest integer and where it is
  var max, maxIndex int
  // loop over the diners slice
  for n := range diners {
   // if this integer is higher than max, update max and maxIndex
   if diners[n] > max {
    max = diners[n]
    maxIndex = n
   }
  }
  // remove the highest integer from the diners slice for the next loop
  // (note: this overwrites elements in the caller's underlying array)
  diners = append(diners[:maxIndex], diners[maxIndex+1:]...)
  // keep track of who gets a fork
  withForks = append(withForks, max)
 }
 return withForks
}

This program works, and eventually returns diners [88 87 56]. Everyone gets a little impatient while it’s running though, since it takes rather a long time (about 120 nanoseconds) just to hand out three forks, and the pie’s getting cold. How could we improve it?

By thinking about our approach in a slightly different way, we can refactor this program to have O(n) time complexity:

func giveForks(diners []int) []int {
 // make a slice to store diners who will receive forks
 var withForks []int
 // create variables for each fork
 var first, second, third int
 // loop over the diners
 for i := range diners {
  // assign the forks
  if diners[i] > first {
   third = second
   second = first
   first = diners[i]
  } else if diners[i] > second {
   third = second
   second = diners[i]
  } else if diners[i] > third {
   third = diners[i]
  }
 }
 // list the final result of who gets a fork
 withForks = append(withForks, first, second, third)
 return withForks
}

Here’s how the new program works:

Initially, diner 2 (the first in the list) is assigned the first fork. The other forks remain unassigned.

Then, diner 88 is assigned the first fork instead. Diner 2 gets the second one.

Diner 87 isn’t greater than first, which is currently 88, but it is greater than 2, who has the second fork. So, the second fork goes to 87. Diner 2 gets the third fork.

Continuing in this violent and rapid fork exchange, diner 16 is then assigned the third fork instead of 2, and so on.

We can add a print statement in the loop to see how the fork assignments play out:
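A sketch of what that might look like, with the function renamed to `giveForksTraced` (a hypothetical name) so it can sit alongside the original. The if/else chain is written as a switch here, which behaves the same way:

```go
package main

import "fmt"

// giveForksTraced is the single-loop giveForks with a print statement
// added so each fork reassignment is visible.
func giveForksTraced(diners []int) []int {
	var first, second, third int
	for i := range diners {
		switch {
		case diners[i] > first:
			third, second, first = second, first, diners[i]
		case diners[i] > second:
			third, second = second, diners[i]
		case diners[i] > third:
			third = diners[i]
		default:
			continue // no fork changed hands, nothing to print
		}
		fmt.Println("diner", diners[i], "-> forks:", first, second, third)
	}
	return []int{first, second, third}
}

func main() {
	diners := []int{2, 88, 87, 16, 42, 10, 34, 1, 43, 56}
	fmt.Println(giveForksTraced(diners))
}
```

With the diners from earlier, this prints:

```
diner 2 -> forks: 2 0 0
diner 88 -> forks: 88 2 0
diner 87 -> forks: 88 87 2
diner 16 -> forks: 88 87 16
diner 42 -> forks: 88 87 42
diner 43 -> forks: 88 87 43
diner 56 -> forks: 88 87 56
[88 87 56]
```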

This program is much faster, and the whole epic struggle for fork domination is over in 47 nanoseconds.

As you can see, with a little change in perspective and some refactoring, we’ve made this simple bit of code faster and more efficient.

Well, it looks like our fifteen minute coffee break is up! I hope I’ve given you a comprehensive introduction to calculating time complexity. Time to get back to work, hopefully applying your new knowledge to write more effective code! Or maybe just sound smart at your next office party. :)

References

“If I have seen further it is by standing on the shoulders of Giants.” –Isaac Newton, 1675

1. Antti Laaksonen. Competitive Programmer’s Handbook (pdf), 2017
2. Wikipedia: Big O notation
3. StackOverflow: What is a plain English explanation of “Big O” notation?
4. Wikipedia: Polynomial
5. Wikipedia: NP-completeness
6. Wikipedia: NP-hardness
7. Desmos graph calculator

Knapsack problem algorithms for my real-life carry-on knapsack

2018-05-09T21:00:35-04:00

The knapsack problem

I’m a nomad and live out of one carry-on bag. This means that the total weight of all my worldly possessions must fall under airline cabin baggage weight limits - usually 10kg. On some smaller airlines, however, this weight limit drops to 7kg. Occasionally, I have to decide not to bring something with me to adjust to the smaller weight limit.

As a practical exercise, deciding what to leave behind (or get rid of altogether) entails laying out all my things and choosing which ones to keep. That decision is based on the item’s usefulness to me (its worth) and its weight.

This is all my stuff, and my Minaal Carry-on bag.

Being a programmer, I’m aware that decisions like this could be made more efficiently by a computer. It’s done so frequently and so ubiquitously, in fact, that many will recognize this scenario as the classic packing problem or knapsack problem. How do I go about telling a computer to put as many important items in my bag as possible while coming in at or under a weight limit of 7kg? With algorithms! Yay!

I’ll discuss two common approaches to solving the knapsack problem: one called a greedy algorithm, and another called dynamic programming (a little harder, but better, faster, stronger…).

Let’s get to it.

The set up

I prepared my data in the form of a CSV file with three columns: the item’s name (a string), a representation of its worth (an integer), and its weight in grams (an integer). There are 40 items in total. I represented worth by ranking each item from 40 to 1, with 40 being the most important and 1 equating with something like “why do I even have this again?” (If you’ve never listed out all your possessions and ranked them by order of how useful they are to you, I highly recommend you try it. It can be a very revealing exercise.)

Total weight of all items and bag: 9003g

Bag weight: 1415g

Airline limit: 7000g

Maximum weight of items I can pack: 5585g

Total possible worth of items: 820

The challenge: Pack as many items as the limit allows while maximizing the total worth.

Data structures

Reading in a file

Before we can begin thinking about how to solve the knapsack problem, we have to solve the problem of reading in and storing our data. Thankfully, the Go standard library’s io/ioutil package makes the first part straightforward.

package main

import (
    "fmt"
    "io/ioutil"
)

func check(e error) {
    if e != nil {
        panic(e)
    }
}

func readItems(path string) {
    dat, err := ioutil.ReadFile(path)
    check(err)
    fmt.Print(string(dat))
}

The ReadFile() function takes a file path and returns the file’s contents and an error (nil if the call is successful) so we’ve also created a check() function to handle any errors that might be returned. In a real-world application we probably would want to do something more sophisticated than panic, but that’s not important right now.
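For readers on newer versions of Go: since Go 1.16 the `io/ioutil` functions are deprecated in favor of equivalents in `os` and `io`. Below is a minimal sketch of the same read using `os.ReadFile`, handing the error back to the caller rather than panicking. The name `readRaw` is hypothetical and not part of the program we’re building.

```go
package main

import (
	"fmt"
	"os"
)

// readRaw returns the file's contents as a string, or a wrapped error
// that says which path could not be read.
func readRaw(path string) (string, error) {
	dat, err := os.ReadFile(path)
	if err != nil {
		return "", fmt.Errorf("reading %s: %w", path, err)
	}
	return string(dat), nil
}

func main() {
	contents, err := readRaw("objects.csv")
	if err != nil {
		fmt.Println("could not read items:", err)
		return
	}
	fmt.Print(contents)
}
```

Returning the error lets the caller decide what to do, which is one way to be “more sophisticated than panic” without adding much code.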

Creating a struct

Now that we’ve got our data, we should probably do something with it. Since we’re working with real-life items and a real-life bag, let’s create some types to represent them and make it easier to conceptualize our program. A struct in Go is a typed collection of fields. Here are our two types:

type item struct {
    name          string
    worth, weight int
}

type bag struct {
    bagWeight, currItemsWeight, maxItemsWeight, totalWeight, totalWorth int
    items                                                               []item
}

It is helpful to use field names that are very descriptive. You can see that the structs are set up just as we’ve described the things they represent. An item has a name (string), and a worth and weight (integers). A bag has several fields of type int representing its attributes, and also has the ability to hold items, represented in the struct as a slice of item type thingamabobbers.

Parsing and storing our data

Several comprehensive Go packages exist that we could use to parse our CSV data… but where’s the fun in that? Let’s go basic with some string splitting and a for loop. Here’s our updated readItems() function:

func readItems(path string) []item {

    dat, err := ioutil.ReadFile(path)
    check(err)

    lines := strings.Split(string(dat), "\n")

    itemList := make([]item, 0)

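    for i, v := range lines {
        // skip the headers on the first line, and any blank lines
        // (such as the one left by a trailing newline at the end of the file)
        if i == 0 || strings.TrimSpace(v) == "" {
            continue
        }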
        s := strings.Split(v, ",")
        newItemWorth, _ := strconv.Atoi(s[1])
        newItemWeight, _ := strconv.Atoi(s[2])
        newItem := item{name: s[0], worth: newItemWorth, weight: newItemWeight}
        itemList = append(itemList, newItem)
    }
    return itemList
}

Using strings.Split, we split our dat on newlines. We then create an empty itemList to hold our items.

In our for loop, we skip the first line of our CSV file (the headers) then iterate over each line. We use strconv.Atoi (read “A to i”) to convert the values for each item’s worth and weight into integers. We then create a newItem with these field values and append it to the itemList. Finally, we return itemList.
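For comparison, here is a sketch of the same parsing done with the standard library’s `encoding/csv` package, reporting bad rows as errors instead of discarding the result of `strconv.Atoi`. The function name `parseItems`, the header names, and the sample rows are made up for illustration; this is not the code used for the results in this article.

```go
package main

import (
	"encoding/csv"
	"fmt"
	"strconv"
	"strings"
)

type item struct {
	name          string
	worth, weight int
}

// parseItems reads CSV text with a header row and three columns:
// name, worth, weight. It returns an error for malformed rows.
func parseItems(data string) ([]item, error) {
	records, err := csv.NewReader(strings.NewReader(data)).ReadAll()
	if err != nil {
		return nil, err
	}
	var items []item
	for i, rec := range records {
		if i == 0 {
			continue // skip the headers in the first record
		}
		if len(rec) < 3 {
			return nil, fmt.Errorf("record %d: expected 3 columns, got %d", i+1, len(rec))
		}
		worth, err := strconv.Atoi(strings.TrimSpace(rec[1]))
		if err != nil {
			return nil, fmt.Errorf("record %d: bad worth: %w", i+1, err)
		}
		weight, err := strconv.Atoi(strings.TrimSpace(rec[2]))
		if err != nil {
			return nil, fmt.Errorf("record %d: bad weight: %w", i+1, err)
		}
		items = append(items, item{name: rec[0], worth: worth, weight: weight})
	}
	return items, nil
}

func main() {
	// made-up sample rows, not the article's data file
	data := "name,worth,weight\nlaptop,40,1100\nsocks,26,95\n"
	items, err := parseItems(data)
	if err != nil {
		fmt.Println("could not parse items:", err)
		return
	}
	fmt.Println(items) // [{laptop 40 1100} {socks 26 95}]
}
```

The CSV reader also copes with quoted fields that contain commas, which plain string splitting would break apart.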

Here’s what our set up looks like so far:

package main

import (
    "io/ioutil"
    "strconv"
    "strings"
)

type item struct {
    name          string
    worth, weight int
}

type bag struct {
    bagWeight, currItemsWeight, maxItemsWeight, totalWeight, totalWorth int
    items                                                               []item
}

func check(e error) {
    if e != nil {
        panic(e)
    }
}

func readItems(path string) []item {

    dat, err := ioutil.ReadFile(path)
    check(err)

    lines := strings.Split(string(dat), "\n")

    itemList := make([]item, 0)

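    for i, v := range lines {
        // skip the headers on the first line, and any blank lines
        // (such as the one left by a trailing newline at the end of the file)
        if i == 0 || strings.TrimSpace(v) == "" {
            continue
        }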
        s := strings.Split(v, ",")
        newItemWorth, _ := strconv.Atoi(s[1])
        newItemWeight, _ := strconv.Atoi(s[2])
        newItem := item{name: s[0], worth: newItemWorth, weight: newItemWeight}
        itemList = append(itemList, newItem)
    }
    return itemList
}

Now that we’ve got our data structures set up, let’s get packing (🥁) on the first approach.

Greedy algorithm

A greedy algorithm is the most straightforward approach to solving the knapsack problem, in that it is a one-pass algorithm that constructs a single final solution. At each stage of the problem, the greedy algorithm picks the option that is locally optimal, meaning it looks like the most suitable option right now. It does not revise its previous choices as it progresses through our data set.

Building our greedy algorithm

The steps of the algorithm we’ll use to solve our knapsack problem are:

Sort items by worth, in descending order.
Start with the highest worth item. Put items into the bag until the next item on the list cannot fit.
Try to fill any remaining capacity with the next item on the list that can fit.

If you read my article about solving problems and making paella, you’ll know that I always start by figuring out what the next most important question is. In this case, there are three main operations we need to figure out how to do:

Sort items by worth.
Put an item in the bag.
Check to see if the bag is full.

The first one is just a docs lookup away. Here’s how we sort a slice in Go:

sort.Slice(is, func(i, j int) bool {
    return is[i].worth > is[j].worth
})

The sort.Slice() function orders our items according to the less function we provide. In this case, it will order the highest worth items before the lowest worth items.
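Here is a small runnable version of that sort, using three rows borrowed from the results tables later in this article. The wrapper name `sortByWorth` is hypothetical.

```go
package main

import (
	"fmt"
	"sort"
)

type item struct {
	name          string
	worth, weight int
}

// sortByWorth orders items in place, highest worth first.
func sortByWorth(is []item) {
	sort.Slice(is, func(i, j int) bool {
		return is[i].worth > is[j].worth
	})
}

func main() {
	is := []item{
		{"Lightweight Merino Wool Buff", 20, 50},
		{"10 pairs thongs", 39, 80},
		{"The Roost Stand", 34, 170},
	}
	sortByWorth(is)
	for _, it := range is {
		fmt.Println(it.worth, it.name)
	}
}
```

Note that `sort.Slice` does not promise to keep equal elements in their original order. Every worth value in our data is unique, so that doesn’t matter here; if it did, `sort.SliceStable` takes the same arguments.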

Given that we don’t want to put an item in the bag if it doesn’t fit, we’ll complete the last two tasks in reverse. First, we’ll check to see if the item fits. If so, it goes in the bag.

func (b *bag) addItem(i item) error {
    if b.currItemsWeight+i.weight <= b.maxItemsWeight {
        b.currItemsWeight += i.weight
        b.items = append(b.items, i)
        return nil
    }
    return errors.New("could not fit item")
}

Notice the * in our first line there. That indicates that addItem() has a pointer receiver (as opposed to a value receiver). It’s a concept that can be slightly confusing if you’re new to Go. Here are some things to consider that might help you decide when to use a value receiver and when to use a pointer receiver. For the purposes of our addItem() function, this case applies:

If the method needs to mutate the receiver, the receiver must be a pointer.

Our use of a pointer receiver tells our function we want to operate on this specific bag in particular, not a new bag. It’s important because without it, every item would always fit in a newly created bag! A little detail like this can make the difference between code that works and code that keeps you up until 4am chugging Red Bull and muttering to yourself. (Go to bed on time even if your code doesn’t work - you’ll thank me later.)
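To make the difference concrete, here is a stripped-down sketch with one method of each kind. The one-field `bag` and the method names `addByValue` and `addByPointer` are hypothetical and exist only for this demonstration.

```go
package main

import "fmt"

type bag struct {
	currItemsWeight int
}

// addByValue has a value receiver: it changes a copy of the bag,
// and the copy is thrown away when the method returns.
func (b bag) addByValue(weight int) {
	b.currItemsWeight += weight
}

// addByPointer has a pointer receiver: it changes the caller's bag.
func (b *bag) addByPointer(weight int) {
	b.currItemsWeight += weight
}

func main() {
	var minaal bag
	minaal.addByValue(500)
	fmt.Println(minaal.currItemsWeight) // 0
	minaal.addByPointer(500)
	fmt.Println(minaal.currItemsWeight) // 500
}
```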

Now that we’ve got our components, let’s put together our greedy algorithm:

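// greedy receives the bag by value, so it returns the packed copy to the caller
func greedy(is []item, b bag) bag {
    sort.Slice(is, func(i, j int) bool {
        return is[i].worth > is[j].worth
    })

    for i := range is {
        // an item that doesn't fit returns an error and is simply skipped
        _ = b.addItem(is[i])
    }

    b.totalWeight = b.bagWeight + b.currItemsWeight

    for _, v := range b.items {
        b.totalWorth += v.worth
    }

    return b
}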

Then in our main() function, we’ll create our bag, read in our data, and call our greedy algorithm. Here’s what it looks like, all set up and ready to go:

func main() {

    minaal := bag{bagWeight: 1415, currItemsWeight: 0, maxItemsWeight: 5585}
    itemList := readItems("objects.csv")

    greedy(itemList, minaal)
}

Greedy algorithm results

So how does this algorithm do when it comes to efficiently packing our bag to maximize its total worth? Here’s the result:

Total weight of bag and items: 6987g

Total worth of packed items: 716

Here are the items our greedy algorithm chose, sorted by worth:

Item	Worth	Weight
Lenovo X1 Carbon (5th Gen)	40	112
10 pairs thongs	39	80
5 Underarmour Strappy	38	305
1 pair Uniqlo leggings	37	185
2 Lululemon Cool Racerback	36	174
Chargers and cables in Mini Bomber Travel Kit	35	665
The Roost Stand	34	170
ThinkPad Compact Bluetooth Keyboard with trackpoint	33	460
Seagate Backup Plus Slim	32	159
1 pair black denim shorts	31	197
2 pairs Nike Pro shorts	30	112
2 pairs Lululemon shorts	29	184
Isabella T-Strap Croc sandals	28	200
2 Underarmour HeatGear CoolSwitch tank tops	27	138
5 pairs black socks	26	95
2 pairs Injinji Women’s Run Lightweight No-Show Toe Socks	25	54
1 fancy tank top	24	71
1 light and stretchy long-sleeve shirt (Gap Fit)	23	147
Uniqlo Ultralight Down insulating jacket	22	235
Patagonia Torrentshell	21	301
Lightweight Merino Wool Buff	20	50
1 LBD (H&M)	19	174
Field Notes Pitch Black Memo Book Dot-Graph	18	68
Innergie PocketCell USB-C 6000mAh power bank	17	148
JBL Reflect Mini Bluetooth Sport Headphones	13	14
Oakley Latch Sunglasses	11	30
Petzl E+LITE Emergency Headlamp	8	27

It’s clear that the greedy algorithm is a straightforward way to quickly find a feasible solution. For small data sets, it will probably be close to the optimal solution. The algorithm packed a total item worth of 716 (104 points less than the maximum possible value), while filling the bag with just 13g left over.

As we learned earlier, the greedy algorithm doesn’t improve upon the solution it returns. It simply adds the next highest worth item it can to the bag.
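A tiny made-up example shows how a greedy choice can miss the best answer. The sketch below packs three invented items into a capacity of 10 using the same highest-worth-first rule, then compares the result with a brute-force search over every subset. The names `greedyWorth` and `bestWorth` are hypothetical, and the brute-force search is only there as a yardstick; it is not one of the two approaches this article covers.

```go
package main

import (
	"fmt"
	"sort"
)

type item struct {
	name          string
	worth, weight int
}

// greedyWorth packs items in descending order of worth, skipping any
// item that does not fit, and returns the total worth packed.
func greedyWorth(is []item, capacity int) int {
	sorted := append([]item(nil), is...) // copy so the caller's order is untouched
	sort.Slice(sorted, func(i, j int) bool {
		return sorted[i].worth > sorted[j].worth
	})
	worth, weight := 0, 0
	for _, it := range sorted {
		if weight+it.weight <= capacity {
			weight += it.weight
			worth += it.worth
		}
	}
	return worth
}

// bestWorth tries every subset of items (2^n of them), so it is only
// practical for a handful of items. It returns the highest worth that fits.
func bestWorth(is []item, capacity int) int {
	best := 0
	for mask := 0; mask < 1<<len(is); mask++ {
		worth, weight := 0, 0
		for i, it := range is {
			if mask&(1<<i) != 0 {
				worth += it.worth
				weight += it.weight
			}
		}
		if weight <= capacity && worth > best {
			best = worth
		}
	}
	return best
}

func main() {
	// invented items: the most valuable one crowds out a better pair
	is := []item{{"a", 10, 6}, {"b", 9, 5}, {"c", 8, 5}}
	fmt.Println("greedy:", greedyWorth(is, 10)) // greedy: 10
	fmt.Println("best:  ", bestWorth(is, 10))   // best:   17
}
```

Greedy takes item a first, after which neither b nor c fits. Leaving a out and taking b and c together is worth 17.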

Let’s look at another method for solving the knapsack problem that will give us the optimal solution - the highest possible total worth under the weight limit.

Dynamic programming

The name “dynamic programming” can be a bit misleading. It’s not a style of programming, as the name might cause you to infer, but simply another approach.

Dynamic programming differs from the straightforward greedy algorithm in a few key ways. Firstly, a dynamic programming bag packing solution works through the whole solution space, considering every item at every possible weight capacity rather than committing to a single choice at each step. Where a greedy algorithm chooses the locally optimal option, a dynamic programming algorithm is able to find the globally optimal solution.

Secondly, dynamic programming stores the results of previously computed subproblems and reuses them when the same subproblem comes up again. (When this is done top-down it’s called memoization; our solution fills in a table from the bottom up.) This allows it to “remember” previous combinations, which takes less time than re-computing the answer.

Building our dynamic programming algorithm

To use dynamic programming to find the optimal recipe for packing our bag, we’ll need to:

Create a matrix representing all subsets of the items (the solution space) with rows representing items and columns representing the bag’s remaining weight capacity
Loop through the matrix and calculate the worth that can be obtained by each combination of items at each stage of the bag’s capacity
Examine the completed matrix to determine which items to add to the bag in order to produce the maximum possible worth for the bag in total

It will be most helpful to visualize our solution space. Here’s a representation of what we’re building with our code:

The empty knapsackian multiverse.

In Go, we can create this matrix as a slice of slices.

matrix := make([][]int, numItems+1) // rows representing items
for i := range matrix {
    matrix[i] = make([]int, capacity+1) // columns representing grams of weight
}

We’ve padded the rows and columns by 1 so that the indices match the item and weight numbers.

Now that we’ve created our matrix, we’ll fill it by looping over the rows and the columns:

// loop through table rows
for i := 1; i <= numItems; i++ {
    // loop through table columns
    for w := 1; w <= capacity; w++ {
        // do stuff in each element
    }
}

Then for each element, we’ll calculate the worth value to ascribe to it. We do this with code that represents the following:

If the item at the index matching the current row fits within the weight capacity represented by the current column, take the maximum of either:

The best total worth found so far at this capacity without the current item (the value in the cell directly above), or,

The current item’s worth, plus the best total worth found so far for the capacity that remains once the current item’s weight is set aside

In other words, as our algorithm considers one of the items, we’re asking it to decide whether putting this item in the bag would produce a higher total worth than leaving it out, at the weight capacity of the current column. If this current item is a better choice, put it in - if not, leave it out. If the item doesn’t fit at this capacity at all, we carry over the value from the row above.

Here’s the code that accomplishes this:

// if weight of item matching this index can fit at the current capacity column...
if is[i-1].weight <= w {
    // best worth at this capacity without this item
    valueOne := float64(matrix[i-1][w])
    // this item's worth, plus the best worth at the capacity left after making room for it
    valueTwo := float64(is[i-1].worth + matrix[i-1][w-is[i-1].weight])
    // take maximum of either valueOne or valueTwo
    matrix[i][w] = int(math.Max(valueOne, valueTwo))
// if the item does not fit, carry over the previous worth
} else {
    matrix[i][w] = matrix[i-1][w]
}

This process of comparing item combinations will continue until every item has been considered at every possible stage of the bag’s increasing total weight. When all the above have been considered, we’ll have enumerated the solution space - filled the matrix - with all possible total worth values.

We’ll have a big chart of numbers, and in the last column at the last row we’ll have our highest possible value.
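With 40 items and 5585 columns the real chart is too big to print, so here is a sketch that fills the matrix for three made-up items and a capacity of 5. The function name `fillMatrix` is hypothetical, and it compares ints directly rather than going through math.Max, which gives the same values.

```go
package main

import "fmt"

type item struct {
	name          string
	worth, weight int
}

// fillMatrix builds the table described above: rows are items,
// columns are weight capacities, and each cell holds the best worth
// achievable with the first i items at capacity w.
func fillMatrix(is []item, capacity int) [][]int {
	matrix := make([][]int, len(is)+1)
	for i := range matrix {
		matrix[i] = make([]int, capacity+1)
	}
	for i := 1; i <= len(is); i++ {
		for w := 1; w <= capacity; w++ {
			matrix[i][w] = matrix[i-1][w] // leave this item out
			if is[i-1].weight <= w {
				with := is[i-1].worth + matrix[i-1][w-is[i-1].weight]
				if with > matrix[i][w] {
					matrix[i][w] = with // putting it in is better
				}
			}
		}
	}
	return matrix
}

func main() {
	// made-up items, small enough to print the whole table
	is := []item{{"a", 3, 2}, {"b", 4, 3}, {"c", 5, 4}}
	for _, row := range fillMatrix(is, 5) {
		fmt.Println(row)
	}
}
```

This prints one row per item, with the padded row of zeros first:

```
[0 0 0 0 0 0]
[0 0 3 3 3 3]
[0 0 3 4 4 7]
[0 0 3 4 5 7]
```

The 7 in the bottom-right corner is the highest possible worth: items a and b together, weighing exactly 5.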

A strictly representative representation of the filled matrix.

That’s great, but how do we find out which combination of items were put in the bag to achieve that worth?

Getting our optimized item list

To see which items combine to create our optimal packing list, we’ll need to examine our matrix in reverse to the way we created it. Since we know the highest possible value is in the last row in the last column, we’ll start there. To find the items, we:

Get the value of the current cell
Compare the value of the current cell to the value in the cell directly above it
If the values differ, there was a change to the bag items; find the next cell to examine by moving backwards through the columns according to the current item’s weight (find the value of the bag before this current item was added)
If the values match, there was no change to the bag items; move up to the cell in the row above and repeat

The nature of the action we’re trying to achieve lends itself well to a recursive function. If you recall from my previous article about making apple pie, recursive functions are simply functions that call themselves under certain conditions. Here’s what it looks like:

func checkItem(b *bag, i int, w int, is []item, matrix [][]int) {
    if i <= 0 || w <= 0 {
        return
    }

    pick := matrix[i][w]
    if pick != matrix[i-1][w] {
        b.addItem(is[i-1])
        checkItem(b, i-1, w-is[i-1].weight, is, matrix)
    } else {
        checkItem(b, i-1, w, is, matrix)
    }
}

Our checkItem() function calls itself if the condition we described in step 4 is true. If step 3 is true, it also calls itself, but with different arguments.

Recursive functions require a base case. In this example, we want the function to stop once we run out of values of worth to compare. Thus our base case is when either i or w is 0.
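If recursion isn’t to your taste, the same walk can be written as a loop. This is a sketch of an alternative, not the approach used in the rest of this article. The name `pickedItems` is hypothetical, and the matrix literal is the filled table for three made-up items at a capacity of 5.

```go
package main

import "fmt"

type item struct {
	name          string
	worth, weight int
}

// pickedItems walks the filled matrix from the bottom-right cell using
// a loop instead of recursion, and returns the items that were packed.
func pickedItems(is []item, matrix [][]int, capacity int) []item {
	var picked []item
	i, w := len(is), capacity
	for i > 0 && w > 0 {
		if matrix[i][w] != matrix[i-1][w] {
			// the value changed, so item i-1 is in the bag
			picked = append(picked, is[i-1])
			w -= is[i-1].weight
		}
		i-- // move up a row either way
	}
	return picked
}

func main() {
	is := []item{{"a", 3, 2}, {"b", 4, 3}, {"c", 5, 4}}
	matrix := [][]int{
		{0, 0, 0, 0, 0, 0},
		{0, 0, 3, 3, 3, 3},
		{0, 0, 3, 4, 4, 7},
		{0, 0, 3, 4, 5, 7},
	}
	fmt.Println(pickedItems(is, matrix, 5)) // [{b 4 3} {a 3 2}]
}
```

The loop stops under the same condition as the base case: when either the row or the remaining capacity reaches 0.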

Here’s how the dynamic programming approach looks when it’s all put together:

func checkItem(b *bag, i int, w int, is []item, matrix [][]int) {
    if i <= 0 || w <= 0 {
        return
    }

    pick := matrix[i][w]
    if pick != matrix[i-1][w] {
        b.addItem(is[i-1])
        checkItem(b, i-1, w-is[i-1].weight, is, matrix)
    } else {
        checkItem(b, i-1, w, is, matrix)
    }
}

func dynamic(is []item, b *bag) *bag {
    numItems := len(is)          // number of items in knapsack
    capacity := b.maxItemsWeight // capacity of knapsack

    // create the empty matrix
    matrix := make([][]int, numItems+1) // rows representing items
    for i := range matrix {
        matrix[i] = make([]int, capacity+1) // columns representing grams of weight
    }

    // loop through table rows
    for i := 1; i <= numItems; i++ {
        // loop through table columns
        for w := 1; w <= capacity; w++ {

            // if weight of item matching this index can fit at the current capacity column...
            if is[i-1].weight <= w {
                // best worth at this capacity without this item
                valueOne := float64(matrix[i-1][w])
                // this item's worth, plus the best worth at the capacity left after making room for it
                valueTwo := float64(is[i-1].worth + matrix[i-1][w-is[i-1].weight])
                // take maximum of either valueOne or valueTwo
                matrix[i][w] = int(math.Max(valueOne, valueTwo))
            // if the item does not fit, carry over the previous worth
            } else {
                matrix[i][w] = matrix[i-1][w]
            }
        }
    }

    checkItem(b, numItems, capacity, is, matrix)

    // add other statistics to the bag
    b.totalWorth = matrix[numItems][capacity]
    b.totalWeight = b.bagWeight + b.currItemsWeight

    return b
}

Dynamic programming results

We expect that the dynamic programming approach will give us a more optimized solution than the greedy algorithm. So did it? Here are the results:

Total weight of bag and items: 6982g

Total worth of packed items: 757

Here are the items our dynamic programming algorithm chose, sorted by worth:

Item	Worth	Weight
10 pairs thongs	39	80
5 Underarmour Strappy	38	305
1 pair Uniqlo leggings	37	185
2 Lululemon Cool Racerback	36	174
Chargers and cables in Mini Bomber Travel Kit	35	665
The Roost Stand	34	170
ThinkPad Compact Bluetooth Keyboard with trackpoint	33	460
Seagate Backup Plus Slim	32	159
1 pair black denim shorts	31	197
2 pairs Nike Pro shorts	30	112
2 pairs Lululemon shorts	29	184
Isabella T-Strap Croc sandals	28	200
2 Underarmour HeatGear CoolSwitch tank tops	27	138
5 pairs black socks	26	95
2 pairs Injinji Women’s Run Lightweight No-Show Toe Socks	25	54
1 fancy tank top	24	71
1 light and stretchy long-sleeve shirt (Gap Fit)	23	147
Uniqlo Ultralight Down insulating jacket	22	235
Patagonia Torrentshell	21	301
Lightweight Merino Wool Buff	20	50
1 LBD (H&M)	19	174
Field Notes Pitch Black Memo Book Dot-Graph	18	68
Innergie PocketCell USB-C 6000mAh power bank	17	148
Important papers	16	228
Deuter First Aid Kit Active	15	144
Stanley Classic Vacuum Camp Mug 16oz	14	454
JBL Reflect Mini Bluetooth Sport Headphones	13	14
Anker SoundCore nano Bluetooth Speaker	12	80
Oakley Latch Sunglasses	11	30
Ray Ban Wayfarer Classic	10	45
Petzl E+LITE Emergency Headlamp	8	27
Peak Design Cuff Camera Wrist Strap	6	26
Travelon Micro Scale	5	125
Humangear GoBites Duo	3	22

Our dynamic programming solution is an obvious improvement over what the greedy algorithm gave us. Our total worth of 757 is 41 points greater than the greedy algorithm’s solution of 716, and for a few grams less weight too!

Input sort order

While testing my dynamic programming solution, I implemented the Fisher-Yates shuffle algorithm on the input before passing it into my function, just to ensure that the answer wasn’t somehow dependent on the sort order of the input. Here’s what the shuffle looks like in Go:

rand.Seed(time.Now().UnixNano())

for i := range itemList {
    j := rand.Intn(i + 1)
    itemList[i], itemList[j] = itemList[j], itemList[i]
}

Of course I then realized that Go 1.10 now has a built-in shuffle… it works precisely the same way and looks like this:

rand.Shuffle(len(itemList), func(i, j int) {
    itemList[i], itemList[j] = itemList[j], itemList[i]
})

So did the order in which the items were processed affect the outcome? Well…

Suddenly… a rogue weight appears!

As it turns out, in a way, the answer did depend on the order of the input. When I ran my dynamic programming algorithm several times, I sometimes saw a different total weight for the bag, though the total worth remained at 757. I initially thought this was a bug before examining the two sets of items that accompanied the two different total weight values. Everything was the same except for a few changes that collectively added up to a different item subset accounting for 14 of the 757 worth points.

In this case, there were two equally optimal solutions based only on the success metric of the highest total possible worth. Shuffling the input seemed to affect the placement of the items in the matrix and thus, the path that the checkItem() function took as it went through the matrix to find the chosen items. Since the success metric of having the highest possible worth was the same in both item sets, we don’t have a single unique solution - there are two!

As an academic exercise, both these sets of items are correct answers. We may choose to optimize further by another metric, say, the total weight of all the items. The highest possible worth at the least possible weight could be seen as an ideal solution.
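One way to sketch that extra metric is to have each table cell remember the weight used as well as the worth, and to prefer the lighter combination when the worth is tied. This is an assumption about how it could be done, not something I ran for the results in this article. The names `cell`, `better` and `lightestBest` are hypothetical, and for brevity the sketch uses a single rolling row instead of the full matrix, so it reports only the totals and not the item list.

```go
package main

import "fmt"

type item struct {
	name          string
	worth, weight int
}

// cell records the best worth found for a capacity and the weight used to get it.
type cell struct {
	worth, weight int
}

// better reports whether a beats b: higher worth wins, and on equal
// worth the lighter combination wins.
func better(a, b cell) bool {
	if a.worth != b.worth {
		return a.worth > b.worth
	}
	return a.weight < b.weight
}

// lightestBest returns the highest worth that fits in capacity and,
// among combinations with that worth, the lowest total weight.
func lightestBest(is []item, capacity int) cell {
	best := make([]cell, capacity+1) // best[w] is the best cell for capacity w
	for _, it := range is {
		// go from high capacity to low so each item is used at most once
		for w := capacity; w >= it.weight; w-- {
			cand := cell{
				worth:  best[w-it.weight].worth + it.worth,
				weight: best[w-it.weight].weight + it.weight,
			}
			if better(cand, best[w]) {
				best[w] = cand
			}
		}
	}
	return best[capacity]
}

func main() {
	// the four items that differ between the two results in this article;
	// the capacity of 460 is made up so that both combinations fit
	is := []item{
		{"Stanley Classic Vacuum Camp Mug 16oz", 14, 454},
		{"Zip bag of toiletries", 9, 236},
		{"BlitzWolf Bluetooth Tripod/Monopod", 4, 150},
		{"Vapur Bottle 1L", 1, 41},
	}
	fmt.Println(lightestBest(is, 460)) // {14 427}
}
```

Both the mug on its own and the other three items together are worth 14, but the three items weigh 427g against the mug’s 454g, so the tie-break picks them regardless of input order.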

Here’s the second, lighter, dynamic programming result:

Total weight of bag and items: 6955g

Total worth of packed items: 757

Item	Worth	Weight
10 pairs thongs	39	80
5 Underarmour Strappy	38	305
1 pair Uniqlo leggings	37	185
2 Lululemon Cool Racerback	36	174
Chargers and cables in Mini Bomber Travel Kit	35	665
The Roost Stand	34	170
ThinkPad Compact Bluetooth Keyboard with trackpoint	33	460
Seagate Backup Plus Slim	32	159
1 pair black denim shorts	31	197
2 pairs Nike Pro shorts	30	112
2 pairs Lululemon shorts	29	184
Isabella T-Strap Croc sandals	28	200
2 Underarmour HeatGear CoolSwitch tank tops	27	138
5 pairs black socks	26	95
2 pairs Injinji Women’s Run Lightweight No-Show Toe Socks	25	54
1 fancy tank top	24	71
1 light and stretchy long-sleeve shirt (Gap Fit)	23	147
Uniqlo Ultralight Down insulating jacket	22	235
Patagonia Torrentshell	21	301
Lightweight Merino Wool Buff	20	50
1 LBD (H&M)	19	174
Field Notes Pitch Black Memo Book Dot-Graph	18	68
Innergie PocketCell USB-C 6000mAh power bank	17	148
Important papers	16	228
Deuter First Aid Kit Active	15	144
JBL Reflect Mini Bluetooth Sport Headphones	13	14
Anker SoundCore nano Bluetooth Speaker	12	80
Oakley Latch Sunglasses	11	30
Ray Ban Wayfarer Classic	10	45
Zip bag of toiletries	9	236
Petzl E+LITE Emergency Headlamp	8	27
Peak Design Cuff Camera Wrist Strap	6	26
Travelon Micro Scale	5	125
BlitzWolf Bluetooth Tripod/Monopod	4	150
Humangear GoBites Duo	3	22
Vapur Bottle 1L	1	41

Which approach is better?

Go benchmarking

The Go standard library’s testing package makes it straightforward for us to benchmark these two approaches. We can find out how long it takes each algorithm to run, and how much memory each uses. Here’s a simple main_test.go file:

package main

import (
    "testing"
)

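func Benchmark_greedy(b *testing.B) {
    // Load the item list once, then reset the timer so that reading
    // the CSV file is not counted in the measurement.
    itemList := readItems("objects.csv")
    b.ResetTimer()
    for i := 0; i < b.N; i++ {
        // Start every run with a fresh, empty bag.
        minaal := bag{bagWeight: 1415, currItemsWeight: 0, maxItemsWeight: 5585}
        greedy(itemList, minaal)
    }
}

func Benchmark_dynamic(b *testing.B) {
    // Same setup as above, excluded from the timed section.
    itemList := readItems("objects.csv")
    b.ResetTimer()
    for i := 0; i < b.N; i++ {
        // Start every run with a fresh, empty bag.
        minaal := bag{bagWeight: 1415, currItemsWeight: 0, maxItemsWeight: 5585}
        dynamic(itemList, &minaal)
    }
}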

We can run go test -bench=. -benchmem to see these results:

Benchmark_greedy-4       1000000              1619 ns/op            2128 B/op          9 allocs/op
Benchmark_dynamic-4         1000           1545322 ns/op         2020332 B/op         49 allocs/op

Greedy algorithm performance

After running the greedy algorithm 1,000,000 times, the speed of the algorithm was reliably measured to be 0.001619 milliseconds (translation: very fast). It required 2128 Bytes or 2-ish kilobytes of memory and 9 distinct memory allocations per iteration.

Dynamic programming performance

The dynamic programming algorithm was run 1,000 times. Its speed was measured to be 1.545322 milliseconds or 0.001545322 seconds (translation: still pretty fast). It required 2,020,332 Bytes or 2-ish Megabytes, and 49 distinct memory allocations per iteration.

The verdict

Part of choosing the right approach to solving any programming problem is taking into account the size of the input data set. In this case, it’s a small one. In this scenario, a one-pass greedy algorithm will always be faster and less resource-needy than dynamic programming, simply because it has fewer steps. Our greedy algorithm was almost three orders of magnitude faster and less memory-hungry than our dynamic programming algorithm: roughly 950 times on both counts.

Not having those extra steps, however, means that getting the best possible solution from the greedy algorithm is unlikely.

It’s clear that the dynamic programming algorithm gave us better numbers: a lower weight, and higher overall worth.

	Greedy algorithm	Dynamic programming
Total weight:	6987g	6955g
Total worth:	716	757

Where dynamic programming on small data sets lacks in performance, it makes up in optimization. The question then becomes whether that additional optimization is worth the performance cost.

“Better,” of course, is a subjective judgement. If speed and low resource usage are our success metrics, then the greedy algorithm is clearly better. If the total worth of items in the bag is our success metric, then dynamic programming is clearly better. However, our scenario is a practical one, and neither of these algorithm designs returned an answer I’d be entirely happy with. In optimizing for the overall greatest possible total worth of the items in the bag, the dynamic programming algorithm left out my highest-worth, but also heaviest, item: my laptop. The chargers and cables, Roost stand, and keyboard that were included aren’t much use without it.

Better algorithm design

There’s a simple way to alter the dynamic programming approach so that the laptop is always included: we can modify the data so that the worth of the laptop is greater than the sum of the worth of all the other items. (Try it out!)
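Here's a rough sketch of that data tweak. The item type and the prioritize() function are hypothetical names for illustration, not part of the program in this post. The idea is that once one item is worth more than everything else combined, any selection containing it beats any selection without it, as long as the item fits in the bag at all.

```go
package main

import "fmt"

// item is a hypothetical stand-in for one row of the packing list.
type item struct {
	name   string
	worth  int
	weight int
}

// prioritize returns a copy of items in which the named item's worth is
// one more than the combined worth of every other item.
func prioritize(items []item, name string) []item {
	out := make([]item, len(items))
	copy(out, items)

	// Add up the worth of everything except the must-have item.
	sumOthers := 0
	for _, it := range out {
		if it.name != name {
			sumOthers += it.worth
		}
	}

	// Outbid all the other items put together.
	for i := range out {
		if out[i].name == name {
			out[i].worth = sumOthers + 1
		}
	}
	return out
}

func main() {
	items := []item{
		{"laptop", 40, 1500},
		{"keyboard", 33, 460},
		{"stand", 34, 170},
	}
	for _, it := range prioritize(items, "laptop") {
		fmt.Println(it.name, it.worth, it.weight)
	}
}
```

The weights in main() are placeholders, not the values from my item list.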

Perhaps in re-designing the dynamic programming algorithm to be more practical, we might choose another success metric that better reflects an item’s importance, instead of a subjective worth value. There are many possible metrics we can use to represent the value of an item. Here are a few examples of a good proxy:

Amount of time spent using the item
Initial cost of purchasing the item
Cost of replacement if the item were lost today
Dollar value of the product of using the item

By the same token, the greedy algorithm’s results might be improved with the use of one of these alternate metrics.

On top of choosing an appropriate approach to solving the knapsack problem in general, it is helpful to design our algorithm in a way that translates the practicalities of a scenario into code.

There are many considerations for better algorithm design beyond the scope of this introductory post. One of these is time complexity, and I’ve written about it here. A future algorithm may very well decide my bag’s contents on the next trip, but we’re not quite there yet. Stay tuned!

Why I'm automatically deleting my old tweets using AWS Lambda

2018-04-12T06:37:48-06:00

From now on, my tweets are ephemeral. Here’s why I’m deleting all my old tweets and the AWS Lambda function that does it for free.

Stuff and opinions

I’ve only been a one-bag nomad for a little over a year and a half. Before that, I lived as most people do in an apartment or a house. I owned furniture, more clothing than I strictly needed, and enough “stuff” to fill at least a few moving boxes. If I went to live somewhere else, moving for school or family or work, I packed up all my things and brought them with me. Over the years, I accumulated more and more stuff.

Adopting what many would call a minimalist lifestyle has rapidly changed a lot of my longstanding views. Giving away all my stuff (an idea I once thought to be interesting in principle but practically a little bit ridiculous) has become normal. It’s normal for me, now, to not own things that I don’t use on a regular basis. I don’t keep wall shelves packed with old books or dishes or clothing or childhood toys because those items aren’t relevant to me anymore. I just keep fond memories, instead.

Imagine, for a moment, that I still lived in a house. Imagine that in that house, on the fridge, is a drawing I made when I was six-years-old. In the bottom right corner of that drawing scribbled in green crayon are the words “broccoli is dumb - Victoria, Age 6.”

If you were in my house and saw that drawing on the fridge, would you assume that the statement “broccoli is dumb” comprised an accurate and current account of my opinions on broccoli? Of course not. I was six when I wrote that. I’ve had plenty of time to change my mind.

I have a friend whom I’ve known since we were both in kindergarten. We went through grade school together, then spoke to and saw each other on infrequent occasions across the years. We’re both adults now. Sometimes when we chat, we’ll recall some amusing memory from when we were younger. The nature of memory being what it is, I have no illusion that what we recall is recounted with much accuracy. Our impressions of things that happened - mistakes we made and moments of victory alike - are coloured by the experiences we’ve had since then, and all the things we’ve learned. An awkward moment at a school colleague’s birthday party becomes an example of a child learning to socialize, instead of the world-ending moment of embarrassment it probably felt like at the time.

This is how memory works. In a sense, it gets updated, as well it should. People living in small communities remember things that their neighbour did many years ago, but recall them in the context of who their neighbour is now, and what their current relationship is like. This re-colouring of history is an important part of how people heal, make good decisions, and socialize.

Social media does not do this. Your perfectly preserved tweet from five days or five years ago can be recalled with absolute accuracy. For most people, this is not particularly worrying. We tend to tweet about pretty mundane things - things that pop into mind when we’re bored and want someone to notice us. Individually, usually, our old tweets are pretty insignificant. In aggregate, however, they paint a pretty complete picture of a person’s random, unintentionally telling thoughts. This is the problem.

The assumption made of things written in social media and on Twitter specifically is a very different assumption than you might make about someone’s notepad scribble from last week. I’m not endeavoring to speculate why - I’ve just seen enough cases of someone getting publicly flogged for something they posted years ago to know that it does happen. This is weird. If you wouldn’t assume that a notepad scribble from last week or a crayon drawing from decades ago reflects the essence of who someone is now, why would you assume that an old tweet does?

You are not the same person you were last month - you’ve seen things, read things, understood and learned things that have, in some small way, changed you. While a person may have the same sense of self and identity through most of their life, even this grows and changes over the years. We change our opinions, our desires, our habits. We are not stagnant beings, and we should not let ourselves be represented as such, however unintentionally.

Ephemeral tweets

If you look at my Twitter profile page today, you’ll see fewer tweets there than you have fingers (I hope). I’m using ephemeral - a lightweight utility I wrote for use on AWS Lambda - to delete all my tweets older than a few days. I’m doing this for the same reason that I don’t hang on to stuff that I no longer use - that stuff isn’t relevant to me anymore. It doesn’t represent me, either.

The code that makes up ephemeral is written in Go. AWS Lambda creates an environment for each Lambda function, so ephemeral utilizes environment variables for your private Twitter API keys and the maximum age of the tweets you want to keep, represented in hours, like 72h.

var (
	consumerKey       = getenv("TWITTER_CONSUMER_KEY")
	consumerSecret    = getenv("TWITTER_CONSUMER_SECRET")
	accessToken       = getenv("TWITTER_ACCESS_TOKEN")
	accessTokenSecret = getenv("TWITTER_ACCESS_TOKEN_SECRET")
	maxTweetAge       = getenv("MAX_TWEET_AGE")
	logger            = log.New()
)

func getenv(name string) string {
	v := os.Getenv(name)
	if v == "" {
		panic("missing required environment variable " + name)
	}
	return v
}

The program uses the anaconda library. It fetches your timeline up to the Twitter API’s limit of 200 tweets per request, then compares each tweet’s date of creation to your MAX_TWEET_AGE variable to decide whether it’s old enough to be deleted. After deleting all the expired tweets, the Lambda function terminates.

func deleteFromTimeline(api *anaconda.TwitterApi, ageLimit time.Duration) {
	timeline, err := getTimeline(api)
	if err != nil {
		// Without a timeline there is nothing to check, so stop here.
		log.Error("Could not get timeline: ", err)
		return
	}
	for _, t := range timeline {
		createdTime, err := t.CreatedAtTime()
		if err != nil {
			log.Error("Couldn't parse time ", err)
			continue
		}
		// Skip tweets that are still younger than the age limit.
		if time.Since(createdTime) <= ageLimit {
			continue
		}
		if _, err := api.DeleteTweet(t.Id, true); err != nil {
			log.Error("Failed to delete! ", err)
			continue
		}
		// Only report a deletion once the API call has succeeded.
		log.Info("DELETED: Age - ", time.Since(createdTime).Round(1*time.Minute), " - ", t.Text)
	}
	log.Info("No more tweets to delete.")
}

Read the full code here.

For a use case like this, AWS Lambda has a free tier that costs nothing. If you’re any level of developer, it’s an extremely useful tool to become familiar with. For a full walkthrough with screenshots of how to set up a Lambda function that tweets for you, you can read this article. The setup for ephemeral is the same; it just has the opposite function. :)

I forked ephemeral from Adam Drake’s Harold, a Twitter tool that has many useful functions beyond keeping your timeline trimmed. If you have more than 200 tweets to delete at first pass, please use Harold to do that first. You can run Harold with the deletetimeline flag from your terminal.

For sentimental reasons, you may want to download all your tweets before deleting them.

Why use Twitter at all?

In anticipation of the question, let me say that yes, I do use Twitter besides just as a bucket for my Lambda functions to fill and empty. It has its benefits, most related to what I perceive to be its original intended purpose: to be a means of near-instant communication for short, digestible pieces of information reaching a widespread pool of people.

I use it as a way to keep tabs on what’s happening right now. I use it to comment on, joke about, and commiserate with things tweeted by the people I follow right now. By keeping my timeline restricted to only the most recent few days, I feel like I’m using Twitter more like it was meant to be used: a way to join the conversation and see what’s happening in the world right now - instead of just another place to amass more “stuff.”

Running a free Twitter bot on AWS Lambda

2018-03-05T10:29:15-05:00

If you read About time, you’ll know that I’m a big believer in spending time now on building things that save time in the future. To this end I built a simple Twitter bot in Go that would occasionally post links to my articles and keep my account interesting even when I’m too busy to use it. The tweets help drive traffic to my sites, and I don’t have to lift a finger.

I ran the bot on an Amazon EC2 instance for about a month. My AWS usage has historically been pretty inexpensive (less than the price of a coffee in most of North America), so I was surprised when the little instance I was using racked up a bill 90% bigger than the month before. I don’t think AWS is expensive, to be clear, but still… I’m cheap. I want my Twitter bot, and I want it for less.

I’d been meaning to explore AWS Lambda, and figured this was a good opportunity. Unlike an EC2 instance that is constantly running (and charging you for it), Lambda charges you per request and according to the duration of time your function takes to run. There’s a free tier, too: the first 1 million requests, plus a certain amount of compute time, are free. Roughly translated to running a Twitter bot that posts for you, say, twice a day, your monthly cost for using Lambda would total… carry the one… nothing. I’ve been running my Lambda function for a couple weeks now, completely free.
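If you'd like to carry the one yourself, here's the arithmetic as a tiny sketch. It only looks at the request count against the 1 million free requests mentioned above. Compute time is billed separately and isn't modelled here, and the function and constant names are made up for illustration.

```go
package main

import "fmt"

// freeRequests is the monthly free-tier request allowance mentioned above.
const freeRequests = 1_000_000

// monthlyRequests returns how many times a function runs in a month
// when it is triggered runsPerDay times a day.
func monthlyRequests(runsPerDay, daysInMonth int) int {
	return runsPerDay * daysInMonth
}

func main() {
	used := monthlyRequests(2, 31)
	share := float64(used) / freeRequests * 100
	fmt.Printf("%d of %d free requests used (%.4f%%)\n", used, freeRequests, share)
}
```

A bot that tweets twice a day makes about 62 requests in a long month, which is a rounding error next to the allowance.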

When it recently fell to me to take the reins of the @freeCodeCampTO Twitter, I decided to employ a similar strategy, and also use this opportunity to document the process for you, dear reader.

So if you’re currently using a full-time running instance for a task that could be served by a cron job, this is the article for you. I’ll cover how to write your function for Lambda, how to get it set up to run automatically, and as a sweet little bonus, a handy bash script that updates your function from the command line whenever you need to make a change. Let’s do it!

Is Lambda right for you

When I wrote the code for my Twitter bot in Go, I intended to have it run on an AWS instance and borrowed heavily from Francesc’s awesome Just for Func episode. Some time later I modified it to randomly choose an article from my RSS feeds and tweet the link, twice a day. I wanted to do something similar for the @freeCodeCampTO bot, and have it tweet an inspiring quote about programming every morning.

This is a good use case for Lambda because:

The program should execute once
It runs on a regular schedule, using time as a trigger
It doesn’t need to run constantly

The important thing to keep in mind is that Lambda runs a function once in response to an event that you define. The most widely applicable trigger is a simple cron expression, but there are many other trigger events you can hook up. You can get an overview here.

Write a Lambda function

I found this really straightforward to do in Go. First, grab the aws-lambda-go library:

go get github.com/aws/aws-lambda-go/lambda

Then make this your func main():

func main() {
 lambda.Start(tweetFeed)
}

Where tweetFeed is the name of the function that makes everything happen. While I won’t go into writing the whole Twitter bot here, you can view my code on GitHub.

Setting up AWS Lambda

I’m assuming you already have an AWS account. If not, first things first here: https://aws.amazon.com/free

1. Create your function

Find AWS Lambda in the list of services, then look for this shiny button:

We’re going to author a function from scratch. Name your function, then under Runtime choose “Go 1.x”.

Under Role name write any name you like. It’s a required field but irrelevant for this use case.

Click Create function.

2. Configure your function

You’ll see a screen for configuring your new function. Under Handler enter the name of your Go program.

If you scroll down, you’ll see a spot to enter environment variables. This is a great place to enter the Twitter API tokens and secrets, using the variable names that your program expects. The AWS Lambda function will create the environment for you using the variables you provide here.

No further settings are necessary for this use case. Click Save at the top of the page.

3. Upload your code

You can upload your function code as a zip file on the configuration screen. Since we’re using Go, you’ll want to go build first, then zip the resulting executable before uploading that to Lambda.

…Of course I’m not going to do that manually every time I want to tweak my function. That’s what awscli and this bash script are for!

update.sh

go build && \
zip fcc-tweet.zip fcc-tweet && \
rm fcc-tweet && \
aws lambda update-function-code --function-name fcc-tweet --zip-file fileb://fcc-tweet.zip && \
rm fcc-tweet.zip

Now whenever I make a tweak, I just run bash update.sh.

If you’re not already using AWS Command Line Interface, do pip install awscli and thank me later. Find instructions for getting set up and configured in a few minutes here under Quick Configuration.

4. Test your function

Wanna see it go? Of course you do! Click “Configure test events” in the dropdown at the top.

Since you’ll use a time-based trigger for this function, you don’t need to enter any code to define test events in the popup window. Simply write any name under Event name and empty the JSON in the field below. Then click Create.

Click Test at the top of the page, and if everything is working correctly you should see…

5. Set up CloudWatch Events

To run our function as we would a cron job - as a regularly scheduled time-based event - we’ll use CloudWatch. Click CloudWatch Events in the Designer sidebar.

Under Configure triggers, you’ll create a new rule. Choose a descriptive name for your rule without spaces or punctuation, and ensure Schedule expression is selected. Then input the time you want your program to run as a rate expression, or cron expression.

A cron expression looks like this: cron(0 12 * * ? *)

Minutes	Hours	Day of month	Month	Day of week	Year	In English
0	12	*	*	?	*	Run at noon (UTC) every day

For more on how to write your cron expressions, read this.

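If you want your program to run twice a day, say once at 10am and again at 3pm, you can list both hours in one expression, such as cron(0 10,15 * * ? *). If the two runs need different minute values, you’ll need to set two separate CloudWatch Events triggers and cron expression rules.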
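If you end up creating several schedules, it can help to generate the expressions instead of typing them by hand. Here's a small sketch with a hypothetical dailyCron() helper that fills in the minutes and hours fields and leaves the rest as "every day": day of month and month as *, day of week as ?, and year as *.

```go
package main

import (
	"errors"
	"fmt"
)

// dailyCron builds a schedule expression that fires once a day at the
// given UTC hour and minute.
func dailyCron(hourUTC, minute int) (string, error) {
	if hourUTC < 0 || hourUTC > 23 || minute < 0 || minute > 59 {
		return "", errors.New("hour must be 0-23 and minute 0-59")
	}
	// Field order: minutes, hours, day of month, month, day of week, year.
	return fmt.Sprintf("cron(%d %d * * ? *)", minute, hourUTC), nil
}

func main() {
	for _, hour := range []int{10, 15} {
		expr, err := dailyCron(hour, 0)
		if err != nil {
			fmt.Println("error:", err)
			continue
		}
		fmt.Println(expr)
	}
}
```

Remember that the times are in UTC, so adjust the hours for your own time zone.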

Click Add.

Watch it go

That’s all you need to get your Lambda function up and running! Now you can sit back, relax, and do more important things than share your RSS links on Twitter.

Moving to a new domain without breaking old links with AWS & Disqus

2018-01-10T08:56:20-05:00

I started blogging about my nomadic travels last year, and so far the habit has stuck. Like all side projects, I won’t typically invest heavily in setting up web properties before I can be reasonably certain that such an investment is worth my time or enjoyment. In other words: don’t buy the domain until you’ve proven to yourself that you’ll stick with it!

After some months of regular posting I felt I was ready to commit (short courtship, I know, but we’re all adults here) and I bought a dedicated domain, herOneBag.com.

Up until recently, my #NomadLyfe blog was just a subdirectory of my main personal site. Now it’s all grown up and ready to strike out into the world alone! Here’s the setup for the site:

Static site in Amazon Web Services S3 bucket
Route 53 handling the DNS
CloudFront for distribution and a custom SSL certificate
Disqus for comments

If you’d like a walk-through for how to set up a new domain with this structure, it’s over here: Hosting your static site with AWS S3, Route 53, and CloudFront. In this post, I’ll just detail how I managed to move my blog to the new site without breaking the old links or losing any comments.

Preserve old links with redirection rules

I wanted to avoid breaking links that have been posted around the web by forwarding visitors to the new URL. The change looks like this:

Old URL: https://victoria.dev/meta/5-bag-lessons/

New URL: https://heronebag.com/blog/5-bag-lessons/

You can see that the domain name as well as the subdirectory have changed, but the slug for the blog post remains the same. (I love static sites.)

To redirect links from the old site, we’ll need to set redirection rules in the old site’s S3 bucket. AWS provides a way to set up a conditional redirect. This is set in the “Redirection rules” section of your S3 bucket’s properties, under “Static website hosting.” You can find the documentation here.

There are a few examples given, but none that represent the redirect I want. In addition to changing the prefix of the object key, we’re also changing the domain. The latter is achieved with the <HostName> tag.

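To redirect requests for the old blog URL to the new top level domain, we’ll use the code below.

<RoutingRules>
  <RoutingRule>
    <Condition>
      <KeyPrefixEquals>oldblog/</KeyPrefixEquals>
    </Condition>
    <Redirect>
      <HostName>newdomain.com</HostName>
      <ReplaceKeyPrefixWith>newblog/</ReplaceKeyPrefixWith>
    </Redirect>
  </RoutingRule>
</RoutingRules>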

This rule ensures that requests for olddomain.com/oldblog/specific-blog-post will redirect to newdomain.com/newblog/specific-blog-post.
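If it helps to see the rule's logic spelled out, here's a small simulation of what it does to an incoming object key. This is only a sketch for reasoning about the rule, not something S3 runs: the rule type and its apply() method are hypothetical, and the field names are borrowed from S3's routing rule elements.

```go
package main

import (
	"fmt"
	"strings"
)

// rule mirrors the three values the redirect depends on.
type rule struct {
	keyPrefixEquals      string // prefix the old key must start with
	hostName             string // domain to redirect to
	replaceKeyPrefixWith string // prefix to use on the new domain
}

// apply returns the redirect target for key, or false when the rule
// does not match and the request is served as usual.
func (r rule) apply(key string) (string, bool) {
	if !strings.HasPrefix(key, r.keyPrefixEquals) {
		return "", false
	}
	rest := strings.TrimPrefix(key, r.keyPrefixEquals)
	return r.hostName + "/" + r.replaceKeyPrefixWith + rest, true
}

func main() {
	r := rule{
		keyPrefixEquals:      "oldblog/",
		hostName:             "newdomain.com",
		replaceKeyPrefixWith: "newblog/",
	}
	for _, key := range []string{"oldblog/specific-blog-post", "about/"} {
		if target, ok := r.apply(key); ok {
			fmt.Println(key, "->", target)
		} else {
			fmt.Println(key, "-> no redirect")
		}
	}
}
```

Anything outside the old blog's prefix is left alone, which is what lets the rest of the old site keep working.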

Migrate Disqus comments

Disqus provides a tool for migrating the comment threads from your old blog site to the new one. You can find it in your Disqus admin tools at your-short-name.disqus.com/admin/discussions/migrate/.

To migrate posts from the old blog address to the new one, we’ll use the URL mapper tool. Click “Start URL mapper,” then “you can download a CSV here.”

Disqus has decent instructions for how this tool works, and you can read them here. Basically, you’ll input the new blog URLs into the second column of the CSV file you downloaded, then pass it back to Disqus to process. If you’re using a program to edit the CSV, be sure to save the resulting file in CSV format.

Unless you have a bazillion URLs, the tool works pretty quickly, and you’ll get an email when it’s finished. Don’t forget to update the name of your site in the Disqus admin, too.

Transfer other settings

Update links in your social profiles and any other sites you may have around the web. If you’re using other services attached to your website like Google Analytics or IFTTT, don’t forget to update those details too!

A Unicode substitution cipher algorithm

2018-01-06T20:00:28-05:00

Full transparency: I occasionally waste time messing around on Twitter. (Gasp! Shock!) One of the ways I waste time messing around on Twitter is by writing my name in my profile with different Unicode character “fonts,” 𝖑𝖎𝖐𝖊 𝖙𝖍𝖎𝖘 𝖔𝖓𝖊. I previously did this by searching for different Unicode characters on Google, then one-by-one copying and pasting them into the “Name” field on my Twitter profile. Since this method of wasting time was a bit of a time waster, I decided (in true programmer fashion) to write a tool that would help me save some time while wasting it.

I originally dubbed the tool “uni-pretty,” (based on LEGO’s Unikitty from a movie – a pun that absolutely no one got) but have since renamed it fancy unicode. It builds from this GitHub repo. It lets you type any characters into a field and then converts them into Unicode characters that also represent letters, giving you fancy “fonts” that override a website’s CSS, like in your Twitter profile. (Sorry, Internet.)

The tool’s first naive iteration existed for about twenty minutes while I copy-pasted Unicode characters into a data structure. This approach of storing the characters in the JavaScript file, called hard-coding, is fraught with issues. Besides having to store every character from every font style, it’s painstaking to build, hard to update, and more code means it’s susceptible to more possible errors.

Fortunately, working with Unicode means that there’s a way to avoid the whole mess of having to store all the font characters: Unicode numbers are sequential. More importantly, the special characters in Unicode that could be used as fonts (meaning that there’s a matching character for most or all of the letters of the alphabet) are always in the following sequence: capital A-Z, lowercase a-z.

For example, in the fancy Unicode above, the lowercase letter “L” character has the Unicode number U+1D591 and HTML code &#120209;. The next letter in the sequence, a lowercase letter “M,” has the Unicode number U+1D592 and HTML code &#120210;. Notice how the numbers in those codes increment by one.

Why’s this relevant? Since each special character can be referenced by a number, and we know that the order of the sequence is always the same (capital A-Z, lowercase a-z), we’re able to produce any character simply by knowing the first number of its font sequence (the capital “A”). If this reminds you of anything, you can borrow my decoder pin.

In cryptography, the Caesar cipher (or shift cipher) is a simple method of encryption that utilizes substitution of one character for another in order to encode a message. This is typically done using the alphabet and a shift “key” that tells you which letter to substitute for the original one. For example, if I were trying to encode the word “cat” with a right shift of 3, it would look like this:

c a t
f d w

With this concept, encoding our plain text letters as a Unicode “font” is a simple process. All we need is an array to reference our plain text letters with, and the first index of our Unicode capital “A” representation. Since some Unicode numbers also include letters (which are sequential, but an unnecessary complication) and since the intent is to display the page in HTML, we’ll use the HTML code number &#120172;, with the extra bits (the &# and the ;) removed for brevity.

var plain = ['A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z', 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z'];

var fancyA = 120172;

Since we know that the letter sequence of the fancy Unicode is the same as our plain text array, any letter can be found by using its index in the plain text array as an offset from the fancy capital “A” number. For example, capital “B” in fancy Unicode is the capital “A” number, 120172 plus B’s index, which is 1: 120173.

Here’s our conversion function:

function convert(string) {
    // Create a variable to store our converted letters
    let converted = [];
    // Break string into substrings (letters)
    let arr = string.split('');
    // Search plain array for indexes of letters
    arr.forEach(element => {
        let i = plain.indexOf(element);
        // If the letter isn't a letter (not found in the plain array)
        if (i == -1) {
            // Return as a whitespace
            converted.push(' ');
        } else {
            // Get relevant character from fancy number + index
            let unicode = fancyA + i;
            // Return as HTML code
            converted.push('&#' + unicode + ';');
        }

    });
    // Print the converted letters as a string
    console.log(converted.join(''));
}

A neat possibility for this method of encoding requires a departure from my original purpose, which was to create a human-readable representation of the original string. If the purpose was instead to produce a cipher, this could be done by using any Unicode index in place of fancyA as long as the character indexed isn’t a representation of a capital “A.”

Here’s the same code set up with a simplified plain text array, and a non-letter-representation Unicode key:

var plain = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z'];

var key = 9016;

You might be able to imagine that decoding a cipher produced by this method would be relatively straightforward, once you knew the encoding secret. You’d simply need to subtract the key from the HTML code numbers of the encoded characters, then find the relevant plain text letters at the remaining indexes.
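Here's a sketch of that round trip, written in Go rather than the JavaScript used above, and working on Unicode code points directly instead of HTML code strings. The encode() and decode() function names are made up for illustration. It uses the simplified lowercase-only alphabet and the key from the snippet above, and like the conversion function, it turns anything that isn't a letter into a space.

```go
package main

import (
	"fmt"
	"strings"
)

// plain is the simplified lowercase-only alphabet.
const plain = "abcdefghijklmnopqrstuvwxyz"

// encode replaces each lowercase letter with the code point key + index.
// Anything not found in plain becomes a space.
func encode(s string, key rune) string {
	var b strings.Builder
	for _, r := range s {
		i := strings.IndexRune(plain, r)
		if i == -1 {
			b.WriteRune(' ')
			continue
		}
		b.WriteRune(key + rune(i))
	}
	return b.String()
}

// decode subtracts the key from each code point and looks up the plain
// letter at the remaining index. Anything out of range becomes a space.
func decode(s string, key rune) string {
	var b strings.Builder
	for _, r := range s {
		i := int(r - key)
		if i < 0 || i >= len(plain) {
			b.WriteRune(' ')
			continue
		}
		b.WriteByte(plain[i])
	}
	return b.String()
}

func main() {
	const key = 9016
	secret := encode("drink your ovaltine", key)
	fmt.Println(secret)
	fmt.Println(decode(secret, key))
}
```

Anyone holding the same key and alphabet can read the message, which is exactly why this makes a fun toy and a terrible way to keep real secrets.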

Well, that’s it for today. Be sure to drink your Ovaltine and we’ll see you right here next Monday at 5:45!

Oh, and… ⍔⍠⍟⍘⍣⍒⍥⍦⍝⍒⍥⍚⍠⍟⍤ ⍒⍟⍕ ⍨⍖⍝⍔⍠⍞⍖ ⍥⍠ ⍥⍙⍖ ⍔⍣⍪⍡⍥⍚⍔ ⍦⍟⍚⍔⍠⍕⍖ ⍤⍖⍔⍣⍖⍥ ⍤⍠⍔⍚⍖⍥⍪

Hosting your static site with AWS S3, Route 53, and CloudFront

2017-12-13T20:46:12-05:00

Some time ago I decided to stop freeloading on GitHub Pages and move one of my sites to Amazon Web Services (AWS). It turns out that I’m still mostly freeloading (yay free tier) so it amounted to a learning experience. Here are the components that let me host and serve the site at my custom domain with HTTPS.

Static site in Amazon Web Services S3 bucket
Route 53 handling the DNS
CloudFront for distribution and a custom SSL certificate

I set all that up most of a year ago. At the time, I found the AWS documentation to be rather fragmented and inconvenient to follow - it was hard to find what you were looking for without knowing what a specific setting might be called, or where it was, or if it existed at all. When I recently set up a new site and stumbled through this process again, I didn’t find it any easier. Hopefully this post can help to collect the relevant information into a more easily followed process and serve as an accompanying guide to save future me (and you) some time.

Rather than replace existing documentation, this post is meant to supplement it. Think of me as your cool tech-savvy friend on the phone with you at 4am, troubleshooting your website. (Please don’t actually call me at 4am.) I’ll walk through the set up while providing links for the documentation that was ultimately helpful (mostly so I can find it again later…).

Hosting a static site with Amazon S3 and a custom domain

If you’re starting from scratch, you’ll need an AWS account. It behooves you to get one, even if you don’t like paying for services - there’s a free tier that will cover most of the experimental stuff you’re going to want to do in the first year, and even the things I do pay for cost me less than a dollar a month. You can sign up at https://aws.amazon.com/free.

Getting your static site hosted and available at your custom domain is your first mission, should you choose to accept it. Your instructions are here.

Creating the buckets for site hosting on S3 is the most straightforward part of this process in my opinion, and the AWS documentation walkthrough covers what you’ll need to do quite well. It gets a little unclear around Step 3: Create and Configure Amazon Route 53 Hosted Zone, so come back and read on once you’ve reached that point. I’ll make some tea in the meantime.

… 🎶 🎵

Ready? Cool. See, I’m here for you.

Set up Route 53

The majority of the work in this section amounts to creating the correct record sets for your custom domain. If you’re already familiar with how record sets work, the documentation is a bit of a slog. Here’s how it should look when you’re finished:

The “NS” and “SOA” records are created automatically for you. The only records you need to create are the “A” records.

Hop over to Route 53 and follow this walkthrough to create a “hosted zone.” The values of the NS (Name Servers) records are what you’ll have to provide to your domain name registrar. Your registrar is wherever you bought your custom domain, such as this super subtle Namecheap.com affiliate link right here. (Thanks for your support! 😊)

If you created two buckets in the first section (one for yourdomain.com and one for www.yourdomain.com), you’ll need two separate A records in Route 53. Initially, these have the value of the endpoints for your matching S3 buckets (looks like s3-website.us-east-2.amazonaws.com). Later, you’ll change them to your CloudFront domain name.

If you went with Namecheap as your registrar, Step 4 looks like this:

Waiting is the hardest part… I’ve gotten into the habit of working on another project or setting up the DNS change before going to bed so that changes have time to propagate without me feeling like I need to fiddle with it. ^^;

When the transfer’s ready, you’ll see your site at http://yourdomain.com. Next, you’ll want to set up CloudFront so that becomes https://yourdomain.com.

Set up CloudFront and SSL

Here are the instructions for setting up CloudFront. There are a few important points to make sure you don’t miss on the “Create Distribution” page:

Origin Domain Name: Make sure to use your S3 bucket endpoint, and not select the bucket from the dropdown menu that appears.
Viewer Protocol Policy: If you want requests for http://yourdomain.com to always result in https://yourdomain.com, choose “Redirect HTTP to HTTPS.”
Alternate Domain Names: Enter yourdomain.com and www.yourdomain.com on separate lines.
SSL Certificate: See below.
Default Root Object: Enter the name of the html file that should be returned when your users go to https://yourdomain.com. This is usually “index.html”.

SSL Certificate

To show your content with HTTPS at your custom domain, you’ll need to choose “Custom SSL Certificate.” You can easily get an SSL Certificate with AWS Certificate Manager. Click on “Request or Import a Certificate with ACM” to get started in a new window.

Here are instructions for setting up a certificate. I don’t think they’re very good, personally. Don’t worry, I got you.

To account for “www.yourdomain.com” as well as any subdomains, you’ll want to add two domain names to the certificate, like so:

Click “Next.” You’ll be asked to choose a validation method. Choose “DNS validation” and click “Review.” If everything is as it should be, click “Confirm and request.”

You’ll see a page, “Validation” that looks like this. You’ll have to click the little arrow next to both domain names to get the important information to show:

Under both domain names, click the button for “Create record in Route 53.” This will automatically create a CNAME record set in Route 53 with the given values, which ACM will then check in order to validate that you own those domains. You could create the records manually, if you wanted to for some reason. I don’t know, maybe you’re killing time. ¯\_(ツ)_/¯

Click “Continue.” You’ll see a console that looks like this:

It may take some time for the validation to complete, at which point the “Pending validation” status will change to “Issued.” Again with the waiting. You can close this window to return to the CloudFront setup. Once the certificate is validated, you’ll see it in the dropdown menu under “Custom SSL Certificate.” You can click “Create Distribution” to finish setting up CloudFront.

In your CloudFront Distributions console, you’ll see “In Progress” until AWS has done its thing. Once it’s done, it’ll change to “Deployed.”

One last thing

Return to your Route 53 console and click on “Hosted zones” in the sidebar, then your domain name from the list. For both A records, change the “Alias Target” from the S3 endpoint to your CloudFront distribution domain, which should look something like dj4p1rv6mvubz.cloudfront.net. It appears in the dropdown after you clear the field.

You’re done

Well, usually. If you navigate to your new HTTPS domain and don’t see your beautiful new site where it should be, here are some things you can do:

Check S3 bucket policy - ensure that the bucket for yourdomain.com in the S3 console shows “Public” in the “Access” column.
Check S3 bucket index document - look in the “metadata” tab for the bucket, then “Static website hosting”. The index document is usually “index.html”.
Check CloudFront Origin - the “Origin” column in the CloudFront Console should show the S3 bucket’s endpoint (s3-website.us-east-2.amazonaws.com), not the bucket name (yourdomain.com.s3.amazonaws.com).
Check CloudFront Default Root Object - clicking on the distribution name should take you to a details page that shows “Default Root Object” in the list with the value that you set, usually “index.html”.
Wait. Sometimes changes take up to 48hrs to propagate. ¯\_(ツ)_/¯

I hope that helps you get set up with your new static site on AWS! If you found this post helpful, there’s a lot more where this came from. You can subscribe below to see new posts first.

About time

2017-11-22T14:05:14-05:00

This morning I read an article that’s been making the rounds lately: Modern Media Is a DoS Attack on Your Free Will.

It’s made me think, which I must admit, I at first didn’t like. See, when I wake up in the morning (and subsequently wake up my computer) the first thing I do is go on Twitter to catch up on everything I missed while I was asleep. All this before my first coffee, mind you. Links on Twitter usually lead to stories on Medium, newly released apps on ProductHunt, and enticing sales on a new gadget or two on Amazon. Wherever it goes, in those blissfully half-awake mental recesses, the last thing I’m trying to do is think.

However, yesterday, I also happened to listen to a podcast from freeCodeCamp. It was #7: The code I’m still ashamed of. This led to thoughts on the responsibilities of programmers - the people tasked with designing and building apps and systems meant to steer the very course of your life.

This morning, the combined swirling mess of notions brought on by these two sources of information had, even before my first coffee, the unfortunate effect of making me think.

Mostly, I thought about intention, and time.

I don’t believe it’s wildly inaccurate to say that when you go about doing something in your daily life, you have a general awareness of your reason for doing it. If you leave your building and go down the street to Starbucks and buy a coffee, more often than not, it’s because you wanted a coffee. If you go to the corner store and buy a litre of milk, you probably intend to drink it. If you find yourself nicely dressed on a Friday night waiting at a well-decorated restaurant to meet another human being with whom you share an apparent mutual attraction, I can risk a guess that you’re after some form of pleasant human interaction.

In each of these, and many more examples you can think up, the end goal is clearly defined. There is an expected final step to the process; an expected response; a return value.

What is the return value of opening up the Twitter app? Browsing Facebook? Instagram? In fact, any social media?

The concrete answer is that there isn’t one. Perhaps in those of us with resilient self-discipline, there may at least be some sort of time limitation. That’s the most we can hope for, however, and no wonder - that’s what these and other similar services have been designed for. They’re built to be open-ended black-holes for our most precious resource… time.

In the case of the Analytical Engine we have undoubtedly to lay out a certain capital of analytical labour in one particular line; but this is in order that the engine may bring us in a much larger return in another line.

Ada Augusta (Ada Lovelace) - Notes on Sketch of The Analytical Engine

Okay, so I did some more reading. Specifically, #ThrowbackThursday to the mid-1800s and something my good friend Ada Lovelace once scribbled in a book. Widely considered one of the first computer programmers, she and Charles Babbage pioneered many concepts that programmers today take for granted. The one I’m going to hang my point on is, I think, nicely encapsulated in the above quote: the things programmers make are supposed to save you time.

Save it. Not lose it.

I think Ada and Charles would agree that, observing the effects of social media apps, clickbait news sites, and many other forms of attention-hogging interactivity that we haven’t even classified yet - something’s gone horribly wrong.

What if, as programmers, we actually did something about it?

Consider that collectively - no, even individually - we who design and build the workings of modern technology have an incredible amount of power. The next indie app that goes viral on ProductHunt will consume hundreds of hours of time from its users. Where is all that untapped, pure potential going to? Some open-ended, inoffensive amusement? Another advertising platform thinly veiled as a game? Perhaps another drop of oil to smooth the machinery of The Great Engine of Commerce?

I get it - programmers will build what they’re paid to build. That’s capitalism, that’s feeding your family, survival–life. I’m not trying to suggest we all quit our jobs, go live in the woods, and volunteer as humanitarians. That would be nice, but it’s impractical.

But we all have side projects. Free time. What are you doing with yours?

Before I’m accused of being too hand-wavy and idealistic, I want to offer a concrete suggestion. Build things that save time. Not in the “I’ve made yet another to-do list app for you to download,” kind of way, but in the “Here’s a one-liner to automate this mundane thing that would have taken you hours,” kind of way. Here, have a shameless plug.

I also really like this idea from the first article I mentioned, so hang on tight while I bring this full circle:

What’s one concrete thing companies could do now to stop subverting our attention?

I would just like to know what is the ultimate design goal of that site or that system that’s shaping my behavior or thinking. What are they really designing my experience for? Companies will say that their goal is to make the world open and connected or whatever. These are lofty marketing claims. But if you were to actually look at the dashboards that they’re designing, the high-level metrics they’re designing for, you probably wouldn’t see those things. You’d see other things, like frequency of use, time on site, this type of thing. If there was some way for the app to say, to the user, “Here’s generally what this app wants from you, from an attentional point of view,” that would be huge. It would probably be the primary way I would decide which apps I download and use.

There are so many ways I’d love to see this put into practice, from the obvious to the subversive. A little position: sticky; banner? A custom meta tag in the header? Maybe a call to action like this takes more introspection and honesty than a lot of app makers are ready for… but maybe it just takes a little of our time.

Batch renaming images, including image resolution, with awk

2017-11-20T13:59:30-05:00

The most recent item on my list of “Geeky things I did that made me feel pretty awesome” is an hour’s adventure that culminated in this code:

$ file IMG* | awk 'BEGIN{a=0} {print substr($1, 1, length($1)-5),a++"_"substr($8,1, length($8)-1)}' | while read fn fr; do echo $(rename -v "s/$fn/img_$fr/g" *); done
IMG_20170808_172653_425.jpg renamed as img_0_4032x3024.jpg
IMG_20170808_173020_267.jpg renamed as img_1_3024x3506.jpg
IMG_20170808_173130_616.jpg renamed as img_2_3024x3779.jpg
IMG_20170808_173221_425.jpg renamed as img_3_3024x3780.jpg
IMG_20170808_173417_059.jpg renamed as img_4_2956x2980.jpg
IMG_20170808_173450_971.jpg renamed as img_5_3024x3024.jpg
IMG_20170808_173536_034.jpg renamed as img_6_4032x3024.jpg
IMG_20170808_173602_732.jpg renamed as img_7_1617x1617.jpg
IMG_20170808_173645_339.jpg renamed as img_8_3024x3780.jpg
IMG_20170909_170146_585.jpg renamed as img_9_3036x3036.jpg
IMG_20170911_211522_543.jpg renamed as img_10_3036x3036.jpg
IMG_20170913_071608_288.jpg renamed as img_11_2760x2760.jpg
IMG_20170913_073205_522.jpg renamed as img_12_2738x2738.jpg
// ... etc etc

The last item on the aforementioned list is “TODO: come up with a shorter title for this list.”

I previously wrote about the power of command line tools like sed. This post expands on how to string all this magical functionality into one big, long, rainbow-coloured, viscous stream of awesome.

Rename files

The tool that actually handles the renaming of our files is, appropriately enough, rename. The syntax is: rename -n "s/original_filename/new_filename/g" * where -n does a dry run that only reports what would change, and substituting -v performs the renaming and prints each change. The s indicates our substitution string, and g for “global” replaces all occurrences of the string. The * is a shell wildcard that expands to every file in the current directory, so rename applies our search-and-replace to each of them.

We’ll come back to this later.

Get file information

When I run $ file IMG_20170808_172653_425.jpg in the image directory, I get this output:

IMG_20170808_172653_425.jpg: JPEG image data, baseline, precision 8, 4032x3024, frames 3

Since we can get the image resolution (“4032x3024” above), we know that we’ll be able to use it in our new filename.

Isolate the information we want

I love awk for its simplicity. It takes lines of text and makes individual bits of information available to us with built in variables that we can then refer to as column numbers denoted by $1, $2, etc. By default, awk splits up columns on whitespace. To take the example above:

|              1               |   2  |   3   |   4   |     5     |     6     | 7  |      8     |   9    | 10 |
-------------------------------------------------------------------------------------------------------------
| IMG_20170808_172653_425.jpg: | JPEG | image | data, | baseline, | precision | 8, | 4032x3024, | frames | 3  |

We can denote different values to use as a splitter with, for example, -F',' if we wanted to use commas as the column divisions. For our current project, spaces are fine.

There are a couple of issues we need to solve before we can plug the information into our new filenames. Column $1 has the original filename we want, but there’s an extra “:” character on the end. We don’t need the “.jpg” either. Column $8 has an extra “,” that we don’t want as well. To get just the information we need, we’ll take a substring of the column with substr():

substr($1, 1, length($1)-5) - This gives us the file name from the beginning of the string to the end of the string, minus 5 characters (“length minus 5”).
substr($8,1, length($8)-1) - This gives us the image size, without the extra comma (“length minus 1”).
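If awk’s column numbers feel unfamiliar, here is the same trimming sketched in JavaScript. It’s only an illustration of what the two substr() calls do, not part of the one-liner; the variable names (line, columns, baseName, resolution) are made up for this example, and the input is the file output shown earlier.

```javascript
// One line of `file` output, copied from the example above.
var line = 'IMG_20170808_172653_425.jpg: JPEG image data, baseline, precision 8, 4032x3024, frames 3';

// awk splits on runs of whitespace by default; this regex split mimics that.
var columns = line.trim().split(/\s+/);

// awk columns are 1-indexed ($1, $8); JavaScript arrays are 0-indexed.
var col1 = columns[0]; // "IMG_20170808_172653_425.jpg:"
var col8 = columns[7]; // "4032x3024,"

// Like substr($1, 1, length($1)-5): drop the last 5 characters (".jpg:").
var baseName = col1.slice(0, col1.length - 5);

// Like substr($8, 1, length($8)-1): drop the trailing comma.
var resolution = col8.slice(0, col8.length - 1);

console.log(baseName, resolution);
// IMG_20170808_172653_425 4032x3024
```

Note that awk’s substr() counts characters starting from 1, while slice() starts from 0, which is why the start positions differ.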

Avoid duplicate file names

To ensure that two images with the same resolutions don’t create identical, competing file names, we’ll append a unique incrementing number to the filename.

BEGIN{a=0} - Using BEGIN tells awk to run the following code only once, at the (drumroll) beginning. Here, we’re declaring the variable a to be 0.
a++ - Later in our code, at the appropriate spot for our file name, we call a and increment it.

When awk prints a string, it concatenates everything that isn’t separated by a comma. {print a b c} would create “abc” and {print a,b,c} would create “a b c”, for example.

We can add additional characters to our file name, such as an underscore, by inserting it in quotations: "_".

String it all together

To feed the output of one command into another command, we use “pipe,” written as |.

If we only used pipe in this instance, all our data from file and awk would get fed into rename all at once, making for one very, very long and probably non-compiling file name. To run the rename command line by line, we can use while and read. Similarly to awk, read takes input and splits it into variables we can assign and use. In our code, it takes the first bit of output from awk (the original file name) and assigns it to the variable $fn. It takes the second output (our incrementing number and the image resolution) and assigns that to $fr. The variable names are arbitrary; you can call them whatever you want.

To run our rename commands as if we’d manually entered them in the terminal one by one, we can use echo $(some command). Finally, done ends our while loop.

Bonus round: rainbow output

I wasn’t kidding with that “rainbow-coloured” bit…

$ pip install lolcat

Here’s our full code:

$ file IMG* | awk 'BEGIN{a=0} {print substr($1, 1, length($1)-5),a++"_"substr($8,1, length($8)-1)}' | while read fn fr; do echo $(rename -v "s/$fn/img_$fr/g" *); done | lolcat

Enjoy!

How to code a satellite algorithm and cook paella from scratch

2017-09-08T16:50:24-04:00

What if I told you that by the end of this article, you’ll be able to calculate the orbital period of satellites around Earth using their average altitudes and… You tuned out already, didn’t you?

Okay, how about this: I’m going to teach you how to make paella!

And you’ll have written a function that does the stuff I mentioned above, just like I did for a freeCodeCamp challenge.

I promise there’s an overarching moral lesson that will benefit you every day for the rest of your life. Or at least, feed you for one night. Let’s get started.

The only thing I know about paella is that it’s an emoticon

Unless you’re reading this on a Samsung phone, in which case you’re looking at a Korean hotpot.

One of my favorite things about living in the world today is that it’s totally fine to know next-to-nothing about something. A hundred years ago you might have gone your whole life not knowing anything more about paella other than that it’s an emoticon.* But today? You can simply look it up.

*That was a joke.

As with all things in life, when we are unsure, we turn to the internet - in this case, the entry for paella on Wikipedia, which reads:

Paella …is a Valencian rice dish. Paella has ancient roots, but its modern form originated in the mid-19th century near the Albufera lagoon on the east coast of Spain adjacent to the city of Valencia. Many non-Spaniards view paella as Spain’s national dish, but most Spaniards consider it to be a regional Valencian dish. Valencians, in turn, regard paella as one of their identifying symbols.

At this point, you’re probably full of questions. Do I need to talk to a Valencian? Should I take an online course on the history of Spain? What type of paella should I try to make? What is the common opinion of modern chefs when it comes to paella types?

If you set out with the intention of answering all these questions, one thing is certain: you’ll never end up actually making paella. You’ll spend hours upon hours typing questions into search engines and years later wake up with a Masters in Valencian Cuisine.

The “Most Important Question” method

When I talk to myself out loud in public (doesn’t everyone?) I refer to this as “MIQ” (rhymes with “Nick”). I also imagine MIQ to be a rather crunchy and quite adorable anthropomorphized tortilla chip. Couldn’t tell you why.

MIQ swings his crunchy triangular body around to point me in the right direction, and the right direction always takes the form of the most important question that you need to ask yourself at any stage of problem solving. The first most important question is always this:

What is the scope of the objective I want to achieve?

Well, you want to make paella.

The next MIQ then becomes: how much do I actually need to know about paella in order to start making it?

You’ve heard this advice before: any big problem can be broken down into multiple, but more manageable, bite-size problems. In this little constellation of bite-size problems, there’s only one that you need to solve in order to get most of the way to a complete solution.

In the case of making paella, we need a recipe. That’s a bite-size problem that a search engine can solve for us:

Simple Paella Recipe

In a medium bowl, mix together 2 tablespoons olive oil, paprika, oregano, and salt and pepper. Stir in chicken pieces to coat. Cover, and refrigerate.

Heat 2 tablespoons olive oil in a large skillet or paella pan over medium heat. Stir in garlic, red pepper flakes, and rice. Cook, stirring, to coat rice with oil, about 3 minutes. Stir in saffron threads, bay leaf, parsley, chicken stock, and lemon zest. Bring to a boil, cover, and reduce heat to medium low. Simmer 20 minutes.

Meanwhile, heat 2 tablespoons olive oil in a separate skillet over medium heat. Stir in marinated chicken and onion; cook 5 minutes. Stir in bell pepper and sausage; cook 5 minutes. Stir in shrimp; cook, turning the shrimp, until both sides are pink.

Spread rice mixture onto a serving tray. Top with meat and seafood mixture. (allrecipes.com)

And voila! Believe it or not, we’re most of the way there already.

Having a set of step-by-step instructions that are easy to understand is really most of the work done. All that’s left is to go through the motions of gathering the ingredients and then making paella. From this point on, your MIQs may become fewer and far between, and they may slowly decrease in importance in relation to the overall problem. (Where do I buy paprika? How do I know when sausage is cooked? How do I set the timer on my phone for 20 minutes? How do I stop thinking about this delicious smell? Which Instagram filter best captures the ecstasy of this paella right now?)

The answer to that last one is Nashville

I still know nothing about calculating the orbital periods of satellites

Okay. Let’s examine the problem:

Return a new array that transforms the element’s average altitude into their orbital periods.

The array will contain objects in the format {name: ’name’, avgAlt: avgAlt}.

You can read about orbital periods on wikipedia.

The values should be rounded to the nearest whole number. The body being orbited is Earth.

The radius of the earth is 6367.4447 kilometers, and the GM value of earth is 398600.4418 km³s⁻².

orbitalPeriod([{name : "sputnik", avgAlt : 35873.5553}]) should return [{name: "sputnik", orbitalPeriod: 86400}].

Well, as it turns out, in order to calculate the orbital period of satellites, we also need a recipe. Amazing, the things you can find on the internet these days.

Courtesy of dummies.com (yup! #noshame), here’s our recipe:

It’s kind of cute, in a way.

That might look pretty complicated, but as we’ve already seen, we just need to answer the next MIQ: how much do I actually need to know about this formula in order to start using it?

In the case of this challenge, not too much. We’re already given earthRadius, and avgAlt is part of our arguments object. Together, they form the radius, r. With a couple search queries and some mental time-travel to your elementary math class, we can describe this formula in a smattering of English:

T, the orbital period, equals 2 multiplied by Pi, in turn multiplied by the square root of the following: the radius, r, cubed, divided by GM (the gravitational constant multiplied by the mass of the Earth).
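Written out in symbols, that sentence looks roughly like this. The split of r into earthRadius plus avgAlt follows the variable names the challenge uses; the final line plugs in the sputnik values from the challenge text, which states the answer should be 86400.

```latex
T = 2\pi \sqrt{\frac{r^{3}}{GM}}, \qquad r = \text{earthRadius} + \text{avgAlt}

% Worked example with the challenge's sputnik values:
% r = 6367.4447 + 35873.5553 = 42241 \text{ km}
T = 2\pi \sqrt{\frac{42241^{3}}{398600.4418}} \approx 86400 \text{ s}
```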

JavaScript has a Math.PI property, as well as Math.sqrt() function and Math.pow() function. Using those combined with simple calculation, we can represent this equation in a single line assigned to a variable:

var orbitalPeriod = 2 * Math.PI * (Math.sqrt(Math.pow((earthRadius + avgAlt), 3) / GM));

From the inside out:

Add earthRadius and avgAlt
Cube the result of step 1
Divide the result of step 2 by GM
Take the square root of the result of step 3
Multiply 2 times Pi times the result of step 4
Assign the returned value to orbitalPeriod

Believe it or not, we’re already most of the way there.

The next MIQ for this challenge is to take the arguments object, extract the information we need, and return the result of our equation in the required format. There are a multitude of ways to do this, but I’m happy with a straightforward for loop:

function orbitalPeriod(arr) {
  var resultArr = [];

  for (var teapot = 0; teapot < arguments[0].length; teapot++) {
    var GM = 398600.4418;
    var earthRadius = 6367.4447;
    var avgAlt = arguments[0][teapot]['avgAlt'];
    var name = arguments[0][teapot]['name'];
    var orbitalPeriod = 2 * Math.PI * (Math.sqrt(Math.pow((earthRadius + avgAlt), 3) / GM));
    var result = {
      name: name,
      orbitalPeriod: Math.round(orbitalPeriod)
    };
    resultArr.push(result);
  }

  return resultArr;
}
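Since there are a multitude of ways to do this, here’s a sketch of one alternative using Array.prototype.map() in place of the for loop. The function name orbitalPeriodWithMap is made up for this example so it doesn’t clash with the solution above; the constants and the expected result for sputnik come straight from the challenge text.

```javascript
// Hypothetical name, chosen so it does not clash with orbitalPeriod() above.
function orbitalPeriodWithMap(arr) {
  var GM = 398600.4418;        // km^3 s^-2, from the challenge text
  var earthRadius = 6367.4447; // km, from the challenge text

  // map() builds a new array by transforming each satellite object in turn.
  return arr.map(function (satellite) {
    var r = earthRadius + satellite.avgAlt;
    var period = 2 * Math.PI * Math.sqrt(Math.pow(r, 3) / GM);
    return { name: satellite.name, orbitalPeriod: Math.round(period) };
  });
}

var result = orbitalPeriodWithMap([{ name: "sputnik", avgAlt: 35873.5553 }]);
console.log(result);
// [ { name: 'sputnik', orbitalPeriod: 86400 } ]
```

Same recipe, different pan: map() takes care of the looping and of collecting the results, so there’s no counter or result array to manage by hand.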

If you need a refresher on iterating through arrays, have a look at my article on iterating, featuring breakfast arrays! (5 minute read)

Don’t look now, but you just gained the ability to calculate the orbital period of satellites. You could even do it while making paella, if you wanted to. Seriously. Put it on your resume.

Tl;dr: the overarching moral lesson

Whether it’s cooking, coding, or anything else, problems may at first seem confusing, insurmountable, or downright boring. If you’re faced with such a challenge, just remember: they’re a lot more digestible with a side of bite-sized MIQ chips.

Making sandwiches with closures in JavaScript

2017-05-28T09:16:35+07:00

Say you’re having a little coding get-together, and you need some sandwiches. You happen to know that everyone prefers a different type of sandwich, like chicken, ham, or peanut butter and mayo. You could make all these sandwiches yourself, but that would be tedious and boring.

Luckily, you know of a nearby sandwich shop that delivers. They have the ability and ingredients to make any kind of sandwich in the world, and all you have to do is order through their app.

The sandwich shop looks like this:

function makeMeASandwich(x) {
    var ingredients = x.join(' ');
    return function barry() {
        return ingredients.concat(' sandwich');
    }
}

Notice that we have an outer function, makeMeASandwich() that takes an argument, x. This outer function has the local variable ingredients, which is just x mushed together.

Barry? Who’s Barry? He’s the guy who works at the sandwich shop. You’ll never talk with Barry directly, but he’s the reason your sandwiches are made, and why they’re so delicious. Barry takes ingredients and mushes them together with " sandwich".

The reason Barry is able to access the ingredients is because they’re in his outer scope. If you were to take Barry out of the sandwich shop, he’d no longer be able to access them. This is an example of lexical scoping: “Nested functions have access to variables declared in their outer scope.” (MDN)

Barry, happily at work in the sandwich shop, is an example of a closure.

Closures are functions that refer to independent (free) variables (variables that are used locally, but defined in an enclosing scope). In other words, these functions ‘remember’ the environment in which they were created. (MDN)

When you order, the app submits your sandwich request like so:

var pbm = makeMeASandwich(['peanut butter', 'mayo']);

pbm();

And in thirty-minutes-or-it’s-free, you get: peanut butter mayo sandwich.

The nice thing about the sandwich shop app is that it remembers the sandwiches you’ve ordered before. Your peanut butter and mayo sandwich is now available to you as pbm() for you to order anytime. It’s pretty convenient since, each time you order, there’s no need to specify that the sandwich you want is the same one you got before with peanut butter and mayo and it’s a sandwich. Using pbm() is much more concise.

Let’s order the sandwiches you need for the party:

var pmrp = makeMeASandwich(['prosciutto', 'mozzarella', 'red pepper']);
var pbt = makeMeASandwich(['peanut butter', 'tuna']);
var hm = makeMeASandwich(['ham']);
var pbm = makeMeASandwich(['peanut butter', 'mayo']);

pmrp();
pbt();
hm();
pbm();

Your order confirmation reads:

prosciutto mozzarella red pepper sandwich
peanut butter tuna sandwich
ham sandwich
peanut butter mayo sandwich

Plot twist! The guy who wanted a ham sandwich now wants a ham and cheese sandwich. Luckily, the sandwich shop just released a new version of their app that will let you add cheese to any sandwich.

With this added feature, the sandwich shop now looks like this:

function makeMeASandwich(x) {
    var ingredients = x.join(' ');
    var slices = 0;

    function barry() {
        return ingredients.concat(' sandwich');
    }
    function barryAddCheese() {
        slices += 2;
        return ingredients.concat(' sandwich with ', slices, ' slices of cheese');
    }
    return {
        noCheese: function() {
            return barry();
        },
        addCheese: function() {
            return barryAddCheese();
        }
    }
}

You amend the order to look like this:

pmrp.noCheese();
pbt.noCheese();
hm.addCheese();
pbm.noCheese();

And your order confirmation reads:

prosciutto mozzarella red pepper sandwich
peanut butter tuna sandwich
ham sandwich with 2 slices of cheese
peanut butter mayo sandwich

You’ll notice that when you order a sandwich with cheese, Barry puts 2 slices of cheese on it. In this way, the sandwich shop controls how much cheese you get. You can’t get to Barry to tell him you want more than 2 slices at a time. That’s because your only access to the sandwich shop is through the public functions noCheese or addCheese.

Of course, there’s a way to cheat the system…

hm.addCheese();
hm.addCheese();
hm.addCheese();

By ordering the same ham sandwich with cheese three times, you get: ham sandwich with 6 slices of cheese.

This happens because the sandwich shop app recognizes the variable hm as the same sandwich each time, and increases the number of cheese slices it tells Barry to add.
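To see that each sandwich keeps its own cheese count, here’s a small sketch. It uses a condensed copy of the sandwich shop (Barry’s two functions are folded into the returned object) so that it runs on its own; the behaviour is meant to match the version above.

```javascript
// Condensed copy of the sandwich shop above, so this snippet runs on its own.
function makeMeASandwich(x) {
    var ingredients = x.join(' ');
    var slices = 0; // private to this one sandwich

    return {
        noCheese: function() {
            return ingredients.concat(' sandwich');
        },
        addCheese: function() {
            slices += 2;
            return ingredients.concat(' sandwich with ', slices, ' slices of cheese');
        }
    };
}

var hm = makeMeASandwich(['ham']);
var pbt = makeMeASandwich(['peanut butter', 'tuna']);

hm.addCheese();
var hamOrder = hm.addCheese();   // second call on the same closure
var tunaOrder = pbt.addCheese(); // first call on a different closure

console.log(hamOrder);  // ham sandwich with 4 slices of cheese
console.log(tunaOrder); // peanut butter tuna sandwich with 2 slices of cheese
console.log(hm.slices); // undefined
```

Each call to makeMeASandwich() creates a fresh slices variable, so the ham and the tuna never share cheese. And since slices isn’t a property of the returned object, there’s no way to reach in and change it from outside.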

The app could prevent you from adding lots of cheese to the same sandwich, either by adding a maximum or by appending unique order numbers to the variable names… but this is our fantasy sandwich shop, and we get to pile on as much cheese as we want.

By using closures, we can have JavaScript emulate private methods found in languages like Ruby and Java. Closures are a useful way to extend the functionality of JavaScript, and also order sandwiches.

Understanding Array.prototype.reduce() and recursion using apple pie

2017-05-18T11:40:06+07:00

I was having trouble understanding reduce() and recursion in JavaScript, so I wrote this article to explain it to myself (hey, look, recursion!). I hope you find my examples both helpful and delicious.

Given an array with nested arrays:

var arr = [1, [2], [3, [[4]]]]

We want to produce this:

var flat = [1, 2, 3, 4]

Using for loops and if statements

Naively, if we know the maximum number of nested arrays we’ll encounter (there are 4 in this example), we can use for loops to iterate through each array item, then if statements to check if each item is in itself an array, and so on…

function flatten(arr) {
    var flat = [];
    for (var i=0; i<arr.length; i++) {
        if (Array.isArray(arr[i])) {
            for (var ii=0; ii<arr[i].length; ii++) {
                if (Array.isArray(arr[i][ii])) {
                    for (var iii=0; iii<arr[i][ii].length; iii++) {
                        if (Array.isArray(arr[i][ii][iii])) {
                            for (var iiii=0; iiii<arr[i][ii][iii].length; iiii++) {
                                flat.push(arr[i][ii][iii][iiii]);
                            }
                        } else {
                            flat.push(arr[i][ii][iii]);
                        }
                    }
                } else {
                    flat.push(arr[i][ii]);
                }
            }
        } else {
            flat.push(arr[i]);
        }
    }
    return flat;
}

// [1, 2, 3, 4]

…Which works, but of course looks ridiculous. Besides looking ridiculous, a) it only works if we know how many nested arrays we’ll process, b) it’s hard to read and harder to understand, and c) can you imagine having to debug this mess?! (Gee, I think there’s an extra i somewhere.)

Using reduce

JavaScript has a couple methods we can use to make our code a little less ridiculous. One of these is reduce() and it looks like this:

var flat = arr.reduce(function(done,curr){
    return done.concat(curr);
}, []);

// [ 1, 2, 3, [ [ 4 ] ] ]

It’s a lot less code, but we haven’t taken care of some of the nested arrays. Let’s first walk through reduce() together and examine what it does to see how we’ll correct this.

Array.prototype.reduce() The reduce() method applies a function against an accumulator and each element in the array (from left to right) to reduce it to a single value. (MDN)

It’s not quite as complicated as it seems. Let’s think of reduce() as an out-of-work developer (AI took all the dev jobs) with an empty basket. We’ll call him Adam. Adam’s main function (ba-dum ching) is now to take apples from a pile, shine them up, and put them one-by-one into the basket. This basket of shiny apples is destined to become delicious apple pies. It’s a very important job.

Apples plus human effort equals pie. Not to be confused with apple-human-pie, which is less appetizing.

In our above example, the pile of apples is our array, arr. Our basket is done, the accumulator. The initial value of done is an empty array, which we see as [] at the end of our reduce function. The apple that our out-of-work dev is currently shining, you guessed it, is curr. Once Adam processes the current apple, he places it into the basket (.concat()). When there are no more apples in the pile, he returns the basket of polished apples to us, and then probably goes home to his cat, or something.
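To watch Adam work, here's a small sketch that records what's in the basket (done) and which apple he's holding (curr) at each step. I'm assuming arr is the nested array from our example, [1, [2], [3, [[4]]]], and the steps array is a hypothetical helper that exists only for this trace.

```javascript
// Assumption: this is the example array used throughout this post.
var arr = [1, [2], [3, [[4]]]];

// Hypothetical helper: a log of each step Adam takes.
var steps = [];

var flat = arr.reduce(function(done, curr) {
    // Record the basket as it looks before the current apple goes in.
    steps.push({ basket: JSON.stringify(done), apple: JSON.stringify(curr) });
    // concat() unpacks only one level of array, which is why [[4]] survives.
    return done.concat(curr);
}, []);

steps.forEach(function(step, i) {
    console.log('step ' + (i + 1) + ': basket ' + step.basket + ', apple ' + step.apple);
});
// step 1: basket [], apple 1
// step 2: basket [1], apple [2]
// step 3: basket [1,2], apple [3,[[4]]]

console.log(JSON.stringify(flat)); // [1,2,3,[[4]]]
```

The trace shows the basket growing by one apple (or one unpacked box) per step, and that the innermost boxes are never opened.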

Using reduce recursively to address nested arrays

So that’s all well and good, and now we have a basket of polished apples. But we still have some nested arrays to deal with. Going back to our analogy, let’s say that some of the apples in the pile are in boxes. Within each box there could be more apples, and/or more boxes containing smaller, cuter apples.

Adorable, slightly skewed apples just want to be loved/eaten.

Here’s what we want our apple-processing-function/Adam to do:

1. If the pile of apples is a pile of apples, take an apple from the pile.
2. If the apple is an apple, polish it, put it in the basket.
3. If the apple is a box, open the box. If the box contains an apple, go to step 2.
4. If the box contains another box, open this box, and go to step 3.
5. When the pile is no more, give us the basket of shiny apples.
6. If the pile of apples is not a pile of apples, give back whatever it is.

A recursive reduce function that accomplishes this is:

function flatten(arr) {
  if (Array.isArray(arr)) {
    return arr.reduce(function(done,curr){
      return done.concat(flatten(curr));
    }, []);
  } else {
    return arr;
  }
}

// [ 1, 2, 3, 4 ]

Bear with me and I’ll explain.

An act of a function calling itself. Recursion is used to solve problems that contain smaller sub-problems. A recursive function can receive two inputs: a base case (ends recursion) or a recursive case (continues recursion). (MDN)

If you examine our code above, you’ll see that flatten() appears twice. The first time it appears, it tells Adam what to do with the pile of apples. The second time, it tells him what to do with the thing he’s currently holding, providing instructions in the case it’s an apple, and in the case it’s not an apple. The thing to note is that these instructions are a repeat of the original instructions we started with - and that’s recursion.

We’ll break it down line-by-line for clarity:

function flatten(arr) { - we name our overall function and specify that it will take an argument, arr.
if (Array.isArray(arr)) { - we examine the provided “arrgument” (I know, I’m very funny) to determine if it is an array.
return arr.reduce(function(done,curr){ - if the previous line is true and the argument is an array, we want to reduce it. This is our recursive case. We’ll apply the following function to each array item…
return done.concat(flatten(curr)); - an unexpected plot twist appears! The function we want to apply is the very function we’re in. Colloquially: take it from the top.
}, []); - we tell our reduce function to start with an empty accumulator (done), and wrap it up.
} else { - this resolves our if statement at line 2. If the provided argument isn’t an array…
return arr; - return whatever the arr is. (Hopefully a cute apple.) This is our base case that breaks us out of recursion.
} - end the else statement.
} - end the overall function.

And we’re done! We’ve gone from our 20-plus line, 4-layers-deep nested for loop solution to a much more concise, 9 line recursive reduce solution. Reduce and recursion can seem a little impenetrable at first, but they’re valuable tools that will save you lots of future effort once you grasp them.
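If you'd like to convince yourself that the recursive version no longer cares how deep the boxes go, here's a quick check. The function is repeated so the snippet runs on its own, and the inputs (deeper, 'apple') are hypothetical values I made up for illustration.

```javascript
// Same recursive function as above, repeated so this snippet runs on its own.
function flatten(arr) {
  if (Array.isArray(arr)) {
    return arr.reduce(function(done, curr) {
      return done.concat(flatten(curr));
    }, []);
  } else {
    return arr;
  }
}

// Hypothetical input, nested more deeply than the original example,
// with an empty box thrown in.
var deeper = [1, [2, [3, [4, [5]]]], []];
console.log(flatten(deeper)); // [ 1, 2, 3, 4, 5 ]

// Base case: anything that isn't an array comes back untouched.
console.log(flatten('apple')); // apple
```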

And don’t worry about Adam, our out-of-work developer. He got so much press after being featured in this article that he opened up his very own AI-managed apple pie factory. He’s very happy.

+1 for you if you saw that one coming.

Iterating over objects and arrays: frequent errors

2017-05-16T10:46:46+07:00

Here’s ~~some complaining~~ a quick overview of some code that has confounded me more than once. I’m told even very experienced developers encounter these situations regularly, so if you find yourself on your third cup of coffee scratching your head over why your code is doing exactly what you told it to do (and not what you want it to do), maybe this post can help you.

The example code is JavaScript, since that’s what I’ve been working in lately, but I believe the concepts to be pretty universal.

Quick reference for equivalent statements

This…	…is the same as this
`i++;`	`i = i + 1;`
`i--;`	`i = i - 1;`
`apples += 5`	`apples = apples + 5;`
`apples -= 5`	`apples = apples - 5;`
`apples *= 5`	`apples = apples * 5;`
`apples /= 5`	`apples = apples / 5;`

Quick reference for logical statements

This…	…gives this
`3 == '3'`	`true` (type converted)
`3 === '3'`	`false` (type matters; a number is not a string)
`3 != '3'`	`false` (type converted, so 3 equals 3)
`3 !== '3'`	`true` (type matters; a number is not a string)
`||`	logical “or”: true if either side is true
`&&`	logical “and”: true only if both sides are true
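The table above is easy to verify in a console. Here's a short sketch. The apples variable is a hypothetical value I added to show that the logical operators hand back one of their operands instead of a strict true or false.

```javascript
// Loose equality converts types before comparing.
console.log(3 == '3');   // true
console.log(3 != '3');   // false

// Strict equality compares type and value.
console.log(3 === '3');  // false
console.log(3 !== '3');  // true

// typeof shows why: one side is a number, the other a string.
console.log(typeof 3, typeof '3'); // number string

// || and && return one of their operands, and stop evaluating
// as soon as the result is known (short-circuiting).
var apples = 0; // hypothetical value
console.log(apples || 'no apples');  // no apples
console.log(apples && 'has apples'); // 0
```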

Objects

Given a breakfast object that looks like this:

var breakfast = {
    'eggs': 2,
    'waffles': 2,
    'fruit': {
        'blueberries': 5,
        'strawberries': 1,
    },
    'coffee': 1
}

Iterate over object properties

We can iterate through each breakfast item using a for loop as follows:

for (var item in breakfast) {
    console.log('item: ', item);
}

This produces:

item:  eggs
item:  waffles
item:  fruit
item:  coffee

Get object property value

We can access the value of the property or nested properties (in this example, the number of items) like this:

console.log('How many waffles? ', breakfast['waffles'])
console.log('How many strawberries? ', breakfast['fruit']['strawberries'])

Or equivalent syntax:

console.log('How many waffles? ', breakfast.waffles)
console.log('How many strawberries? ', breakfast.fruit.strawberries)

This produces:

How many waffles?  2
How many strawberries?  1
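The two syntaxes are only interchangeable when you know the property name ahead of time. As a small sketch of where they part ways, assume a hypothetical variable, craving, that holds a property name picked at runtime:

```javascript
var breakfast = {
    'eggs': 2,
    'waffles': 2,
    'fruit': {
        'blueberries': 5,
        'strawberries': 1,
    },
    'coffee': 1
};

// Hypothetical variable holding a property name chosen at runtime.
var craving = 'waffles';

// Bracket notation evaluates the variable and looks up "waffles".
console.log(breakfast[craving]); // 2

// Dot notation looks for a property literally named "craving".
console.log(breakfast.craving);  // undefined
```

This is why the for loops in this post use breakfast[item] and never breakfast.item.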

Get object property from the value

If instead I want to access the property via the value, for example, to find out which items are served in twos, I can do so by iterating like this:

for (var item in breakfast) {
    if (breakfast[item] == 2) {
        console.log('Two of: ', item);
    }
}

Which gives us:

Two of:  eggs
Two of:  waffles
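If you find yourself doing this lookup often, it can be wrapped up in a function. This is just one possible sketch: itemsWithCount is a hypothetical helper name, and it uses Object.keys() with filter() instead of for...in, so it only considers the object's own properties. It also uses strict equality, so a '2' stored as a string would not match.

```javascript
var breakfast = {
    'eggs': 2,
    'waffles': 2,
    'fruit': {
        'blueberries': 5,
        'strawberries': 1,
    },
    'coffee': 1
};

// Hypothetical helper: list every top-level property whose value matches.
function itemsWithCount(obj, count) {
    return Object.keys(obj).filter(function(key) {
        return obj[key] === count;
    });
}

console.log(itemsWithCount(breakfast, 2)); // [ 'eggs', 'waffles' ]

// Nested properties aren't searched, so the 5 blueberries aren't found.
console.log(itemsWithCount(breakfast, 5)); // []
```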

Alter nested property values

Say I want to increase the number of fruits in breakfast, because sugar is bad for me and I like things that are bad for me. I can do that like this:

var fruits = breakfast['fruit'];
for (var f in fruits) {
    fruits[f] += 1;
}
console.log(fruits);

Which gives us:

{ blueberries: 6, strawberries: 2 }
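One thing that has tripped me up here: fruits isn't a copy. It points to the same object as breakfast['fruit'], so the original breakfast changes too. A minimal sketch of the difference between objects and plain values (the eggs variable is added for illustration):

```javascript
var breakfast = {
    'eggs': 2,
    'waffles': 2,
    'fruit': {
        'blueberries': 5,
        'strawberries': 1,
    },
    'coffee': 1
};

var fruits = breakfast['fruit'];
for (var f in fruits) {
    fruits[f] += 1;
}

// fruits and breakfast.fruit are the same object, so both show the change.
console.log(breakfast.fruit);            // { blueberries: 6, strawberries: 2 }
console.log(fruits === breakfast.fruit); // true

// A primitive value, by contrast, is copied on assignment.
var eggs = breakfast.eggs;
eggs += 1;
console.log(eggs, breakfast.eggs);       // 3 2
```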

Arrays

Given an array of waffles that looks like this:

var wafflesIAte = [ 1, 3, 2, 0, 5, 2, 11 ];

Iterate through array items

We can iterate through each item in the array using a for loop:

for (var i = 0; i < wafflesIAte.length; i++) {
    console.log('array index: ', i);
    console.log('item from array: ', wafflesIAte[i]);
}

This produces:

array index:  0
item from array:  1
array index:  1
item from array:  3
array index:  2
item from array:  2
array index:  3
item from array:  0
array index:  4
item from array:  5
array index:  5
item from array:  2
array index:  6
item from array:  11

Some things to remember: i in the above context is a placeholder; we could substitute anything we like (x, n, underpants, etc). It simply denotes each instance of the iteration.

i < wafflesIAte.length tells our for loop to continue as long as i is less than the array’s length (in this case, 7).

i++ is equivalent to i = i + 1 and means we’re incrementing through our array by one each time. We could also use i += 2 to proceed with every other item in the array, for example.
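Here's what stepping through every other item might look like. The everyOther array is a hypothetical name used to collect the results.

```javascript
var wafflesIAte = [ 1, 3, 2, 0, 5, 2, 11 ];

var everyOther = [];
// i += 2 reassigns i. Writing just i + 2 here would compute a value
// and throw it away, so i would never change and the loop would never end.
for (var i = 0; i < wafflesIAte.length; i += 2) {
    everyOther.push(wafflesIAte[i]);
}

console.log(everyOther); // [ 1, 2, 5, 11 ]
```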

Access array item by index

We can specify an item in the array using the array index, written as wafflesIAte[i] where i is any index of the array. This gives the item at that location.

Array indexes always start at 0, so the first item is accessed with wafflesIAte[0]. Using wafflesIAte[1] gives us the second item in the array, which is 3.

Ways to get mixed up over arrays

Remember that wafflesIAte.length and the index of the last item in the array are different. The former is 7, the latter is 6.

When incrementing i, remember that [i+1] and [i]+1 are different:

console.log('[i+1] gives next array index: ', wafflesIAte[0+1]);
console.log('[i]+1 gives index value + 1: ', wafflesIAte[0]+1);

Produces:

[i+1] gives next array index:  3
[i]+1 gives index value + 1:  2
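The length-versus-last-index mix-up tends to show up in two places: reading the last item, and writing the loop condition. A short sketch of both (lastIndex and visited are names I added for illustration):

```javascript
var wafflesIAte = [ 1, 3, 2, 0, 5, 2, 11 ];

var lastIndex = wafflesIAte.length - 1;
console.log(wafflesIAte[lastIndex]);          // 11

// Using length as an index reads one past the end.
console.log(wafflesIAte[wafflesIAte.length]); // undefined

// A loop condition of <= instead of < makes the same mistake:
// it runs one extra time and picks up an undefined.
var visited = [];
for (var i = 0; i <= wafflesIAte.length; i++) {
    visited.push(wafflesIAte[i]);
}
console.log(visited.length); // 8
```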

Practice makes… better

The more often you code and correct your errors, the better you’ll remember it next time!

That’s all for now. If you have a correction, best practice, or another common error for me to add, please let me know!

How to Replace a String with sed in Current and Recursive Subdirectories

2017-05-06T20:04:53+08:00

I’ve probably run some variation of “find and replace across multiple files” thousands of times in my career. It’s one of those operations that seems straightforward until you’re staring at a codebase with 500,000 lines spread across 2,000 files, and you need to rename a function that’s used everywhere. Get it wrong, and you’re looking at hours of manual cleanup—or worse, subtle bugs that only surface in production.

Here’s the approach I use, why some methods work better than others, and some tips that can save you from that sinking feeling when you realize you just broke prod.

Current Directory Only

You can use sed by itself to make changes to files in the current directory, ignoring subdirectories.

.
├── index.html        # Change this file
└── blog
    ├── list.html     # Don't change
    └── single.html   # these files

To replace all occurrences of “foo” with “bar” in files within the current directory:

sed -i -- 's/foo/bar/g' *

Here’s what each component of the command does:

-i will change the original, and stands for “in-place.”
s is for substitute, so we can find and replace.
foo is the string we’ll be taking away,
bar is the string we’ll use instead today.
g as in “global” means “all occurrences, please.”
* denotes all file types. (No more rhymes. What a tease.)

You can limit the operation to one file type, such as Python files, by using a matching pattern:

sed -i -- 's/foo/bar/g' *.py

The Performant Recursive Pattern

Here’s a performant command for making changes in the current directory and all subdirectories:

find . -type f -name "*.py" -exec sed -i 's/old_function_name/new_function_name/g' {} +

Let me break this down because each piece matters more than you might think:

find . starts from the current directory
-type f only matches files (not directories)
-name "*.py" filters to Python files (adjust the pattern for your needs)
-exec sed -i 's/old/new/g' {} + runs sed on batches of files

That + at the end instead of \; is crucial for performance. It batches multiple files into each sed call instead of spawning a new process for every single file. When you’re dealing with thousands of files, this can be the difference between a 5-second operation and a 5-minute one.

The Safer Version I Actually Use

But in the real world, it might not be best to run that command as-is. Here’s a more accidentally-had-decaf-proof version:

# First, see what we're dealing with
find . -type f -name "*.py" -exec grep -l "old_function_name" {} +

# Test on a single file first
find . -type f -name "*.py" -exec grep -l "old_function_name" {} + | head -1 | xargs sed -i.bak 's/old_function_name/new_function_name/g'

# If that looks good, run on everything
find . -type f -name "*.py" -exec sed -i.bak 's/old_function_name/new_function_name/g' {} +

That .bak extension creates backup files automatically. Yes, you should be using version control, but I’ve seen too many scenarios where someone needed to quickly revert a change and of course they hadn’t started with a clean working tree.

The backup files are easy to clean up later:

find . -name "*.bak" -delete

When GNU sed vs BSD sed Actually Matters

Here’s something you run into when you switch from Linux to macOS: sed behaves differently. BSD sed (default on macOS) requires an argument to -i, even if it’s empty:

# Linux (GNU sed)
sed -i 's/old/new/g' file.txt

# macOS (BSD sed) - this breaks
sed -i 's/old/new/g' file.txt

# macOS (BSD sed) - this works
sed -i '' 's/old/new/g' file.txt
# or with backup
sed -i '.bak' 's/old/new/g' file.txt

You can also write portable versions:

# Portable approach
if sed --version 2>/dev/null | grep -q GNU; then
    find . -type f -name "*.py" -exec sed -i 's/old/new/g' {} +
else
    find . -type f -name "*.py" -exec sed -i '' 's/old/new/g' {} +
fi

Or use the backup approach everywhere since it works on both:

find . -type f -name "*.py" -exec sed -i.bak 's/old/new/g' {} +

Handling Special Characters Without Losing Your Mind

When your search string contains slashes, quotes, or regex metacharacters, things get interesting.

Instead of fighting with escaping, change the delimiter:

# Instead of this nightmare
sed -i 's/https:\/\/old\.domain\.com\/api/https:\/\/new\.domain\.com\/api/g' file.txt

# Use this
sed -i 's|https://old.domain.com/api|https://new.domain.com/api|g' file.txt

You can use almost any character as the delimiter. I usually go with | for URLs and # for file paths or when I’m dealing with email addresses (it’s easier to differentiate from a lowercase L).

For really complex patterns, sometimes it’s easier to put the sed script in a file:

# In replace.sed
s|https://old.domain.com/api|https://new.domain.com/api|g
s/DEBUG = True/DEBUG = False/g
s/old_secret_key/new_secret_key/g

# Use it
find . -type f -name "*.py" -exec sed -i.bak -f replace.sed {} +

This approach is also great for complex replacements that you’ll need to run multiple times or document for your team.

Performance Considerations That Actually Matter

When you’re dealing with large codebases, performance starts to matter. Seemingly simple find-and-replace operations could take 20+ minutes on large repositories when done inefficiently.

The biggest performance killer is usually file selection. Don’t do this:

# Slow—processes every file then filters
find . -type f -exec grep -l "old_string" {} + | xargs sed -i 's/old/new/g'

Do this instead:

# Fast—filters files first
find . -type f -name "*.py" -exec sed -i 's/old/new/g' {} +

If you need to be more selective about which files to process, use multiple find conditions:

# Only process Python files that aren't in virtual environments or build directories
find . -type f -name "*.py" ! -path "./venv/*" ! -path "./build/*" ! -path "./.git/*" -exec sed -i.bak 's/old/new/g' {} +

When sed Isn’t the Right Tool

It’s tempting to force sed to do things it’s not great at. Here’s when I reach for other tools:

For complex transformations: Use a proper scripting language. A 50-line sed script could be 10 lines of Python and infinitely more readable.

For structured data: If you’re modifying JSON, YAML, or XML, use tools that understand the format. sed doesn’t know about string escaping or nested structures.

For very large files: sed -i rewrites the whole file (via a temporary copy) on every run, even if only one line changes. For multi-gigabyte files, stream processing tools like awk might be better.

For interactive replacements: Use your editor’s project-wide search and replace, or preview every match with a tool like rg (ripgrep) before you change anything.
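To give a feel for the “proper scripting language” option, here's a minimal sketch in JavaScript (Node.js), the language used elsewhere on this blog. It isn't a drop-in replacement for sed: replaceInTree is a hypothetical helper, it matches a literal string and not a regex, and it reads each file fully into memory, so it's only a starting point for small-to-medium text files.

```javascript
var fs = require('fs');
var path = require('path');

// Hypothetical helper: replace every occurrence of the literal string `from`
// with `to` in files under `dir` whose names end with `ext`, keeping a .bak
// copy of each changed file. Returns the list of files it modified.
function replaceInTree(dir, ext, from, to) {
    var changed = [];
    fs.readdirSync(dir, { withFileTypes: true }).forEach(function(entry) {
        var fullPath = path.join(dir, entry.name);
        if (entry.isDirectory()) {
            // Recurse into subdirectories, like find does.
            changed = changed.concat(replaceInTree(fullPath, ext, from, to));
        } else if (entry.isFile() && entry.name.endsWith(ext)) {
            var before = fs.readFileSync(fullPath, 'utf8');
            // split/join replaces a literal string, so no regex escaping is needed.
            var after = before.split(from).join(to);
            if (after !== before) {
                fs.copyFileSync(fullPath, fullPath + '.bak'); // backup first
                fs.writeFileSync(fullPath, after);
                changed.push(fullPath);
            }
        }
    });
    return changed;
}

// Example call (not run here, since it would modify files):
// replaceInTree('.', '.py', 'old_function_name', 'new_function_name');
```

Unlike sed -i.bak, this sketch only writes a backup for files that actually changed, which keeps the cleanup step smaller.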

The Nuclear Option: Parallel Processing

If you’re dealing with truly massive codebases (millions of lines), you might need to get aggressive about performance:

# Find all target files first
find . -type f -name "*.py" ! -path "./venv/*" > /tmp/files_to_process

# Process them in parallel
cat /tmp/files_to_process | xargs -n 50 -P 8 sed -i.bak 's/old/new/g'

That -P 8 runs up to 8 sed processes in parallel, and -n 50 processes 50 files per batch. Adjust based on your CPU cores and I/O capacity.

Testing Before You Commit

Here’s a thorough testing workflow for large replacements:

# 1. Count occurrences before
find . -type f -name "*.py" -exec grep -c "old_string" {} + | awk -F: '{sum+=$2} END {print sum}'

# 2. Run replacement with backups
find . -type f -name "*.py" -exec sed -i.bak 's/old_string/new_string/g' {} +

# 3. Count occurrences after (should be 0)
find . -type f -name "*.py" -exec grep -c "old_string" {} + | awk -F: '{sum+=$2} END {print sum}'

# 4. Spot check a few files
find . -name "*.bak" | head -5 | while read backup; do
    original="${backup%.bak}"
    echo "=== $original ==="
    diff "$backup" "$original"
done

# 5. Run tests
make test  # or whatever your test command is

# 6. If everything looks good, clean up backups
find . -name "*.bak" -delete

Using sed in Real-World Scenarios

API endpoint migration: Moving from v1 to v2 API endpoints meant updating hundreds of URL references across multiple repositories. The key was being selective about file types and using exact matches to avoid accidentally changing documentation or comments that mentioned the old API.

Database migrations: After a database refactor for a Django application, sed came in handy for making changes to complex Django migration files. I used different sed patterns for different contexts—from Python to raw SQL—because the replacement patterns were slightly different in each case.

Configuration key updates: When our configuration format changed, I needed to update key names across config files, code references, and documentation. This one required multiple passes with different patterns because the same logical key appeared in different syntactic contexts.

The Debugging Workflow That Saves Time

When a sed operation goes wrong (and it will), here’s how I debug:

Check what files were actually modified:

find . -name "*.bak" -exec sh -c 'diff -q "$1" "${1%.bak}"' _ {} \; | head -10

Look for unintended matches:

find . -name "*.bak" -exec sh -c 'diff "$1" "${1%.bak}"' _ {} \; | grep "^<" | sort | uniq -c | sort -nr

Restore and try a more specific pattern:

find . -name "*.bak" -exec sh -c 'mv "$1" "${1%.bak}"' _ {} \;

The pattern of creating backups, testing the results, and having a quick rollback strategy will save you countless hours. It’s especially important when you’re working on shared codebases where a mistake affects your entire team.

While sed operations might seem like they’re just for simple text processing, they can help with critical steps in deployments, migrations, and refactoring efforts that affect real systems and real users. Taking the time to do them safely and efficiently pays dividends when you’re not scrambling to fix broken builds or track down subtle bugs that only show up in production.

Things you need to know about becoming a Data Scientist

2017-03-31T13:19:19+09:00

I recently attended a panel discussion hosted by General Assembly in Singapore entitled, “So you want to be a Data Scientist/Analyst”. The panel featured professionals in different stages of their careers and offered a wealth of information to an audience of hopefuls, including tips on how to land a job as a data scientist, and stories debunking myths that color this field.

The panelists

Misrab Faizullah-Khan - VP of Data Science, GO_JEK
Anthony Ta - Data Scientist, Tech in Asia
Leow Guo Jun - Data Scientist, GO_JEK
Gabriel Jiang - Data Scientist
Adam Drake - Chief Data Officer, Atazzo

Here’s a rundown of the major points discussed, paraphrased for brevity.

What’s a day-in-the-life like

We’re mostly “data janitors.” A large part of working with data begins with and consists of data sanitation. Without quality data, you won’t get accurate results. Understanding how data should be sanitized largely encompasses skills that aren’t directly related to data analytics. To fully understand the problem you’re hoping to solve, you need to talk with the people involved. It’s important that everyone understands all the elements of a project, and exactly what those elements are being called. “Sales,” as an example, may be calculated differently depending on who you’re talking to.

What’s a data “scientist” vs. data “analyst”

It largely depends on the company you work for. “Data [insert modifier]” is only a recent distinction for a job field that has historically been called “Business Analytics.” In a smaller company, as with any other position, one person may handle a variety of data-related tasks under the title of “Data Scientist.” In a larger company with more staff and finer-grained specialization, you may have a “Data Analyst” who handles less technical aspects, and a “Data Scientist” whose work is very technical and involves quantitative learning or machine learning.

The field of data science/analytics is fresh enough that standard definitions for job titles really haven’t been agreed upon yet. When considering a position, focus on the company rather than the title.

Should I join a startup or large company

There’s no wrong answer. Being aware of your own working style and preferences will help guide your decision.

Startups generally offer more freedom and less micromanaging. This also means that you’ll necessarily receive less guidance, and will need to be able to figure stuff out, learn, and make progress under your own power.

In a big company, you’re likely to experience more structure, and be expected to follow very clearly defined pre-existing processes. Your job scope will likely be more focused than it would be at a startup. You’ll experience less freedom in general, but also more certainty in what’s expected of you.

In the end, especially at the beginning of your career, don’t put too much stock in choosing one or the other. If you like the company, big or small, give it a try. If you’re not happy there after a few months, then try another one. No career decision is ever permanent.

It’s also worthwhile to note that even if you find a company you like the first time around, it’s in your best interest to change companies after one or two years. The majority of the salary raises you’ll earn in your lifetime will occur in the first ten years of your career. Say you’re hired by Company A as a junior data scientist for two years - after two years, you’re no longer a junior. You can now earn, say, a 30% higher salary in a data scientist position, but it’s unlikely that Company A will give you a 30% raise after two years. At that point it’s time to find Company B and put a few more years of experience on your resume, then probably change companies again. You don’t earn the big bucks sticking with one company for decades - you’ll always be the junior developer.

What do you look for when hiring a candidate

Overall, the most important skills for a data science candidate are soft skills. Curiosity, tenacity, and good communication skills are vital. Persistence, especially when it comes to adapting to a quickly changing industry, is important. The most promising candidates are passionate enough about the field to be learning everything they can, even outside of their work scope. Hard skills like coding and algorithms can be taught - it’s the soft skills that set good candidates apart.

Hacking skills are also vital. This doesn’t necessarily mean you can write code. Someone who has a grasp of overall concepts, knows algorithms, and has curiosity enough to continuously learn is going to go farther than someone who can just write code. It takes creativity to build hacking skills on top of being familiar with the basic navigation points. Having the ability to come up with solutions that use available tools in new ways - that’s hacking skill.

Design thinking is another important asset. Being able to understand how systems integrate on both technical and business levels is very valuable. If you’re able to see the big picture, you’re more likely to find different ways to accomplish the overall objective.

You might think that putting buzzwords on your resume makes you look more attractive as a candidate - more often, they stand out as a red flag. Putting “advanced machine learning” on your CV and then demonstrating that you don’t know basic algorithms doesn’t look good. It’s your projects and your interests outside of the job you’re applying for that say the most about you. Popular topics in this industry change fast - you’re better off having a solid grasp of basic fundamentals as well as a broad array of experience than name-dropping whatever’s trending.

Is there a future for humans in the data science field? When will the machines replace us

This isn’t a question unique to data science, and many historical examples already exist. Financial investment is a good example - where you used to have a human do calculations and make predictions, computers now do a lot of that automatically, making decisions about risk and possible payoff every day.

Where humans won’t be replaced, just as in other industries that have embraced automation, is in the human element. You’ll still need people to handle communication, be creative, be curious, make interpretations and understand problems… all those things are fundamentally human aspects of enterprise.

Ultimately, machines and more automation will make human work less of a grind. By automating the mundane stuff, like data sanitization for example, human minds are freed up to develop more interesting things.

What are the future applications for data-driven automation

Legal is a good next candidate for automation. There’s a lot there that can be handled by programs using data to assess risk.

Medicine is another field ripe for advances through data. Radiologists, your days are numbered: image detection is coming for you. The whole field of diagnostics is about to drastically change.

A particularly interesting recent application for data science is in language translation. By looking at similarities in sentence structure and colloquial speech across different languages, we’re able to sort similar words based on the “space” they occupy within the language structure.

Insurance - the original data science industry - already is and will continue to become very automated. With increased ability to use data to assess risk, we’re beginning to see new creative insurance products being introduced. E-commerce companies can now buy insurance on the risk a customer will return a product - hard to do without the accessibility of data that we have today.

How do I push data-driven decisions and get my boss to agree with me

It’s a loaded question. The bottom line is that it depends on the company’s data culture and decision path. We’ve experienced working for management who say, “We’ve already made the decisions, we just need the data to prove it.” Obviously, that’s a tough position to work from.

Generally, ask yourself, “Am I making my boss look good?” You might hear that and think, “Why would I let my boss get all the credit?” - but who cares? Let them take the credit. If you’re producing good work, you’re making your team look good. If you make your team look good, you’re indispensable to your team and your boss. People who are indispensable are listened to.

What’s your best advice for a budding data scientist

Don’t be too keen to define yourself too quickly. If you narrow your focus too much, especially when you’re learning, you can get stuck in a situation of having become an expert in “Technology A, version 3” when companies are looking to hire experts in version 4. It happens.

A broad understanding of fundamentals will be far more valuable to you on the whole. Maybe you start out writing code, and decide you don’t like it, but discover that you’re really good at designing big picture stuff and leading teams, and you end up as a technical lead. It could even vary depending on the company you work for - so stay flexible.

Your best bet is to follow what you’re passionate about, and try to understand a wide range of overall concepts. Spend the majority of your efforts learning things that are timeless, like the base technologies under hot-topic items like TensorFlow. Arm yourself with a broad understanding of the terrain, different companies, and the products that are out there.

If you focus on learning code specifically, learning one language well makes it easier to learn others. Make sure you understand the basics.

TL;dr it

Adam: Talk more and don’t give up.
Anthony: [Be] courageous, and hands-on.
Gabriel: Be creative.
Guo Jun: It’s worth the pain.
Misrab: Evaluate yourself and maintain a feedback loop.

How I created custom desktop notifications using terminal and cron

2017-02-21T10:48:38+07:00

In my last post I talked about moving from Windows 10 to running i3 on Linux, built up from Debian Base System. Among other things, this change has taught me about the benefits of using basic tools and running a minimal, lightweight system. You can achieve a lot of functionality with just command line tools and simple utilities. One example I’d like to illustrate in this post is setting up desktop notifications.

I use dunst for desktop notifications. It’s a simple, lightweight tool that is easy to configure, doesn’t have many dependencies, and can be used across various distributions.

Battery status/low battery notification

I was looking for a simple, versatile set up to create notifications for my battery status without having to rely on separate, standalone GUI apps or services. In my search I came across a simple one-line cron task that seemed to be the perfect fit. I adapted it to my purpose and it looks like this:

# m h  dom mon dow   command
*/5 * * * * acpi --battery | awk -F, '/Discharging/ { if (int($2) < 20) print }' | xargs -ri env DISPLAY=:0 notify-send -u critical -i "/usr/share/icons/Paper/16x16/status/xfce-battery-critical.png" -t 3000 "{}\nBattery low!"

Psst… here’s a great tool for formatting your crontab times.

There’s a lot going on here, so let’s break it down:

*/5 * * * * Every five minutes, do the following.

acpi --battery Execute acpi and show battery information, which on its own returns something akin to: Battery 0: Discharging, 65%, 03:01:27 remaining

Pretty straightforward so far. At any point you could input acpi --battery in a terminal and receive the status output. Today’s post, however, is about receiving this information passively in a desktop notification. So, moving on:

| awk -F, '/Discharging/ { if (int($2) < 20) print }' Pipe (|) the result of the previous command to awk. (If you don’t know what pipe does, here’s an answer from superuser.com that explains it pretty well, I think.) awk can do a lot of things, but in this case, we’re using it to examine the status of our battery. Let’s zoom in on the awk command:

awk -F, '/Discharging/ { if (int($2) < 20) print }' Basically, we’re saying, “Hey, awk, look at that input you just got and try to find the word ‘Discharging,’ then look to see if the number after the first comma is less than 20. If so, print the whole input.”
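If awk syntax is new to you, it may help to see the same check spelled out in JavaScript. This is purely an illustration and isn't part of the cron job. lowBatteryLine is a hypothetical function, and the sample lines follow the acpi output format shown above.

```javascript
// Sample line in the format shown above for `acpi --battery`.
var sample = 'Battery 0: Discharging, 65%, 03:01:27 remaining';

// Hypothetical function mirroring the awk filter: return the line if the
// battery is discharging and below the threshold, otherwise null.
function lowBatteryLine(line, threshold) {
    if (line.indexOf('Discharging') === -1) {
        return null;                       // like the /Discharging/ pattern
    }
    var fields = line.split(',');          // like -F,
    var percent = parseInt(fields[1], 10); // like int($2): " 65%" becomes 65
    return percent < threshold ? line : null;
}

console.log(lowBatteryLine(sample, 20)); // null

console.log(lowBatteryLine('Battery 0: Discharging, 12%, 00:20:10 remaining', 20));
// Battery 0: Discharging, 12%, 00:20:10 remaining

console.log(lowBatteryLine('Battery 0: Charging, 12%, 01:10:00 until charged', 20)); // null
```

When the function returns null, that corresponds to awk printing nothing, which is what lets xargs -r skip the notification.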

| xargs -ri Pipe the result of the previous command to xargs, which takes it as its input and does more stuff with it. -ri is equivalent to -r (run the next command only if it receives arguments) and -i (look for “{}” and replace it with the input). So in this example, xargs serves as our gatekeeper and messenger for the next command.

env DISPLAY=:0 Run the following utility in the specified display, in this case, the first display of the local machine.

notify-send -u critical -i "/usr/share/icons/Paper/16x16/status/xfce-battery-critical.png" -t 3000 "{}\nBattery low!" Shows a desktop notification with -u critical (critical urgency), -i (the specified icon), -t 3000 (display time/expires after 3000 milliseconds), and finally {} (the output of awk, replaced by xargs).

Not bad for a one-liner! I made a few modifications for different states of my battery. Here they all are in my crontab:

# m h  dom mon dow   command
*/5 * * * * acpi --battery | awk -F, '/Discharging/ { if ( (int($2) < 30) && (int($2) >= 15) ) print }' | xargs -ri env DISPLAY=:0 notify-send -a "Battery status" -u normal -i "/usr/share/icons/Paper/16x16/status/xfce-battery-low.png" -t 3000 "{}\nBattery low!"
*/5 * * * * acpi --battery | awk -F, '/Discharging/ { if (int($2) < 15) print }' | xargs -ri env DISPLAY=:0 notify-send -a "Battery status" -u critical -i "/usr/share/icons/Paper/16x16/status/xfce-battery-critical.png" -t 3000 "{}\nSeriously, plug me in."
*/60 * * * * acpi --battery | awk -F, '/Discharging/ { if (int($2) >= 30) print }' | xargs -ri env DISPLAY=:0 notify-send -a "Battery status" -u normal -i "/usr/share/icons/Paper/16x16/status/xfce-battery-ok.png" "{}"
*/60 * * * * acpi --battery | awk -F, '/Charging/ { print }' | xargs -ri env DISPLAY=:0 notify-send -a "Battery status" -u normal -i "/usr/share/icons/Paper/16x16/status/xfce-battery-ok-charging.png" "{}"
*/60 * * * * acpi --battery | awk -F, '/Charging/ { if (int($2) == 100) print }' | xargs -ri env DISPLAY=:0 notify-send -a "Battery status" -u normal -i "/usr/share/icons/Paper/16x16/status/xfce-battery-full-charging.png" "Fully charged."

By the way, you can open your crontab in the editor of your choice by accessing it as root from the /var/spool/cron/crontabs/ directory. However, it’s generally best practice to make changes to your crontab with the command crontab -e.

You can see that each notification makes use of the {} placeholder that tells xargs to put its input there - except for the last one. This is interesting because in this case, we’re only using xargs -ri as a kind of switch to present the notification. The actual information that was the input for xargs is not needed in the output in order to create a notification.

Additional notifications with command line tools

With cron and just a few combinations of simple command line tools, you can create interesting and useful notifications. Consider the following:

Periodically check your dhcp address

*/60 * * * * journalctl | awk -F: '/dhcp/ && /address/ { print $5 }' | tail -1 | xargs -ri env DISPLAY=:0 notify-send -a "dhcp address" -u normal "{}"

Which does the following: */60 * * * * Every 60 minutes.

journalctl Take the contents of your system log.

| awk -F: '/dhcp/ && /address/ { print $5 }' Find logs containing both “dhcp” and “address” and output the 5th portion as separated by “:” (the colons inside the timestamp count as separators too).

| tail -1 Take the last line of the output.

| xargs -ri env DISPLAY=:0 notify-send -a "dhcp address" -u normal "{}" Create the desktop notification including the output.
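To see why the address ends up in the 5th field, here is a rough Python sketch of the awk and tail steps. The log lines are hypothetical and only illustrate one possible format; real journalctl output varies by system, so the field number may need adjusting. last_dhcp_field is a made-up helper name.

```python
def last_dhcp_field(log_lines):
    """Return the 5th colon-separated field of the last line mentioning dhcp and address."""
    matches = []
    for line in log_lines:
        # Like awk's '/dhcp/ && /address/': both words must appear (case-sensitive).
        if "dhcp" in line and "address" in line:
            fields = line.split(":")
            # awk's $5 is index 4 here; awk yields an empty string if the field is missing.
            matches.append(fields[4] if len(fields) > 4 else "")
    # Like tail -1: keep only the last match, if any.
    return matches[-1] if matches else None


# Hypothetical log lines for illustration only.
sample_log = [
    "Jan 28 13:16:17 laptop NetworkManager[612]: <info> dhcp4 (wlan0): address 192.168.1.23",
    "Jan 28 13:16:18 laptop kernel: wlan0: link becomes ready",
    "Jan 28 14:02:41 laptop NetworkManager[612]: <info> dhcp4 (wlan0): address 192.168.1.42",
]
print(last_dhcp_field(sample_log))
```

In these sample lines the timestamp alone uses up two separators, which is why the interesting part lands in field 5 and not field 3.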

Periodically display the time and date

*/60 * * * * timedatectl status | awk -F\n '/Local time/ { print }' | xargs -ri env DISPLAY=:0 notify-send -a "Current Time" -u normal "{}"

System log activity

You can also search your system logs (try journalctl) for any number of things using awk, enabling you to get periodic notifications of virtually any logged events.

Experiment

As with all things, you are only limited by your imagination! I hope this post has given you some idea about the endless possibilities of these simple utilities. Thanks for reading!

How I ditched WordPress and set up my custom domain HTTPS site for (almost) free

2017-01-28T13:16:17+07:00

I got annoyed with WordPress.com. While using the service has its pros (like https and a mobile responsive website, and being very visual and beginner-friendly) it’s limiting. For someone who’s comfortable enough to be tweaking CSS but who’s not interested in creating their own theme (or paying upwards of $50 for one), I felt I wasn’t really the type of consumer WordPress.com was suited to.

To start with, if you want to remove WordPress advertising and use a custom domain name, it’s a minimum of $3 per month. If, like me, the free themes provided aren’t just what you’re looking for, you’re stuck with two choices: buy a theme for $50+, or pay $8.25 per month to do some css customization. I don’t know about you, but I feel like there should be a hack for this.

How I ditched WordPress and got everything I wanted for free

Okay, almost free. You still have to pay at least $0.99 for a domain name.

For those of you technical enough to skip reading a long post, the recipe is this:

Buy a custom domain via this Namecheap affiliate link (Thanks for your support! 😊)
Install Hugo, my favorite static site generator
Host with GitHub Pages
Put your custom domain to work with GitHub Pages
~~Use Cloudflare’s free plan~~ Enforce HTTPS for GitHub Pages

Let’s do the nitty gritty:

1. Buy a custom domain

This one’s pretty simple. Head on over to Namecheap, Gandi, or if you’re rolling in dough, GoDaddy. Find your perfect web address and buy it up.

If it’s a personal domain like yourname.com, it’s a pretty good idea to pay upfront for five years or even ten years, if you’ve got the cash. It’ll save you the trouble of remembering to renew, allow you to build your personal brand, and prevent someone else from buying up your URL.

If you’re just trying out an idea, you can go with a one-year $0.99 experiment. Namecheap also gives you WhoisGuard (domain registration privacy) free for one year.

2. Install Hugo

I’m a big fan of Hugo so far. Admittedly, those who feel more comfortable with a visual, WYSIWYG editor may feel like a fish out of water at first. As long as you’re not afraid of using the command line, though, using Hugo is pretty straightforward. The fact that I have access to all my code is my favorite part. It’s only as simple or complicated as I want it to be.

Hugo is open source and free. They’ve got great documentation, and following their Quickstart guide line-by-line will get you set up with your new site in minutes.

If you’re not used to the idea of your site existing as files and folders, the basic premise is this: Hugo, along with the themes available, helps you to create all the pages and files that your site needs to run.

Blog posts can be written in Markdown and saved in your /content/blog/ folder; preferences for your site and theme can be set in the config.toml file. After that, generating all your site’s pages is as quick and easy as typing the command hugo --theme=. You’ll be able to see a live version of your site in your browser as you’re editing it (go to http://localhost:1313/ in your browser, as described in Step 5) so you’re not flying blind.

3. Host with GitHub Pages

If you read to Step 12 of Hugo’s Quickstart Guide, you’ll see that they even provide instructions for hosting your files on GitHub Pages. If you’re new to Git, you’ll first need to sign up at GitHub and then set up Git. GitHub is a very friendly resource, and you can find a multitude of code examples and guides in connection with it. The Hello World Guide will take you through all you need to know to use GitHub.com.

Once you’re comfortable with the way GitHub works generally, setting up a site by following the guide on GitHub Pages is no big deal. If you followed the Hugo Quickstart Guide up to Step 11, you’ll want to jump to Step 12 after creating the repository on GitHub.

In case it’s not clear, once you set up your new repository on GitHub called yourusername.github.io, grab the HTTPS link at the top. From there it’s just a few simple commands to create the git repository for your site and push it to your new web address:

## from yoursite/public folder:
$ git init
$ git remote add origin 
$ git add --all
$ git commit -m "Initial commit."
$ git push origin master

Have a little celebration - your site is already up at https://yourusername.github.io! Now for the pizza-de-resilience: the custom domain.

4. Point your custom domain to GitHub Pages

To set up your site at the apex domain (meaning yourname.com will replace yourusername.github.io), there are just four steps:

Add your domain to your GitHub Pages site repository
In your domain registrar’s DNS settings, create A records pointing to GitHub’s IP addresses
In your domain registrar’s DNS settings, create a CNAME record pointing to yourusername.github.io
Make sure there’s a CNAME file in the root directory of your GitHub repository containing yourname.com (your custom domain)

5. Enforce HTTPS for GitHub Pages

GitHub Pages supports HTTPS through a partnership with Let’s Encrypt! This greatly simplifies the process of serving your site securely. Just look for this clever checkbox in the Settings of your site’s GitHub repository.

Why do I need HTTPS anyway? For one, it’ll give your site a little boost on Google. More importantly, it’s fundamental to your website security. You can learn more about HTTPS and TLS in this post.

That’s pretty much it! If you don’t see changes right away, give all your services a lunch hour or so to propagate. Soon your site will be up and running at https://yourname.com.

Thanks for reading! If you found this post helpful, there’s a lot more where this came from. You can subscribe below to see new posts first.

Iteration in Python: for, list, and map

2017-01-18T21:58:28+07:00

Iteration in Python can be a little hard to understand. Subtle differences in terminology like iteration, iterator, iterating, and iterable aren’t the most beginner-friendly.

When tackling new concepts, I find concrete examples to be most useful. I’ll share some in this post and discuss appropriate situations for each. (Pun intended.)

For loop

First, in pseudocode:

for iterating_variable in iterable:
    statement(s)

I find for loops to be the most readable way to iterate in Python. This is especially nice when you’re writing code that someone else needs to read and understand, which is always.

An iterating_variable, loosely speaking, is anything you could put in a group. For example: a letter in a string, an item from a list, or an integer in a range of integers.

An iterable houses the things you iterate on. This can also take different forms: a string with multiple characters, a range of numbers, a list, and so on.

A statement or multiple statements indicates doing something to the iterating variable. This could be anything from mathematical expressions to simply printing a result.

Here are a couple simple examples that print each iterating_variable of an iterable:

for letter in "Hello world":
    print(letter)

for i in range(10):
    print(i)

breakfast_menu = ["toast", "eggs", "waffles", "coffee"]
for choice in breakfast_menu:
    print(choice)

You can even use the for keyword in a more compact form, such as this one-liner (technically a generator expression, not a for statement):

breakfast_buffet = " ".join(str(item) for item in breakfast_menu)

The downside to for loops is that they can be a bit verbose, depending on how much you’re trying to achieve. Still, for anyone hoping to make their Python code as easily understood as possible, for loops are the most straightforward choice.
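To make the trade-off concrete, here is a small sketch that writes the compact one-liner and the spelled-out loop side by side, reusing the breakfast_menu list from the earlier example. The names compact, pieces, and spelled_out are just for illustration.

```python
breakfast_menu = ["toast", "eggs", "waffles", "coffee"]

# The compact form: a generator expression passed straight to join().
compact = " ".join(str(item) for item in breakfast_menu)

# The same work written as an ordinary for loop.
pieces = []
for item in breakfast_menu:
    pieces.append(str(item))
spelled_out = " ".join(pieces)

print(compact)                 # toast eggs waffles coffee
print(compact == spelled_out)  # True
```

Both versions build the same string; the loop takes four lines to say what the one-liner says in one, but each step is visible.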

List comprehensions

A pseudocode example:

new_list = [statement(s) for iterating_variable in iterable]

List comprehensions are a concise and elegant way to create a new list by iterating on variables. Once you have a grasp of how they work, you can perform efficient iterations with very little code.

List comprehensions will always return a list, which may or may not be appropriate for your situation.

For example, you could use a list comprehension to quickly add a 15% tip to a few bar tabs at once and print the new totals:

tabs = [23.60, 42.10, 17.50]
tabs_incl_tip = [round(tab*1.15, 2) for tab in tabs]
print(tabs_incl_tip)

>>> [27.14, 48.41, 20.12]

In one concise line, we’ve taken each tab amount, added a 15% tip, rounded it to the nearest cent, and made a new list of the tabs plus the tip values.

List comprehensions can be an elegant tool if output to a list is useful to you. Be advised that the more statements you add, the more complicated your list comprehension begins to look, especially once you get into nested list comprehensions. If your code isn’t well annotated, it may become difficult for another reader to figure out.
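As a small illustration of how nesting affects readability, here is a sketch that flattens a list of lists both ways. The orders data is hypothetical.

```python
# Hypothetical data: each inner list is one table's order.
orders = [["toast", "coffee"], ["eggs"], ["waffles", "coffee"]]

# Nested list comprehension: the two for clauses read left to right
# in the same order as the nested loops below.
all_items = [item for table in orders for item in table]

# The equivalent nested for loops.
all_items_loop = []
for table in orders:
    for item in table:
        all_items_loop.append(item)

print(all_items)
print(all_items == all_items_loop)
```

The comprehension is shorter, but a reader has to know that the for clauses run in the same order as the nested loops. A short comment goes a long way here.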

Map

How to map, in pseudocode:

map(statement, iterable)

Map is pretty compact, for better or worse. It can be harder to read and understand, especially if your line of code has a lot of parentheses.

In terms of efficiency for character count, map is hard to beat. It applies a function (the statement in the pseudocode above) to every item of your iterable and returns an iterator.

Here’s an example casting each element of input().split() (the iterable) from a string to an integer. Since map returns an iterator, the result is also converted to a list.

values = list(map(int, input().split()))
weights = list(map(int, input().split()))
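The reason for wrapping map in list() is easier to see in a short sketch. Here a fixed string stands in for input() so the example runs without typing anything; the variable names are only for illustration.

```python
# A fixed string stands in for input().
raw = "3 1 4"

numbers = map(int, raw.split())  # an iterator; nothing has been converted yet
first_pass = list(numbers)       # consuming it performs the conversions
second_pass = list(numbers)      # the iterator is now exhausted

print(first_pass)   # [3, 1, 4]
print(second_pass)  # []
```

An iterator can only be walked through once, so converting to a list up front is a sensible choice when you need the values more than once, as with weights in the next example.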

It’s worth noting that you can also use for loops, list comprehension, and map all together:

output = sum([x[0] * x[1] for x in zip(values, weights)]) / sum(weights)

print(round(output, 1))
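For comparison, here is a sketch of the same weighted average computed with each of the three approaches. The values and weights are hypothetical stand-ins for the two input() lines.

```python
# Hypothetical inputs, standing in for the two input() lines.
values = [10, 20, 30]
weights = [1, 2, 3]

# 1. A for loop.
total = 0
for value, weight in zip(values, weights):
    total += value * weight
loop_result = total / sum(weights)

# 2. A list comprehension (the same approach as the line above).
comp_result = sum([v * w for v, w in zip(values, weights)]) / sum(weights)

# 3. map with a two-argument function, which pairs up items from both lists.
map_result = sum(map(lambda v, w: v * w, values, weights)) / sum(weights)

print(round(loop_result, 1), round(comp_result, 1), round(map_result, 1))
```

All three give the same answer, so the choice mostly comes down to which one your readers will find easiest to follow.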

Your iteration toolbox

Each of these methods of iteration in Python has a special place in the code I write every day. I hope these examples have helped you see how to use for loops, list comprehensions, and map in your own Python code!

If you like this post, there’s a lot more where that came from! I write about efficient programming for coders and for leading technical teams. Check out the posts below!

/site

0001-01-01T00:00:00+00:00

What’s All This?

Welcome to victoria.dev, a personal website wholly created and owned by me, Victoria Drake. I’ve produced everything you see on the site—from research and writing to illustrations, design, code, and deployment.

The site is open source, so feel free to explore how it works!

Technical Features

Static Site Generation: Built with Hugo for speed and flexibility.
Search Functionality: Implemented using Lunr.js.
Illustrations: I create all the illustrations and comics in my articles on my iPad.
IndieWeb Integration: I’ve implemented microformats2 markup, making the site compatible with social readers and other IndieWeb sites.

Continuous Deployment

The site is automatically deployed with each update using GitHub Pages and GitHub Actions.

Development Tools

Link Checking: I created link-snitch, a custom GitHub Action, to regularly check for broken links across the site. It’s powered by Hydra, a multithreaded Python site-crawling link checker built with the standard library.
Code Quality: I use the pre-commit framework with markdownlint-cli2 to maintain content quality.
Self-Documenting Makefile: A self-documenting Makefile helps streamline development workflows without having to remember command-line flags.

Contributions

If you find a mistake or bug, please open an issue so it can be fixed!

I don’t accept guest blog posts or requests for placing links in posts.

License

This work is licensed under a Creative Commons Attribution 4.0 International License.

There may or may not be secret pages and easter eggs on the site.

Hello! Your turn.

0001-01-01T00:00:00+00:00

Email me

You can say hello@victoria.dev.

Please note that I do not accept guest blog posts or requests for placing links in posts.

Find me in the ‘verse

You can also learn how I built this site or sign up for my newsletter below.

Victoria Drake

0001-01-01T00:00:00+00:00

Engineering Director and Principal Software Engineer

As a seasoned engineering leader, I guide teams to produce secure, industry-leading software for both government and private sector entities. My unique experience equips me to navigate the full spectrum of technology product development, from roadmap conception to final deployment and operations.

I’ve excelled as an engineering director, principal software engineer, and mentor. I connect executive decisions with hands-on technical experience. I enjoy sharing my expertise at many levels, whether it’s advising C-suite executives on technology strategy or software and security best practices, unblocking development teams, or deep-diving the codebase with developers to tackle complex architectural and technical challenges. I particularly thrive in leading distributed teams and bringing together complementary skills.

I’m a co-author of the OWASP Web Security Testing Guide and advocate for cybersecurity training across organizations. Along with contributions to various business and engineering publications, my track record reflects my commitment to understanding the technology and security terrain and ensuring teams can traverse it with confidence.

You can read more about me or send me an email.

Download a PDF version of my resume.

Principal Software Engineer

September 2021 — September 2023

Principal Software Engineer for Sophos Factory, a modern DevSecOps automation pipeline builder.

Enhanced platform scalability of a newly-acquired startup, aligning it with the broader business needs of Sophos through strategic technical and operational enhancements
Collaborated across teams to seamlessly integrate Sophos Factory with broader Sophos offerings and product strategy
Established standards and workflow processes to scale up the delivery of new features
Ensured roadmap planning and comprehensive feature requirements aligned with the broader company strategy and roadmap

Director Of Engineering

March 2020 — September 2021

Directed software development at ZibaSec, a modern cybersecurity awareness training platform that uses realistic phishing simulations to create lasting behavior change and cyber risk reduction.

Led the engineering group to design, implement, and secure a serverless cloud infrastructure while greatly improving application performance and achieving FedRAMP Authorization
Achieved 4.5x speedup in serverless application performance using multiple infrastructure components and distributed computing techniques
Created and implemented strategies for increasing knowledge transfer and organizational scalability in a growing, remote-first company
Reduced onboarding time for new engineers by 75% by leading an overhaul of onboarding processes and documentation
Scaled the engineering team size by 3x through improved processes for recruiting, interviewing, and hiring

OWASP Core Contributor Team

August 2019 — present

Co-author and core maintainer for the OWASP WSTG. The Open Web Application Security Project (OWASP) Web Security Testing Guide (WSTG) is the foremost open source resource for testing web application security.

Built and established modern CI/CD and automation practices
Serve as technical editor for submissions from contributors

OWASP Web Security Testing Guide v4.2 released

freeCodeCamp Coding Mentor, Contributor

2017 — 2021

Recognized as a Top Contributor for three consecutive years at freeCodeCamp, a global 501(c)(3) non-profit organization that helps millions of people worldwide learn how to code.

Served as organizer for the 2017 inaugural freeCodeConference in Toronto
Provided mentorship, code review, and career guidance to motivated technologists worldwide

Senior Software Developer, Consultant

2016 — 2021

As a senior technology leader with a background in cybersecurity and full-stack software development, I provided executive leadership insights and technical guidance on product and process improvement.

Focus areas:

Leader mentorship and development
Increasing development velocity in engineering teams
Application infrastructure and code efficiency, speedup, and cost savings

Products and case studies:

ApplyByAPI.com, SaaS that improves the technical hiring process by filtering candidates at the top of the funnel, and reduces human hours spent on screening
Modern e-commerce solutions for legacy industries, such as large-scale commercial building construction materials
Product design and product management for applications including an audio virtual reality application

GitHub Action Hero: Victoria Drake

Education

Master of Science, Computer Science - Georgia Institute of Technology

Contact

hello@victoria.dev

Victoria's Bookshelf

0001-01-01T00:00:00+00:00

Books that have measurably contributed to my skill stack are shared here.

Required reading for technology leaders

Extreme Ownership: How U.S. Navy SEALs Lead and Win

Jocko Willink, Leif Babin

Foundational mindset and principles of leadership. How taking ownership of your work, project, and yourself helps to make you a better leader.

The Art of Action: How Leaders Close the Gaps between Plans, Actions and Results

Stephen Bungay

To understand how empowering your team to make decisions without you provides a significant competitive edge. An extremely worthwhile leadership curriculum. I'd get the hardcover.

Thinking, Fast and Slow

Daniel Kahneman

To help gain a foundational understanding of how people think and react. Largely considered transformative in the field of cognitive psychology.

Non-coding books for coders

A lot of non-technical knowledge gems can contribute to your programming skills! Here are the most helpful ones I’ve read myself.

Victoria Drake on victoria.dev

Why the Best Engineers Will Thrive Alongside AI

AI Amplifies Systems Thinking Through Better Collaboration

Human Skills Become Your Competitive Advantage

Building AI-Native Systems From the Ground Up

Developing AI Collaboration Skills

Positioning for Long-Term Success

The Compound Advantage of Early Adoption

From Problem Solver to Problem Solver Creator

Teaching Through Ownership, Not Tasks

Building Problem-Solving Muscle Through Learning

Communication That Enables Independent Thinking

Identifying and Developing Natural Problem-Solving Styles

Removing Obstacles to Problem-Solving Growth

The Multiplier Effect

I Spent $78 Learning Why Bash Still Matters in the AI Age

What just happened?

The expensive lesson in algorithmic thinking

The real cost of convenience

Create Better Code Documentation 10x Faster with AI

Documentation That Welcomes New Team Members

Operational Documentation That Actually Helps

Capturing Institutional Knowledge

Making Documentation a Team Superpower

Practical Tips for Better Results

Post to your static website from your iPhone

How to send long text input to ChatGPT using the OpenAI API

Chunking your input

Handling responses

Putting it all together

Error handling

Optimization

Now what

Optimizing text for ChatGPT: NLP and text pre-processing techniques

Text preprocessing

Tokenization and ChatGPT input limits

A general programmatic approach

Byte-Pair Encoding (BPE)

Sending lots of text to ChatGPT

Mastering Git for Small Teams

A Protected Main Branch (No Exceptions)

One Issue, One Branch, One PR (Keep It Simple)

Author's illustration of issue branches and releases from master.

Avoiding the Common Disasters

Why This Actually Works

Introducing The Tech Leader Docs

My paper to-do strategy

One page at a time

Intuitive notation

When it’s time to turn the page

Time well spent doing

Set up a Pi-hole VPN on an AWS Lightsail instance

Create and connect to a Lightsail instance

Install OpenVPN on your server

Install and configure Pi-hole

Configure OpenVPN

Configure firewall

Test your client connection

Save iptables

Future tasks

Beyond Gut Feelings: How I Use Issue Metrics to Boost Engineering Velocity

Getting quality data

Plotting with Pandas

You aim for what you measure

There are better options for a privacy-respecting phone

Linux phones (sort of)

De-googled Android

Hardware

LineageOS

GrapheneOS

New phone, who dis?

Software

1. Official APKs

2. Use F-Droid

3. Aurora Store

4. If you need Google Apps

The TL;DR

The Doorway Problem: Why Building in Isolation Fails

The Planning Fallacy (Or: Why We’re All Terrible at This)

Context Is Your Reality Check

Save `iptables`