Fortified Bikesheds: Basic Change Process Revisited

Saturday, December 10, 2011

Basic Change Process Revisited

In an earlier post, I explained the basic change process. In this post I'd like to show how it works in a distributed version control system. Besides encouraging the general use of distributed version control systems, I'd like to expose the difference between the classic branching diagrams and the actual meaningful part of the revision history.

As a reminder, here's the branch diagram again:

Start at a stable revision;
Branch, make change;
Meanwhile, some other change makes it into the stable branch;
Pull in the new change, merge;
Add your change to the stable branch (assuming, of course, it passes QA).

Here's what it looks like in the distributed world:

You start with your main repository. This will usually be hosted on some kind fo more centralized host, or on a service like GitHub or bit bucket. As I explained in my Cherrypicking Made Easy post, the best repo to use is the one that represents your current production version of the code. I believe that in the end, this is the only repository that truly matters to Release Management...

If you wish to make a change, you fork or clone your own repository off the main repo. If you use a remote service, you may wish to first fork/clone the repo on that remote service, then clone onto your local machine.

Now edit, compile, test and check in - creating the green revision in the diagram.

Meanwhile, some other change appears in the main repository, marked as the blue change.

In preparation to getting your green change out, you first pull in the blue change from the main repository. Note that this pull operation usually doesn't change any content, it simply creates a new head in your revision graph.

Inside your repository, you merge the two heads, creating a new revision with the merge result. Note how this creates this diamond shape in the revision graph. "git" users may opt to perform a rebase instead. This essentially revolves rewriting the revision history to remove the green side of the diamond and to pretend that your change was a simple addition.

Finally, if everything looks good, your change can be pushed back into the main repo. This should be done by the owner of the main repo, via a so-called pull request. This allows the repo owner to examine the changes prior to pulling them in.

The interesting thing here is how the branching diagram of the basic change process simplifies into the diamond shaped revision graph. This will become more interesting as we examine the basic technique to produce Release Notes by examining changes contributing to a specific revision (usually a release).

A very good read to get into the mood of distributed version control systems is http://hginit.com, especially if you're a subversion or perforce user.

2 comments:

Derick LyleJune 3, 2012 at 8:09 AM
I'm not sure that I agree with you that the owner of the main repo should pull changes. In a collaborative environment where you want to control the selection of submitted features, I would buy that argument. In my experience, merging other people's code is notoriously error-prone, and the best person to do the job is the change author. You also have more accountability from both sides - my feature, my merge, my bugs to fix, and my issues to correct if my merge clobbered some other functionality.
ReplyDelete
Replies
CGJune 3, 2012 at 8:27 AM
One doesn't exclude the other. You pull in any other changes, resolve the merge in your repo, and then make the pull request, asking the curator of the authoritative repo to pull in your merge.

If at that point, the curator of the repo notices a merge conflict, he can simply reject the pull request and ask you to perform a pull/merge yourself first.
ReplyDelete
Replies

Add comment

Note: Only a member of this blog may post a comment.

Story So Far

The goal of the first batch of posts is to describe a software build and release process which assumes that builds are not necessarily reproducible and expensive to perform, and also assumes a large number of independent development teams all working on some grand piece of software.

Good release management starts at the source, so the first few postings deal about source code control and change management, and how to mine the change data correctly.

Once we can reliably build stuff, we need to manage the build products, the artifacts, so they can be reused, tested and released.