One Thing I Love About Git

By Ovid on February 24, 2010 9:02 PM

If you've been using Subversion or (shudder) CVS, you only have the briefest glimmerings of what source control is about. I don't really like having to dig too deeply into tools that I use. I want them to be easy, but I dig when something's hard.

One thing which frustrated me about Subversion is the fact that, as mentioned, I don't think about some things. More than once I've quickly hacked up a change to a module, switched other modules to use that module and done a quick svn rm and svn add.

Oops. I just lost my version history. Damn it.

Not with git. It figures it out for me. My Veure project uses DBIx::Class::Schema::Loader because I don't want to think about building my schema classes. The 0.5003 version is fantastic. It does a better job of naming relationships and the DBIx::Class::Schema::Loader::DBI::Pg support is fantastic.

But what happens if you rename a table? I've done this more than once on my new project, continually working to ensure that I keep things clean and consistent. Unfortunately, renaming the foo_bar table to foo means that a different schema class is created, separate from the old one. Naturally, I don't think about this and I do a quick git rm and git add.

And git sees what I did and it realizes that the file has been renamed and I don't lose my history.
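To make that concrete, here is a minimal sketch in a throwaway repository. The file names and the contents of the schema class are made up for illustration; the only point is that a git rm followed by a git add of a nearly identical file shows up as a rename.

```shell
#!/bin/sh
set -e

# Throwaway repository so nothing real is touched
repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"
git config user.email "example@example.com"

# Hypothetical stand-in for a generated schema class
mkdir -p lib/Schema/Result
cat > lib/Schema/Result/FooBar.pm <<'EOF'
package Schema::Result::FooBar;
use strict;
use warnings;
use base 'DBIx::Class::Core';
# column definitions would go here
# relationship definitions would go here
# primary key definition would go here
1;
EOF
git add lib
git commit -q -m "Add FooBar result class"

# The table is renamed, so a new class appears with almost the same content
sed 's/FooBar/Foo/' lib/Schema/Result/FooBar.pm > lib/Schema/Result/Foo.pm
git rm -q lib/Schema/Result/FooBar.pm
git add lib/Schema/Result/Foo.pm

# The staged change is reported as a rename, not as a delete plus an add
status_output=$(git status --short)
echo "$status_output"
```

The status line should start with an R and show the old path pointing at the new one.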

From what I read (I could be wrong), git actually uses heuristics to determine whether or not a file has been renamed, but so far it's worked flawlessly for me. I am still using Subversion at work (I've not felt comfortable checking out the git-svn work, but that's me being silly), and it's just painful. I doubt I'll ever use Subversion for a personal project again.

Update: came into work this morning and was asked to merge one branch into another. It was conflict hell and I was having trouble figuring out all of the changes. A colleague, using git-svn, merged the two branches cleanly, no problem. I think it's time for me to learn git-svn now.

Tagged as: dvcs, git, source control, subversion

12 Comments

denishowe | February 25, 2010 10:04 AM | Reply

Just wanted to say, I find almost every one of your posts fascinating and educational. Keep up the great work.

Ovid | February 25, 2010 10:52 AM | Reply

@denishowe: thank you. That's very kind of you.

Pedro Melo | February 25, 2010 11:04 AM | Reply

Yes, git uses a heuristic to detect renames.

It compares the files and calculates the percentage of common lines.

If the percentage is above X, then this is a rename. You can see the similarity between two files in an extended diff header called similarity index. Try a git diff -M between two revisions where you renamed some files and look for it.

I don't know the value of X. I looked for it in the docs and I cannot find it. I know that X is not 100%. For example, if you rename a Perl package, even with the changed package Name line, git will still detect the rename.
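For anyone who wants to see the header, here is a small sketch in a scratch repository (the module name and contents are invented). As far as the git-diff documentation goes, -M accepts an optional threshold and the default is 50%, so that is probably the X in question. The second diff asks for exact matches only with -M100%, which should make the same change look like a delete plus an add.

```shell
#!/bin/sh
set -e

repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"
git config user.email "example@example.com"

# Hypothetical Perl package used only for this demonstration
cat > FooBar.pm <<'EOF'
package FooBar;
use strict;
use warnings;
sub new { my $class = shift; return bless {}, $class }
sub name { return 'example' }
1;
EOF
git add FooBar.pm
git commit -q -m "Add FooBar"

# Rename the file and change the package line in the same commit
git mv FooBar.pm Foo.pm
sed 's/^package FooBar;/package Foo;/' Foo.pm > Foo.pm.tmp
mv Foo.pm.tmp Foo.pm
git add Foo.pm
git commit -q -m "Rename FooBar to Foo"

# Default threshold: the diff carries rename and similarity headers
diff_output=$(git diff -M HEAD~1 HEAD)
echo "$diff_output" | grep -E '^(similarity index|rename (from|to))'

# Exact matches only: the edited file no longer counts as a rename
strict_output=$(git diff -M100% --name-status HEAD~1 HEAD)
echo "$strict_output"
```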

BTW, why git works this way is explained in a very old message in the git mailing list archive. It is said [1] that it's the most important message in the git history:

http://article.gmane.org/gmane.comp.version-control.git/217

As for git-svn, it's a must :), just make sure you get decent Perl svn bindings.

Ben Morrow | February 25, 2010 11:25 PM | Reply

It's important to realise that, unlike svn, git doesn't even store renames in the repository. It doesn't version files, it versions whole trees, and any 'find me the last revision to change this file' logic is heuristic, based on similarity. This means, apart from anything else, that git is quite capable of coping with 'rename a file, make a copy, change both in different ways' in a single revision, which svn certainly isn't (when I was still using svk I managed to break the repo several times by forgetting to commit a rename before a change, but I think that may have been a svk rather than a svn bug).
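A quick way to see that the rename is worked out at query time rather than stored: in the sketch below (scratch repository, invented file names), a plain path-limited git log stops at the commit that introduced the new name, while git log --follow applies the similarity heuristic and carries on back through the old name.

```shell
#!/bin/sh
set -e

repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"
git config user.email "example@example.com"

# Hypothetical module, committed under its old name
cat > FooBar.pm <<'EOF'
package FooBar;
use strict;
use warnings;
sub new { my $class = shift; return bless {}, $class }
sub name { return 'example' }
1;
EOF
git add FooBar.pm
git commit -q -m "Add FooBar"

# Rename without telling git anything special about it
git mv FooBar.pm Foo.pm
git commit -q -m "Rename FooBar to Foo"

# Path-limited log only sees commits that touch the path Foo.pm
plain_count=$(git log --oneline -- Foo.pm | wc -l | tr -d ' ')

# --follow detects the rename on the fly and continues under the old name
follow_count=$(git log --follow --oneline -- Foo.pm | wc -l | tr -d ' ')

echo "without --follow: $plain_count commit(s)"
echo "with --follow:    $follow_count commit(s)"
```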

Aristotle | February 26, 2010 6:51 AM | Reply

Although one should be careful about that. I still prefer not to make sweeping changes to a file in the same commit as a rename. Small edits like changing package lines are OK, but for very much more than that I prefer to rename and edit in separate commits, to avoid straining the heuristic.
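Here is a rough sketch of the difference, again in a scratch repository with made-up files. One file is renamed and rewritten wholesale in a single commit; the other is renamed in its own commit before any editing. The name-status output should show the first as an unrelated delete and add, and the second as a clean rename.

```shell
#!/bin/sh
set -e

repo=$(mktemp -d)
cd "$repo"
git init -q
git config user.name "Example User"
git config user.email "example@example.com"

# Two hypothetical modules to compare the two approaches
cat > Combined.pm <<'EOF'
package Combined;
use strict;
use warnings;
sub greet { return "hello" }
1;
EOF
cat > Split.pm <<'EOF'
package Split;
use strict;
use warnings;
sub shout { return "HELLO" }
1;
EOF
git add Combined.pm Split.pm
git commit -q -m "Add both modules"

# Approach 1: rename and sweeping rewrite in one commit
git rm -q Combined.pm
cat > CombinedNew.pm <<'EOF'
package CombinedNew;
our $VERSION = '2.0';
sub farewell { my ($self, $who) = @_; return "goodbye, $who" }
sub count { return scalar @{ $_[0]{items} } }
1;
EOF
git add CombinedNew.pm
git commit -q -m "Rename and rewrite in one go"
combined_status=$(git diff -M --name-status HEAD~1 HEAD)

# Approach 2: rename in its own commit, edit afterwards
git mv Split.pm SplitNew.pm
git commit -q -m "Rename Split to SplitNew"
rename_status=$(git diff -M --name-status HEAD~1 HEAD)

echo "one commit:"
echo "$combined_status"
echo "separate rename commit:"
echo "$rename_status"
```

After the second approach, the sweeping edit can go into a follow-up commit against SplitNew.pm without the heuristic having anything to guess.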

Ovid | February 26, 2010 6:52 AM | Reply

@morrow.me.uk: You wrote:

when I was still using svk I managed to break the repo several times by forgetting to commit a rename before a change, but I think that may have been a svk rather than a svn bug.

That may have been an svk bug, but I do know that older versions of Subversion used to break horribly if you renamed something and then changed it before committing. Since I hate committing broken code, moving a package was problematic. If someone else was on the same branch, there was always a chance that I would move lib/Foo/Bar.pm to lib/Foo/Bar/Baz.pm and have to commit while the code read package Foo::Bar;.

In other words, Subversion forces you to commit broken code. The latest versions seem to have fixed this, but it was a nightmare for a while.

Sawyer X | February 26, 2010 9:27 AM | Reply

This is kind of belated, but what the hell.

I always like posts on Git vs. others, because I don't have the experience with various SCMs that other people have, and I can learn about them from their experience, so I appreciate that.

Regarding git-svn, that's one thing I can contribute to. A colleague and I have been using it at $work with our rather-big-svn-repo, and it's pure heaven. Basically the difference for the major commands is:

Instead of "svn up", you do "git svn rebase". You cannot "git svn rebase" if you have uncommitted changes in your working tree.

Instead of "svn commit", you do all your commits regularly with "git commit" and when you're done you run "git svn dcommit". That just uploads your commits to the subversion repo (the way "svn commit" does).

Basic work cycle is:

git svn clone --username user svn://.../
(you will be prompted for the password unless it's saved by the subversion client)
(work work work)
git commit, git commit, git commit
git svn rebase
git svn dcommit

It's pretty easy once you just try it.

Ovid | February 26, 2010 12:45 PM | Reply

@SawyerX: you'll be happy to know that some of that's made it to our internal wiki. I've now switched to git-svn and updated the wiki to let new starters know how.

Sawyer X | February 26, 2010 5:12 PM | Reply

I'm really happy to hear it :)

Ben Morrow | February 27, 2010 4:51 PM | Reply

@Aristotle: Oh, I agree. I'm talking more about the small changes needed to make something compile, or about splitting a file in two without losing history for either copy.

Anonymous | March 25, 2010 4:03 PM | Reply

Please change your background so that it's possible to read the blog after the second paragraph and the comments. You'll get a lot more responses!

Ovid replied to comment from Anonymous | March 25, 2010 4:21 PM | Reply

@Anonymous: screenshot? Which browser? I've no idea what you're referring to. In fact, the comment sounded so strange that at first I thought it was spam. Then I realized you weren't selling anything (maybe you're just a particularly inept spammer? :)

About Ovid

Freelance Perl/Testing/Agile consultant and trainer. See http://www.allaroundtheworld.fr/ for our services. If you have a problem with Perl, we will solve it for you. And don't forget to buy my book! http://www.amazon.com/Beginning-Perl-Curtis-Poe/dp/1118013840/