If you are not a Git geek this will be of limited interest and I urge you “move along”

Updated after tips from Randal Schwartz and Charles Bailey

Still here? OK. As you will know every file stored in a Git repo is identified by a SHA1 hexadecimal string (as explained in the Pro Git book, it’s the hash of the concatenation of the  file contents and a Git specific header). You can calculate the Git SHA1 of any file with the command

git hash-object $file

assuming you have Git installed

If you do not have access to Git, but do have access to a UNIX style shell or Perl you have a couple of options (the Pro Git book also shows a Ruby solution and Stackoverflow has examples in Python and #F))

On Cygwin, OS X or Linux

(printf "blob %s0" $(wc -c < $file);cat $file)|sha1sum -b | cut -d " " -f 1

(N.B. Assumes GNU tools installed on OS X)

On Solaris

(printf  "blob %s0" $(wc -c < $file);cat $file)|digest -a sha1 | cut -d " " -f 1

Using Perl

#!/usr/bin/env perl

# See also Git::PurePerl at http://search.cpan.org/dist/Git-PurePerl/

use strict;
use warnings;
use Digest::SHA1;

my @input = <>;

my $content = join("", @input);

my $git_blob = "blob " . length($content) . "\0" . $content;

my $sha1 = Digest::SHA1->new();


print $sha1->hexdigest();