kvz.io
Published on

Create Youtube-Like IDs With PHP/Python/Javascript/Java/SQL

Authors
  • avatar
    Name
    Kevin van Zonneveld
    Twitter
    @kvz

IDs are often numbers. Unfortunately there are only 10 digits to work with, so if you have a lot of records, IDs tend to get very lengthy. For computers that's OK. But human beings like their IDs as short as possible. So how can we make IDs shorter? Well, we could borrow characters from the alphabet as have them pose as additional numbers.... Alphabet to the rescue!

Other title options were

  • How to create unique short string IDs with PHP & MySQL
  • Or how to create IDs similar to YouTube e.g. yzNjIBEdyww

I created this function a long time ago. Time to be nice and share.

More is Less - the 'math'

The alphabet has 26 characters. That's a lot more than 10 digits. If we also distinguish upper- and lowercase, and add digits to the bunch for the heck of it, we already have (26 x 2 + 10) 62 options we can use per position in the ID.

Now of course we can also add additional funny characters to 'the bunch' like - / * & # but those may cause problems in URLs and that's our target audience for now.

OK so because there are roughly 6x more characters we will use per position, IDs will get much shorter. We can just fit a lot more data in each position.

This is basically what url shortening services do like tinyurl, is.gd, or bit.ly. But similar IDs can also be found at youtube: https://www.youtube.com/watch?v=yzNjIBEdyww

Convert Your IDs

Now unlike Database servers: webservers are easy to scale so you can let them do a bit of converting to ease the life of your users, while keeping your database fast with numbers (MySQL really likes them plain numbers ; )

To do the conversion I've written a PHP function that can translate big numbers to short strings and vice versa. I call it: alphaID.

The resulting string is not hard to decipher, but it can be a very nice feature to make URLs or directory structures more compact and significant.

So basically:

  • when someone requests rLHWfKd
  • alphaID() converts it to 999999999999
  • you lookup the record for id 999999999999 in your database

Source

<?php
/**
 * Translates a number to a short alphanumeric version
 *
 * Translated any number up to 9007199254740992
 * to a shorter version in letters e.g.:
 * 9007199254740989 --> PpQXn7COf
 *
 * specifying the second argument true, it will
 * translate back e.g.:
 * PpQXn7COf --> 9007199254740989
 *
 * this function is based on any2dec && dec2any by
 * fragmer[at]mail[dot]ru
 * see: https://nl3.php.net/manual/en/function.base-convert.php#52450
 *
 * If you want the alphaID to be at least 3 letter long, use the
 * $pad_up = 3 argument
 *
 * In most cases this is better than totally random ID generators
 * because this can easily avoid duplicate ID's.
 * For example if you correlate the alpha ID to an auto incrementing ID
 * in your database, you're done.
 *
 * The reverse is done because it makes it slightly more cryptic,
 * but it also makes it easier to spread lots of IDs in different
 * directories on your filesystem. Example:
 * $part1 = substr($alpha_id,0,1);
 * $part2 = substr($alpha_id,1,1);
 * $part3 = substr($alpha_id,2,strlen($alpha_id));
 * $destindir = "/".$part1."/".$part2."/".$part3;
 * // by reversing, directories are more evenly spread out. The
 * // first 26 directories already occupy 26 main levels
 *
 * more info on limitation:
 * - https://blade.nagaokaut.ac.jp/cgi-bin/scat.rb/ruby/ruby-talk/165372
 *
 * if you really need this for bigger numbers you probably have to look
 * at things like: https://theserverpages.com/php/manual/en/ref.bc.php
 * or: https://theserverpages.com/php/manual/en/ref.gmp.php
 * but I haven't really dug into this. If you have more info on those
 * matters feel free to leave a comment.
 *
 * The following code block can be utilized by PEAR's Testing_DocTest
 * <code>
 * // Input //
 * $number_in = 2188847690240;
 * $alpha_in  = "SpQXn7Cb";
 *
 * // Execute //
 * $alpha_out  = alphaID($number_in, false, 8);
 * $number_out = alphaID($alpha_in, true, 8);
 *
 * if ($number_in != $number_out) {
 *   echo "Conversion failure, ".$alpha_in." returns ".$number_out." instead of the ";
 *   echo "desired: ".$number_in."\n";
 * }
 * if ($alpha_in != $alpha_out) {
 *   echo "Conversion failure, ".$number_in." returns ".$alpha_out." instead of the ";
 *   echo "desired: ".$alpha_in."\n";
 * }
 *
 * // Show //
 * echo $number_out." => ".$alpha_out."\n";
 * echo $alpha_in." => ".$number_out."\n";
 * echo alphaID(238328, false)." => ".alphaID(alphaID(238328, false), true)."\n";
 *
 * // expects:
 * // 2188847690240 => SpQXn7Cb
 * // SpQXn7Cb => 2188847690240
 * // aaab => 238328
 *
 * </code>
 *
 * @author  Kevin van Zonneveld <kevin@vanzonneveld.net>
 * @author  Simon Franz
 * @author  Deadfish
 * @author  SK83RJOSH
 * @copyright 2008 Kevin van Zonneveld (https://kevin.vanzonneveld.net)
 * @license   https://www.opensource.org/licenses/bsd-license.php New BSD Licence
 * @version   SVN: Release: $Id: alphaID.inc.php 344 2009-06-10 17:43:59Z kevin $
 * @link    https://kevin.vanzonneveld.net/
 *
 * @param mixed   $in   String or long input to translate
 * @param boolean $to_num  Reverses translation when true
 * @param mixed   $pad_up  Number or boolean padds the result up to a specified length
 * @param string  $pass_key Supplying a password makes it harder to calculate the original ID
 *
 * @return mixed string or long
 */
function alphaID($in, $to_num = false, $pad_up = false, $pass_key = null)
{
  $out   =   '';
  $index = 'abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ';
  $base  = strlen($index);

  if ($pass_key !== null) {
    // Although this function's purpose is to just make the
    // ID short - and not so much secure,
    // with this patch by Simon Franz (https://blog.snaky.org/)
    // you can optionally supply a password to make it harder
    // to calculate the corresponding numeric ID

    for ($n = 0; $n < strlen($index); $n++) {
      $i[] = substr($index, $n, 1);
    }

    $pass_hash = hash('sha256',$pass_key);
    $pass_hash = (strlen($pass_hash) < strlen($index) ? hash('sha512', $pass_key) : $pass_hash);

    for ($n = 0; $n < strlen($index); $n++) {
      $p[] =  substr($pass_hash, $n, 1);
    }

    array_multisort($p, SORT_DESC, $i);
    $index = implode($i);
  }

  if ($to_num) {
    // Digital number  <←  alphabet letter code
    $len = strlen($in) - 1;

    for ($t = $len; $t >= 0; $t--) {
      $bcp = bcpow($base, $len - $t);
      $out = $out + strpos($index, substr($in, $t, 1)) * $bcp;
    }

    if (is_numeric($pad_up)) {
      $pad_up--;

      if ($pad_up > 0) {
        $out -= pow($base, $pad_up);
      }
    }
  } else {
    // Digital number  -->>  alphabet letter code
    if (is_numeric($pad_up)) {
      $pad_up--;

      if ($pad_up > 0) {
        $in += pow($base, $pad_up);
      }
    }

    for ($t = ($in != 0 ? floor(log($in, $base)) : 0); $t >= 0; $t--) {
      $bcp = bcpow($base, $t);
      $a   = floor($in / $bcp) % $base;
      $out = $out . substr($index, $a, 1);
      $in  = $in - ($a * $bcp);
    }
  }

  return $out;
}

Get from GitHub

Example

Running:

alphaID(9007199254740989);

will return PpQXn7COf and:

alphaID('PpQXn7COf', true);

will return 9007199254740989

Easy right?

More Features

  • There also is an optional third argument: $pad_up. This enables you to make the resulting alphaID at least X characters long.
  • You can support even more characters (making the resulting alphaID even smaller) by adding characters to the $index var at the top of the function body.

Bonus

Thanks to some wonderful contributions in the comment section, here are some interesting updates & additions:

Pro Tip

You may want to remove vowels (a, e, i, o, u) from $index as to avoid combinations that result in: 'penis' or other dirty words that could get your customers upset.

You can also use the $pad_up argument to enforce a minimum length of 5 characters as to avoid: 'nsfw' and 'wtf'.

Thanks to William for pointing this out ; )

Postgres Implementation

Thanks to William as well:

CREATE OR REPLACE FUNCTION string_to_bits(input_text TEXT)
RETURNS TEXT AS $$
DECLARE
output_text TEXT;
i INTEGER;
BEGIN
output_text := '';


FOR i IN 1..char_length(input_text) LOOP
output_text := output_text || ascii(substring(input_text FROM i FOR 1))::bit(8);
END LOOP;


return output_text;
END;
$$ LANGUAGE plpgsql;


CREATE OR REPLACE FUNCTION id_to_sid(id INTEGER)
RETURNS TEXT AS $$
DECLARE
output_text TEXT;
i INTEGER;
index TEXT[];
bits TEXT;
bit_array TEXT[];
input_text TEXT;
BEGIN
input_text := id::TEXT;
output_text := '';
index := string_to_array('0,d,A,3,E,z,W,m,D,S,Q,l,K,s,P,b,N,c,f,j,5,I,t,C,i,y,o,G,2,r,x,h,V,J,k,-,T,w,H,L,9,e,u,X,p,U,a,O,v,4,R,B,q,M,n,g,1,F,6,Y,_,8,7,Z', ',');

bits := string_to_bits(input_text);

IF length(bits) % 6 <> 0 THEN
bits := rpad(bits, length(bits) + 6 - (length(bits) % 6), '0');
END IF;

FOR i IN 1..((length(bits) / 6)) LOOP
IF i = 1 THEN
bit_array[i] := substring(bits FROM 1 FOR 6);
ELSE
bit_array[i] := substring(bits FROM 1 + (i - 1) * 6 FOR 6);
END IF;

output_text := output_text || index[bit_array[i]::bit(6)::integer + 1];
END LOOP;


return output_text;
END;
$$ LANGUAGE plpgsql;

Java Implementation

Thanks to Ant Kutschera there also is a Java version.

package uk.co.maxant.util;

import java.math.BigInteger;
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

/**
 * allows you to convert a whole number into a compacted representation of that number,
 * based upon the dictionary you provide. very similar to base64 encoding, or indeed hex
 * encoding.
 */
public class BaseX {

  /**
   * contains hexadecimals 0-F only.
   */
  public static final char[] DICTIONARY_16 =
    new char[]{'0','1','2','3','4','5','6','7','8','9','A','B','C','D','E','F'};

  /**
   * contains only alphanumerics, in capitals and excludes letters/numbers which can be confused,
   * eg. 0 and O or L and I and 1.
   */
  public static final char[] DICTIONARY_32 =
    new char[]{'1','2','3','4','5','6','7','8','9','A','B','C','D','E','F','G','H','J','K','M','N','P','Q','R','S','T','U','V','W','X','Y','Z'};

  /**
   * contains only alphanumerics, including both capitals and smalls.
   */
  public static final char[] DICTIONARY_62 =
    new char[]{'0','1','2','3','4','5','6','7','8','9','A','B','C','D','E','F','G','H','I','J','K','L','M','N','O','P','Q','R','S','T','U','V','W','X','Y','Z','a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z'};

  /**
   * contains alphanumerics, including both capitals and smalls, and the following special chars:
   * +"@*#%&/|()=?'~[!]{}-_:.,; (you might not be able to read all those using a browser!
   */
  public static final char[] DICTIONARY_89 =
    new char[]{'0','1','2','3','4','5','6','7','8','9','A','B','C','D','E','F','G','H','I','J','K','L','M','N','O','P','Q','R','S','T','U','V','W','X','Y','Z','a','b','c','d','e','f','g','h','i','j','k','l','m','n','o','p','q','r','s','t','u','v','w','x','y','z','+','"','@','*','#','%','&','/','|','(',')','=','?','~','[',']','{','}','$','-','_','.',':',',',';','<','>'};

  protected char[] dictionary;

  /**
   * create an encoder with the given dictionary.
   *
   * @param dictionary the dictionary to use when encoding and decoding.
   */
  public BaseX(char[] dictionary){
    this.dictionary = dictionary;
  }

  /**
   * creates an encoder with the {@link #DICTIONARY_62} dictionary.
   *
   * @param dictionary the dictionary to use when encoding and decoding.
   */
  public BaseX(){
    this.dictionary = DICTIONARY_62;
  }

  /**
   * tester method.
   */
  public static void main(String[] args) {
    String original = "123456789012345678901234567890";
    System.out.println("Original: " + original);
    BaseX bx = new BaseX(DICTIONARY_62);
    String encoded = bx.encode(new BigInteger(original));
    System.out.println("encoded: " + encoded);
    BigInteger decoded = bx.decode(encoded);
    System.out.println("decoded: " + decoded);
    if(original.equals(decoded.toString())){
      System.out.println("Passed! decoded value is the same as the original.");
    }else{
      System.err.println("FAILED! decoded value is NOT the same as the original!!");
    }
  }

  /**
   * encodes the given string into the base of the dictionary provided in the constructor.
   * @param value the number to encode.
   * @return the encoded string.
   */
  public String encode(BigInteger value) {

    List<Character> result = new ArrayList<Character>();
    BigInteger base = new BigInteger("" + dictionary.length);
    int exponent = 1;
    BigInteger remaining = value;
    while(true){
      BigInteger a = base.pow(exponent); //16^1 = 16
      BigInteger b = remaining.mod(a); //119 % 16 = 7 | 112 % 256 = 112
      BigInteger c = base.pow(exponent - 1);
      BigInteger d = b.divide(c);

      //if d > dictionary.length, we have a problem. but BigInteger doesnt have
      //a greater than method :-(  hope for the best. theoretically, d is always
      //an index of the dictionary!
      result.add(dictionary[d.intValue()]);
      remaining = remaining.subtract(b); //119 - 7 = 112 | 112 - 112 = 0

      //finished?
      if(remaining.equals(BigInteger.ZERO)){
        break;
      }

      exponent++;
    }

    //need to reverse it, since the start of the list contains the least significant values
    StringBuffer sb = new StringBuffer();
    for(int i = result.size()-1; i >= 0; i--){
      sb.append(result.get(i));
    }
    return sb.toString();
  }

  /**
   * decodes the given string from the base of the dictionary provided in the constructor.
   * @param str the string to decode.
   * @return the decoded number.
   */
  public BigInteger decode(String str) {

    //reverse it, coz its already reversed!
    char[] chars = new char[str.length()];
    str.getChars(0, str.length(), chars, 0);

    char[] chars2 = new char[str.length()];
    int i = chars2.length -1;
    for(char c : chars){
      chars2[i--] = c;
    }

    //for efficiency, make a map
    Map<Character, BigInteger> dictMap = new HashMap<Character, BigInteger>();
    int j = 0;
    for(char c : dictionary){
      dictMap.put(c, new BigInteger("" + j++));
    }

    BigInteger bi = BigInteger.ZERO;
    BigInteger base = new BigInteger("" + dictionary.length);
    int exponent = 0;
    for(char c : chars2){
      BigInteger a = dictMap.get(c);
      BigInteger b = base.pow(exponent).multiply(a);
      bi = bi.add(new BigInteger("" + b));
      exponent++;
    }

    return bi;

  }
}

JavaScript Implementation

Thanks to Even Simon, there's a JavaScript implementation. You will also find PHP version there, that implements the encode & decode functions as separate methods in a class.

/**
 *  Javascript AlphabeticID class
 *  (based on a script by Kevin van Zonneveld <kevin@vanzonneveld.net>)
 *
 *  Author: Even Simon <even.simon@gmail.com>
 *
 *  Description: Translates a numeric identifier into a short string and backwords.
 *
 *  Usage:
 *    var str = AlphabeticID.encode(9007199254740989); // str = 'fE2XnNGpF'
 *    var id = AlphabeticID.decode('fE2XnNGpF'); // id = 9007199254740989;
 **/

var AlphabeticID = {
  index: "abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ",

  /**
   *  [@function](https://twitter.com/function) AlphabeticID.encode
   *  [@description](https://twitter.com/description) Encode a number into short string
   *  [@param](https://twitter.com/param) integer
   *  [@return](https://twitter.com/return) string
   **/
  encode: function (_number) {
    if ("undefined" == typeof _number) {
      return null;
    } else if ("number" != typeof _number) {
      throw new Error("Wrong parameter type");
    }

    var ret = "";

    for (
      var i = Math.floor(Math.log(parseInt(_number)) / Math.log(AlphabeticID.index.length));
      i >= 0;
      i--
    ) {
      ret =
        ret +
        AlphabeticID.index.substr(
          Math.floor(parseInt(_number) / AlphabeticID.bcpow(AlphabeticID.index.length, i)) %
            AlphabeticID.index.length,
          1,
        );
    }

    return ret.reverse();
  },

  /**
   *  [@function](https://twitter.com/function) AlphabeticID.decode
   *  [@description](https://twitter.com/description) Decode a short string and return number
   *  [@param](https://twitter.com/param) string
   *  [@return](https://twitter.com/return) integer
   **/
  decode: function (_string) {
    if ("undefined" == typeof _string) {
      return null;
    } else if ("string" != typeof _string) {
      throw new Error("Wrong parameter type");
    }

    var str = _string.reverse();
    var ret = 0;

    for (var i = 0; i <= str.length - 1; i++) {
      ret =
        ret +
        AlphabeticID.index.indexOf(str.substr(i, 1)) *
          AlphabeticID.bcpow(AlphabeticID.index.length, str.length - 1 - i);
    }

    return ret;
  },

  /**
   *  [@function](https://twitter.com/function) AlphabeticID.bcpow
   *  [@description](https://twitter.com/description) Raise _a to the power _b
   *  [@param](https://twitter.com/param) float _a
   *  [@param](https://twitter.com/param) integer _b
   *  [@return](https://twitter.com/return) string
   **/
  bcpow: function (_a, _b) {
    return Math.floor(Math.pow(parseFloat(_a), parseInt(_b)));
  },
};

/**
 *  [@function](https://twitter.com/function) String.reverse
 *  [@description](https://twitter.com/description) Reverse a string
 *  [@return](https://twitter.com/return) string
 **/
String.prototype.reverse = function () {
  return this.split("").reverse().join("");
};

C# Implementation

Thanks to Romas, there's a C# implementation.

Improved by rumble|strip

class ShortId
{
    public static readonly string Alphabet = "abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ";

    private static decimal BcPow(double a, double b)
    {
      return Math.Floor((decimal)Math.Pow(a, b));
    }

    public static ulong Decode(string value, int pad = 0)
    {
      value = value.ReverseString();
      var len = value.Length - 1;
      ulong result = 0;

      for (int t = len; t >= 0; t--)
      {
        var bcp = (ulong)BcPow(Alphabet.Length, len - t);
        result += (ulong)Alphabet.IndexOf(value[t]) * bcp;
      }

      if (pad > 0)
      {
        result -= (ulong)BcPow(Alphabet.Length, pad);
      }

      return result;
    }

    public static string Encode(byte[] value, int startIndex = 0, int pad = 0)
    {
      return Encode(BitConverter.ToUInt64(value, startIndex), pad);
    }

    public static string Encode(Guid guid, int pad = 0)
    {
      var bytes = guid.ToByteArray();

      var first = Encode(bytes, 0, pad);
      var second = Encode(bytes, 8, pad);

      return first + second;
    }

    public static string Encode(ulong value, int pad = 0)
    {
      var result = string.Empty;

      if (pad > 0)
      {
        value += (ulong)BcPow(Alphabet.Length, pad);
      }

      for (var t = (value != 0 ? Math.Floor(Math.Log(value, Alphabet.Length)) : 0); t >= 0; t--)
      {
        var bcp = (ulong)BcPow(Alphabet.Length, t);
        var a = ((ulong)Math.Floor((decimal)value / (decimal)bcp)) % (ulong)Alphabet.Length;
        result += Alphabet[(int)a];
        value  = value - (a * bcp);
      }

      return result.ReverseString();
    }

    private static string ReverseString(this string value)
    {
      char[] arr = value.ToCharArray();
      Array.Reverse(arr);
      return new string(arr);
    }
  }

Python Implementations

Thanks to wessite, there's a Python implementation.

ALPHABET = "bcdfghjklmnpqrstvwxyz0123456789BCDFGHJKLMNPQRSTVWXYZ"
BASE = len(ALPHABET)
MAXLEN = 6

def encode_id(self, n):

    pad = self.MAXLEN - 1
    n = int(n + pow(self.BASE, pad))

    s = []
    t = int(math.log(n, self.BASE))
    while True:
        bcp = int(pow(self.BASE, t))
        a = int(n / bcp) % self.BASE
        s.append(self.ALPHABET[a:a+1])
        n = n - (a * bcp)
        t -= 1
        if t < 0: break

    return "".join(reversed(s))

def decode_id(self, n):

    n = "".join(reversed(n))
    s = 0
    l = len(n) - 1
    t = 0
    while True:
        bcpow = int(pow(self.BASE, l - t))
        s = s + self.ALPHABET.index(n[t:t+1]) * bcpow
        t += 1
        if t > l: break

    pad = self.MAXLEN - 1
    s = int(s - pow(self.BASE, pad))

    return int(s)

Noah Miller contributed a version based on Wessite's, and changed it so it can use a passkey, and rolled it into one function:

import math
import hashlib

ALPHABET = "abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ"

def alphaID(idnum, to_num=False, pad_up=False, passkey=None):
  index = ALPHABET
  if passkey:
    i = list(index)
    passhash = hashlib.sha256(passkey).hexdigest()
    passhash = hashlib.sha512(passkey).hexdigest() if len(passhash) < len(index) else passhash
    p = list(passhash)[0:len(index)]
    index = ''.join(zip(*sorted(zip(p,i)))[1])

  base = len(index)

  if to_num:
    idnum = idnum[::-1]
    out = 0
    length = len(idnum) -1
    t = 0
    while True:
      bcpow = int(pow(base, length - t))
      out = out + index.index(idnum[t:t+1]) * bcpow
      t += 1
      if t > length: break

    if pad_up:
      pad_up -= 1
      if pad_up > 0:
        out -= int(pow(base, pad_up))
  else:
    if pad_up:
      pad_up -= 1
      if pad_up > 0:
        idnum += int(pow(base, pad_up))

    out = []
    t = int(math.log(idnum, base))
    while True:
      bcp = int(pow(base, t))
      a = int(idnum / bcp) % base
      out.append(index[a:a+1])
      idnum = idnum - (a * bcp)
      t -= 1
      if t < 0: break

    out = ''.join(out[::-1])

  return out

HaXe Implementation

Thanks to Andy Li, there's a HaXe implementation.

/**
 *  HaXe version of AlphabeticID
 *  Author: Andy Li <andy@onthewings.net>
 *  ported from...
 *
 *  Javascript AlphabeticID class
 *  Author: Even Simon <even.simon@gmail.com>
 *  which is based on a script by Kevin van Zonneveld <kevin@vanzonneveld.net>)
 *
 *  Description: Translates a numeric identifier into a short string and backwords.
 *  https://kevin.vanzonneveld.net/techblog/article/create_short_ids_with_php_like_youtube_or_tinyurl/
 **/

class AlphaID {
    static public var index:String = 'abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ';

    static public function encode(_number:Int):String {
        var strBuf = new StringBuf();

        var i = 0;
        var end = Math.floor(Math.log(_number)/Math.log(index.length));
        while(i <= end) {
            strBuf.add(index.charAt((Math.floor(_number / bcpow(index.length, i++)) % index.length)));
        }

        return strBuf.toString();
    }

    static public function decode(_string:String):Int {
        var str = reverseString(_string);
        var ret = 0;

        var i = 0;
        var end = str.length - 1;
        while(i <= end) {
            ret += Std.int(index.indexOf(str.charAt(i)) * (bcpow(index.length, end-i)));
            ++i;
        }

        return ret;
    }

    inline static private function bcpow(_a:Float, _b:Float):Float {
        return Math.floor(Math.pow(_a, _b));
    }

    inline static private function reverseString(inStr:String):String {
        var ary = inStr.split("");
        ary.reverse();
        return ary.join("");
    }
}

Go Implementation

Thanks to Dinesh Appavoo, there's a Go implementation.

// Package basex generates alpha id (alphanumeric id) for big integers.  This
// is particularly useful for shortening URLs.
package basex

import (
	"math/big"
	"strconv"
)

var (
	dictionary = []byte{'0', '1', '2', '3', '4', '5', '6', '7', '8', '9', 'A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z', 'a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z'}
)

// Encode converts the big integer to alpha id (an alphanumeric id with mixed cases)
func Encode(val string) string {
	var result []byte
	var index int
	var strVal string

	base := big.NewInt(int64(len(dictionary)))
	a := big.NewInt(0)
	b := big.NewInt(0)
	c := big.NewInt(0)
	d := big.NewInt(0)

	exponent := 1

	remaining := big.NewInt(0)
	remaining.SetString(val, 10)

	for remaining.Cmp(big.NewInt(0)) != 0 {
		a.Exp(base, big.NewInt(int64(exponent)), nil) //16^1 = 16
		b = b.Mod(remaining, a)                       //119 % 16 = 7 | 112 % 256 = 112
		c = c.Exp(base, big.NewInt(int64(exponent-1)), nil)
		d = d.Div(b, c)

		//if d > dictionary.length, we have a problem. but BigInteger doesnt have
		//a greater than method :-(  hope for the best. theoretically, d is always
		//an index of the dictionary!
		strVal = d.String()
		index, _ = strconv.Atoi(strVal)
		result = append(result, dictionary[index])
		remaining = remaining.Sub(remaining, b) //119 - 7 = 112 | 112 - 112 = 0
		exponent = exponent + 1
	}

	//need to reverse it, since the start of the list contains the least significant values
	return string(reverse(result))
}

// Decode converts the alpha id to big integer
func Decode(s string) string {
	//reverse it, coz its already reversed!
	chars2 := reverse([]byte(s))

	//for efficiency, make a map
	dictMap := make(map[byte]*big.Int)

	j := 0
	for _, val := range dictionary {
		dictMap[val] = big.NewInt(int64(j))
		j = j + 1
	}

	bi := big.NewInt(0)
	base := big.NewInt(int64(len(dictionary)))

	exponent := 0
	a := big.NewInt(0)
	b := big.NewInt(0)
	intermed := big.NewInt(0)

	for _, c := range chars2 {
		a = dictMap[c]
		intermed = intermed.Exp(base, big.NewInt(int64(exponent)), nil)
		b = b.Mul(intermed, a)
		bi = bi.Add(bi, b)
		exponent = exponent + 1
	}
	return bi.String()
}

func reverse(bs []byte) []byte {
	for i, j := 0, len(bs)-1; i < j; i, j = i+1, j-1 {
		bs[i], bs[j] = bs[j], bs[i]
	}
	return bs
}

C++ Implementation

Thanks to Kay Makowsky, there's a C++ implementation.

//
//  ShortID.h
//  SocketServer
//
//  Created by Kay Makowsky on 16.06.16.
//  Copyright © 2016 Kay Makowsky. All rights reserved.
//

#ifndef ShortID_h
#define ShortID_h

#include <string>
#include <cmath>

const static std::string Alphabet = "abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ";

class ShortID
{
public:
    static double BcPow(double a, double b)
    {
        return std::floor(std::pow(a, b));
    }

    static long Decode(std::string value, int pad = 0)
    {
        long len = value.length() - 1;
        unsigned long result = 0;

        for (long t = len; t >= 0; t--)
        {
            unsigned long bcp = (unsigned long)BcPow(Alphabet.length(), len - t);
            result += (unsigned long)Alphabet.find(value[t]) * bcp;
        }

        if (pad > 0)
        {
            result -= (unsigned long)BcPow(Alphabet.length(), pad);
        }

        return result;
    }

    static std::string Encode(unsigned long value, int pad = 0)
    {
        std::string result = "";

        if (pad > 0)
        {
            value += (unsigned long)BcPow(Alphabet.length(), pad);
        }
        int lg = std::log(value) / std::log(Alphabet.length());
        for (int t = (value != 0 ? lg : 0); t >= 0; t--)
        {
            unsigned long bcp = (unsigned long)BcPow(Alphabet.length(), t);
            unsigned long a = ((unsigned long)std::floor((double)value / (double)bcp)) % (unsigned long)Alphabet.length();
            result += Alphabet[(int)a];
            value  = value - (a * bcp);
        }

        return result;
    }

private:
    static std::string reverseString(std::string value)
    {
        return std::string (value.rbegin(), value.rend());
    }
};

#endif

Legacy Comments (70)

These comments were imported from the previous blog system (Disqus).

Đỗ Nam Khánh
Đỗ Nam Khánh·

Thanks for nice post ^^

devnic
devnic·

Cool. The best part is that it is a small code that can make the url look more readable and pro.

Gerrit
Gerrit·

Great post: I got inspired and changed your code a bit to improve for speed by using int rather than float. This also fixes the rounding problems: try
[CODE=\"php\"]
var_dump(alphaID(alphaID(238328), true) == 238328);
[/CODE]
in your converter.

Follow my link for my version of the converter.

Kev van Zonneveld
Kev van Zonneveld·

@ Gerrit: very interesting, thanks for the heads-up!

Marcelo
Marcelo·

Very useful. Thanks!

Topbit
Topbit·

If you only want to use a-z & 0-9 (base 36), base_convert() will convert both ways easily and quickly.

[CODE=\"php\"]
$number = 999999;
$string = base_convert($number, $frombase = 10, 36);
$num = base_convert($string, $frombase = 36, 10);
if ($num == $number) echo \"Match!!\\n\";
[/CODE]

There is also a number of other functions there that will do larger number bases - such as 62, using similar techniques as the above post.

BnoL
BnoL·

Hi, nice tut.

But I think every encode script need to have a \"password key\" so that noone else can decode your ID :) (unless he/she knows your password key).

Kev van Zonneveld
Kev van Zonneveld·

@ BnoL: This is not intended for security. Pure usability. If you want security you should probably use a lookup table or use UUIDS for records directly.

Tanzmusik
Tanzmusik·

wow, just great.
The only improvement i mean is to modify the code by adding or removing some letters before use. If you do not modify, then everyone else can reveal your primary key structure.

Kev van Zonneveld
Kev van Zonneveld·

@ Tanzmusik: Well it was never the goal of this function to be used as a security measure, but Simon Franz was kind enough to provide a patch for that nonetheless:

http://github.com/kvz/kvzli...

Deadfish
Deadfish·

I think there is a bug in your code. I believe it should not be... $base = strlen($index); but $base = strlen($index) - 1; or else during encoding when it calls substr($index, $a, 1) $a would be the character after the last character. Apart from that great code :D

Deadfish
Deadfish·

The only example I can find of this bug happening is alphaID(238328);

Deadfish
Deadfish·

A proper bug fix...

replace log10($in) / log10($base) with log10($in, $base) and $a = floor($in / $bcp) with $a = floor($in / $bcp) % $base. That will fix the bug with alphaID(238328);

Kev van Zonneveld
Kev van Zonneveld·

@ Deadfish: Wow great stuff! I\'ve fixed the bug thanks!

http://github.com/kvz/kvzli...

Catalin
Catalin·

php must be compiled with --enable-bcmath configure option for this to work.

Kev van Zonneveld
Kev van Zonneveld·

@ Catalin: Good point. With my distro it\'s default, but that may not be the case for everybody. Thanks for sharing.

Ant Kutschera
Ant Kutschera·

hi,

ive done the same thing independently using java.

http://blog.maxant.co.uk/pe...

im not sure what this type of encoding is really called...

another application for it is where you want to provide users with a pin which they can share among friends. but you dont want anyone to guess the pin. so you take your primary key for the relevant thing which is being shared, and append a 4 digit random number the the end, before encoding your big number. the PIN you distribute is the encoded shorter version.

Kev van Zonneveld
Kev van Zonneveld·

@ Ant Kutschera: Cool to see someone tackling the exact same problem in a completely different part of the world & programming community :) Thanks for leaving a note man.

William
William·

Nice script!

I got 1 small advise.. Remove all vouwels (a, e, o, u, i) from $index, otherwise one day one of your customers will ask you why his username (or whatever) is \'penis\' (or another unfriendly/dirty word) ;-).

Even Simon
Even Simon·

Hi there,

I\'ve used your code for my website, let me say this is some excellent work. Anyhow I also needed the same functionality on the client-side (JavaScript) so I had to write my own. Here it is:

[CODE=\"Javascript\"]
/**
* Javascript AlphabeticID class
* (based on a script by Kevin van Zonneveld <kevin@vanzonneveld.net>)
*
* Author: Even Simon <even.simon@gmail.com>
*
* Description: Translates a numeric identifier into a short string and backwords.
*
* Usage:
* var str = AlphabeticID.encode(9007199254740989); // str = \'fE2XnNGpF\'
* var id = AlphabeticID.decode(\'fE2XnNGpF\'); // id = 9007199254740989;
**/

var AlphabeticID = {
index:\'abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ\',

/**
* @function AlphabeticID.encode
* @description Encode a number into short string
* @param integer
* @return string
**/
encode:function(_number){
if(\'undefined\' == typeof _number){
return null;
}
else if(\'number\' != typeof(_number)){
throw new Error(\'Wrong parameter type\');
}

var ret = \'\';

for(var i=Math.floor(Math.log(parseInt(_number))/Math.log(AlphabeticID.index.length));i>=0;i--){
ret = ret + AlphabeticID.index.substr((Math.floor(parseInt(_number) / AlphabeticID.bcpow(AlphabeticID.index.length, i)) % AlphabeticID.index.length),1);
}

return ret.reverse();
},

/**
* @function AlphabeticID.decode
* @description Decode a short string and return number
* @param string
* @return integer
**/
decode:function(_string){
if(\'undefined\' == typeof _string){
return null;
}
else if(\'string\' != typeof _string){
throw new Error(\'Wrong parameter type\');
}

var str = _string.reverse();
var ret = 0;

for(var i=0;i<=(str.length - 1);i++){
ret = ret + AlphabeticID.index.indexOf(str.substr(i,1)) * (AlphabeticID.bcpow(AlphabeticID.index.length, (str.length - 1) - i));
}

return ret;
},

/**
* @function AlphabeticID.bcpow
* @description Raise _a to the power _b
* @param float _a
* @param integer _b
* @return string
**/
bcpow:function(_a, _b){
return Math.floor(Math.pow(parseFloat(_a), parseInt(_b)));
}
};

/**
* @function String.reverse
* @description Reverse a string
* @return string
**/
String.prototype.reverse = function(){
return this.split(\'\').reverse().join(\'\');
};
[/CODE]

Plus I\'ve rewritten your PHP function into a PHP class to make it more suitable for my code:

[CODE=\"php\"]
<?php
class AlphabeticID{
const index = \"abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ\";

public static function encode($number){
if(!isset($number)){
return null;
}

$base = strlen(self::index);
$out = \"\";
for ($t = floor(log($number, $base)); $t >= 0; $t--) {
$bcp = bcpow($base, $t);
$a = floor($number / $bcp) % $base;
$out = $out . substr(self::index, $a, 1);
$number = $number - ($a * $bcp);
}
$out = strrev($out);
return $out;
}

public static function decode($string){
$base = strlen(self::index);
$string = strrev($string);
$out = 0;
$len = strlen($string) - 1;

for ($t = 0; $t <= $len; $t++){
$bcpow = bcpow($base, $len - $t);
$out = $out + strpos(self::index, substr($string, $t, 1)) * $bcpow;
}

$out = sprintf(\'%F\', $out);
$out = substr($out, 0, strpos($out, \'.\'));
return $out;
}
}
?>
[/CODE]

Have a nice day. $))

-Simon

Kev van Zonneveld
Kev van Zonneveld·

@ William: Fair point : ) Thanks for this.

@ Even Simon: Good stuff man!

@ both: I\'ll update the article with clear references to your comments soon

William
William·

Hey Kevin and others!

In my last comment I suggested to remove all vowels from $index to prevent unfriendly / dirty words. With that in mind I would like to add another suggestion: to make the string at least 5 characters long to prevent getting id\'s like: \'wtf\', \'nsb\', \'nsfw\', etc ;-)

I also made a PostgreSQL version of Kevin\'s script :-)

It doesn\'t create a random unique string, but it will convert numbers to a string. It\'s an edited version of base64 encoding (URL save version without characters like: \'/\' and \'=\' ).

I hope you people find any use for it.

<?php

CREATE OR REPLACE FUNCTION string_to_bits(input_text TEXT)
RETURNS TEXT AS $$
DECLARE
output_text TEXT;
i INTEGER;
BEGIN
output_text := \'\';

FOR i IN 1..char_length(input_text) LOOP
output_text := output_text || ascii(substring(input_text FROM i FOR 1))::bit(8);
END LOOP;

return output_text;
END;
$$ LANGUAGE plpgsql;

CREATE OR REPLACE FUNCTION id_to_sid(id INTEGER)
RETURNS TEXT AS $$
DECLARE
output_text TEXT;
i INTEGER;
index TEXT[];
bits TEXT;
bit_array TEXT[];
input_text TEXT;
BEGIN
input_text := id::TEXT;
output_text := \'\';
index := string_to_array(\'0,d,A,3,E,z,W,m,D,S,Q,l,K,s,P,b,N,c,f,j,5,I,t,C,i,y,o,G,2,r,x,h,V,J,k,-,T,w,H,L,9,e,u,X,p,U,a,O,v,4,R,B,q,M,n,g,1,F,6,Y,_,8,7,Z\', \',\');

bits := string_to_bits(input_text);

IF length(bits) % 6 <> 0 THEN
bits := rpad(bits, length(bits) + 6 - (length(bits) % 6), \'0\');
END IF;

FOR i IN 1..((length(bits) / 6)) LOOP
IF i = 1 THEN
bit_array[i] := substring(bits FROM 1 FOR 6);
ELSE
bit_array[i] := substring(bits FROM 1 + (i - 1) * 6 FOR 6);
END IF;

output_text := output_text || index[bit_array[i]::bit(6)::integer + 1];
END LOOP;

return output_text;
END;
$$ LANGUAGE plpgsql;

?>

Have a nice day!

William

Kev van Zonneveld
Kev van Zonneveld·

@ William: Hey there! Good stuff man! I\'ve updated the article to reflect your awesome contributions!

wessite
wessite·

Thanks for this article, the code was very useful!
I needed this in python to use on Google Appengine, here\'s the code:

[CODE=\"Python\"]
ALPHABET = \"bcdfghjklmnpqrstvwxyz0123456789BCDFGHJKLMNPQRSTVWXYZ\"
BASE = len(ALPHABET)
MAXLEN = 6

def encode_id(self, n):

pad = self.MAXLEN - 1
n = int(n + pow(self.BASE, pad))

s = []
t = int(math.log(n, self.BASE))
while True:
bcp = int(pow(self.BASE, t))
a = int(n / bcp) % self.BASE
s.append(self.ALPHABET[a:a+1])
n = n - (a * bcp)
t -= 1
if t < 0: break

return \"\".join(reversed(s))

def decode_id(self, n):

n = \"\".join(reversed(n))
s = 0
l = len(n) - 1
t = 0
while True:
bcpow = int(pow(self.BASE, l - t))
s = s + self.ALPHABET.index(n[t:t+1]) * bcpow
t += 1
if t > l: break

pad = self.MAXLEN - 1
s = int(s - pow(self.BASE, pad))

return int(s)
[/CODE]

Kev van Zonneveld
Kev van Zonneveld·

@ wessite: How cool is that?? Thanks, Added! : )

Andy Li
Andy Li·

I made a haXe version, which can be compiled to swf/js/php/c++/neko :)

Here it is:
http://gist.github.com/358018

Thiet ke website
Thiet ke website·

Thanks for the nice code, i\'m very interesting :)

Kev van Zonneveld
Kev van Zonneveld·

@ Andy Li: Awesome, Added! :)

SD (Aspherical)
SD (Aspherical)·

Thanks so much for posting this, I used your digital number to alphabet letter encoding code section to build a custom link shortener that I use when Twittering my photoblog updates. Works like a charm!

Right now, I just put the encoded link in a database table, but I might explore using the decoder in the future.

esco
esco·

No return when I try for example [CODE=\"php\"]echo alphaID(238328, false);[/CODE]
When I try to debug a bit amateuristic, I think it breaks at: [CODE=\"php\"]$bcp = bcpow($base, $t);[/CODE]
Can someone help?

Webdesigner
Webdesigner·

How to add a secure Passwort, so that it\'s harder to get the ID from the String?

esco
esco·

I had to install bcmath extension. Works fine now. If anyone has the same problem, check your php configuration.

Kev van Zonneveld
Kev van Zonneveld·

@ SD (Aspherical): No problem

@ esco: Yeah you need the bcmath stuff.

@ Webdesigner: while it was not intended to encrypt or hide the actual id so much as to make it smaller/easier to read, Simon Franz patched the function to support passwords. the 4th argument.

Stephen
Stephen·

Gr8 script.

I thought I might add to it. If you have a short number and the pad_up say alphaID(\'9\', false, 7, \'passkeys\') you get a number like 366666P.
to avoid having something uniform like this I multiply the

(Digital number -->> alphabet letter code)

by a large prime number.. eg: $in = $in * 89527;

and you need to do the rev also so at the end of

(Digital number <<-- alphabet letter code):

$out = $out / 89527;

This will produce a more cryptic output like YDPH66P.

As for the prime number I just went and picked one from http://www.mathsisfun.com/n...

Tim
Tim·

Could you give an example of putting it all together with PHP/mySQL, including table structure?

I\'m building a new site and don\'t have big numbers yet. So if I set my ID column to make up a big number, such as INT(16) unsigned zerofill and INSERT a record, I get 0000000000000001 as an id. If I return that using the alphaID() function I get \'c\'. Hardly usable. :)

Are we supposed to add a second id column, alpha_id INT(16), instead and have PHP generate a random number, then INSERT IGNORE and hope that the number already isn\'t in the db? Or are we supposed to store the alphaID \"PpQXn7COf\" in the db and lookup on that? If so, doesn\'t that then negate the speed of a numerical lookup? Also, that just goes back to having PHP generate a rand number, right?

Kev van Zonneveld
Kev van Zonneveld·

@ Stephen: Not that I mind uniform results, but that\'s still an excellent tip I imagine many will find very useful. Thanks!

@ Tim: You\'re note supposed to do anything. But I would just use numeric indexes to keep the database speedy, and use appservers (easiest to scale in any infra) to care care of the conversion. If you don\'t want small alphaIds like \'c\', try the pad argument.

Martin
Martin·

Very nice.

Julien
Julien·

I tried PHP and Python solution but the short ID created with int ID are not the same.

cdub
cdub·

Thanks so much! I've done a lot of research on URL shorteners and I like you're implementation the best.

Ganesh
Ganesh·

shall we use base_convert?

cdub
cdub·

One more thing... I found an ASP.NET MVC url shortener similar to yours and thought it might be useful to link here, http://blog.andresays.org/2...

Iain
Iain·

I like this very much. But it gives an error result with large number. Here is an example:

[CODE="php"]
$num = 18446744073709551615; // mysql unsigned int

// print MOrkgbArGYv
$converted = alphaID($num);

// print 18446744073709551616
echo alphaID($converted, true);
[/CODE]

Notice there is a slight difference there on the result. Anyone can help on this issue?

Thanks.

Kev van Zonneveld
Kev van Zonneveld·

@ cdub: Thanks : ) Url doesn't seem to work anymore though :/

@ Iain: As stated in the function's comments, it's limited to relatively small numbers (9007199254740989 on 32bits systems). If you need anything bigger you'd best make some adjustments using the bc extension in PHP

John Smith
John Smith·

Thanks for sharing! Everything works perfect, except for that when the ID is 1, the code uses "b" not "a", and when the ID is 63, it uses "bb" not "aa". Is there any fix for that?

Kev van Zonneveld
Kev van Zonneveld·

@ John Smith: But then how would you encode 0? ; )

If you'll never use zeros, I guess an easy "fix" for that would be in your call: alphaID($mynumber-1).
Don't forget to reverse the decrement when you're decoding as well.

devashish gupta
devashish gupta·

i like.........

chrischris
chrischris·

Hi Kevin,

Thanks for a great script. Thanks!

Have you considered using the input as a string instead of an integer to overcome the limitation of a large integer in PHP?

"18446744073709551615" instead of 18446744073709551615.

I've been playing around with this encryption class by Tony Marston (http://www.tonymarston.co.u..., and it seems it works even with 18446744073709551615.

But the trick is you have to make sure the input is a string. If it's integer and the number is very large, PHP will convert it into a scientific notation e.g. 1.84467440E+12 first, and the encryption class will use this scientific notation instead of the actual number representation for conversion.

And you'd need to tweak the class so the resulted data is URL friendly. I just removed all the funny characters, except an empty space " " because it's used for padding. And then I added a code to replace a " " with a "_" to make it URL friendly. When I need to decode it, I just replace "_" back to " " before decoding it. It seems it works fine.

--

As for your alphaID script, is there a limitation with $pad_up? When I set it to 10, some numbers don't get converted back correctly:

[CODE="php"]
for($i = 0; $i < 100; $i++){
$alpha_id = alphaID($i, false, 10, "passtest");
$int_id = alphaID($alpha_id, true, 10, "passtest");
print "$i|$alpha_id|$int_id<br>";
}

Results:

Original int|alphaID|Converted int back

0|2222222229|0
1|2222222229|0
2|A222222229|2
3|R222222229|4
4|R222222229|4
5|R222222229|4
6|0222222229|6
7|W222222229|8
8|W222222229|8
9|W222222229|8
[/CODE]

Thanks.

franfran
franfran·

Thanks, this is a nice function. However, just found out that you must know the $pad_up value to decode. So the $pad_up is somehow become part of the $passKey.

If we want to encode a string with minimum length:

[CODE="php"]
alphaID(9007199254740989, false, 6);
[/CODE]

will return '9QFqnGSNRf'

[CODE="php"]
alphaID('9QFqnGSNRf', true);
[/CODE]

will get the WRONG thing 9007199599766240

You can only get back the original number if you know the $pad_up

[CODE="php"]
alphaID('9QFqnGSNRf', true, 6);
[/CODE]

will return 9007199254740989

Kev van Zonneveld
Kev van Zonneveld·

@ franfran: That's expected behavior. I guess if you take out the part where I reverse the string, it could work without knowing the padlength.

@ chrischris: I'll need to look into that

Dan
Dan·

That's a great function.
How many combinations are if I choose the string to be of length 3?

Is it more than 1 million?
Or how could I calculate.

Thanks.

sameer zafar
sameer zafar·

rehgsfgfh

sylvester javed
sylvester javed·

the best viewer page

রনি
রনি·

Really great function. Saved me a lot of coding. :)
Thanks...

sid
sid·

How do you think duplicate alphaid's can be avoided without correlating with a auto_increment field ?

Neil
Neil·

I also remove lowercase L, the number 1, the letter O and Zero, because they can be ambiguous if you need to physically type them.

Tim
Tim·

With smaller ids and padding this creates not so random hashes. Anyway around this?

2 = dbbbc
3 = fbbbc

dahuda
dahuda·

Code seems nice, will use it .. thnx in advance man. keep up the good work, it is appreciated..

Alessandro
Alessandro·

Hi, thank you for your nice script, i'm testing it with this script:

[CODE="php"]$test=Array();
for($i=1000000;$i<9007199254740992;$i++){
$id = alphaID($i, false, 8);
if (in_array($id, $test)) {
echo "Collision detected at $i with id $id\n";
}
if($i % 1000 == 0){
echo "i = $i\n";
}
array_push($test,$id);
}[/CODE]

and it seems that there are some collisions all starting with 0X, is there a starting number that can be considered "safe"? Because you won't create an id that corrispond to 2 rows in the db.

Best Regards

Romas
Romas·

C# code:

class ShortId
{
public string Alphabet = "abcdefghijklmnopqrstuvwxyz0123456789ABCDEFGHIJKLMNOPQRSTUVWXYZ";

public string Encode(Int64 Number)
{
string result = String.Empty;
for (int i = (int)Math.Floor(Math.Log(Number) / Math.Log(Alphabet.Length)); i >= 0; i--)
{
result += Alphabet.Substring((int)(Math.Floor(Number / BcPow(Alphabet.Length, i)) % Alphabet.Length), 1);
}
return ReverseString(result);
}

public Int64 Decode(string Id)
{
string str = ReverseString(Id);
Int64 result = 0;
int end = str.Length - 1;
for (int i = 0; i <= end; i++)
{
result = result + (Int64)(Alphabet.IndexOf(str.Substring(i, 1)) * BcPow(Alphabet.Length, end - i));
}
return result;
}

public double BcPow(double _a, double _b)
{
return Math.Floor(Math.Pow(_a, _b));
}

public string ReverseString(string s)
{
char[] arr = s.ToCharArray();
Array.Reverse(arr);
return new string(arr);
}
}

Rafique
Rafique·

Thnks alot bro!! it helps alot!! :)

Necessity
Necessity·

What inputs am I allowed to use if my output must be exactly a 6 digit number?

wouter2512
wouter2512·

@Alessandro

That aren't duplicates, but rather your checking is not in order. PHP interprets the values 0x... as hexadecimal and searches for the corresponding value. And as it happends it exists already... It also happens with strings starting with 1e... and so on.

Lan
Lan·

@ Dan: 238328

You can calculate this by
(length of $index) ^ (number of places)

so 62^3 = 238,328

Using 4 places would give you 14,776,336 numbers to work with

SaudSattar
SaudSattar·

Good

Leander
Leander·

to me this seems quite complicated.
why not just express your integer with a base of lets say 36?

[CODE ="PHP"]
$testId = 9999999999999;
$alphaId = base_convert($testId, 10, 36);
$originalInteger = intval($alphaId, 36);
[/CODE]

Renox
Renox·

@ Leander
nice ^^
typical for php btw.
Someone implements a big function of something, that already exists.

fetr
fetr·

very nice

Bill
Bill·

@Leander (cc @Renox)
Your solution not return a same length string for $alphaId.

HBR
HBR·

Hi,

Thank you for this useful function. I have a question about it (since i'm bad at maths !).

Actually, users in my application will encrypt IDs using a pass. If the pass is hardcoded by me and implicitly shared by all of them, the resulting encrypted alphid will look like "ab3g, cd3g, kj3g" for IDs like "8081, 8082, 8083".
I wouldn't like them to look like a sequence, it is better in my case to have "hJ39, cd3K, oPI3" for the same sequence of ID 8081, 8082, 8083...

So my question is, given an auto_increment ID in a mysql table, if I use a user-specific unique random pass key to encrypt each ID (and then decrypt it), is there a risk of collision ?

I hope my question is clear ...

Thanks

Cyrus David
Cyrus David·

What about creating unique ID's from alpha-numeric ones? I'm getting `A non well formed numeric value encountered` on your solution