encoding/base64: URLEncoding padding is optional #4237

gjemiller · 2012-10-11T22:13:08Z

It appears that under certain circumstances, padding characters are not needed in a
valid base64 encoding.  Specifically, when using the "url encoding" they are
optional.  This presents a problem when trying to decode base64 code from a third party
that omits them.

What steps will reproduce the problem?
http://play.golang.org/p/RWV_Nd34wr

// reproduced here:
package main

import (
    "bytes"
    "encoding/base64"
    "fmt"
    "io"
    "os"
)

func main() {
    decode("c3VyZS4=")    // example from wikipedia
    decode("c3VyZS4") // should be the same without padding
}

func decode(s string) {
    dec := base64.NewDecoder(base64.URLEncoding, bytes.NewBufferString(s))
    n, err := io.Copy(os.Stdout, dec)
    fmt.Printf("\n%d %v\n", n, err)
}


What is the expected output?
sure.
5 <nil>
sure.
5 <nil>

What do you see instead?
sure.
5 <nil>
sur
3 <nil>

minux · 2012-10-12T05:51:23Z

Comment 1:

it is optional in URLEncoding, if exists, '=' must be escaped to %3D

Labels changed: added priority-later, removed priority-triage.

Status changed to Accepted.

dsymonds · 2012-10-14T05:48:03Z

Comment 2:

encoding/base64 specifies that it follows RFC 4648, but there's no mention in that RFC
that padding is optional for the URL-safe alphabet. We could accept the lack of padding
silently, but then again it's pretty easy to add in calling code:
    if m := len(enc) % 4; m != 0 {
        enc += strings.Repeat("=", 4-m)
    }
For something so well-defined and easy to work around, I'm not sure whether we want to
get in the business of accepting potentially corrupt input silently.

Status changed to Thinking.

minux · 2012-10-14T05:59:50Z

Comment 3:

http://tools.ietf.org/html/rfc4648
section 5.
The pad character "=" is typically percent-encoded when used in an
   URI [9], but if the data length is known implicitly, this can be
   avoided by skipping the padding; see section 3.2.

dsymonds · 2012-10-14T06:08:18Z

Comment 4:

That refers to the encoding, and specifically when used in a URI. Indeed the section 3.2
that your quote refers to says
   In some circumstances, the use of padding ("=") in base-encoded data
   is not required or used.  In the general case, when assumptions about
   the size of transported data cannot be made, padding is required to
   yield correct decoded data.
   Implementations MUST include appropriate pad characters at the end of
   encoded data unless the specification referring to this document
   explicitly states otherwise.
encoding/base64 is a general-purpose package and doesn't know the context of where its
output will be used, so it should always be including the padding. It seems oddly
asymmetric to allow it to be absent when decoding.

minux · 2012-10-14T06:44:06Z

Comment 5:

I agree we can always encode with padding, but we'd better accept non-padded
data in decode.
also, because we always encode with padding, i think it's preferable the docs 
explicitly state that if used in URI, the caller should url.QueryEscape it.
the problem with reported program is that our Decoder silently truncate the
output without any error (except that n < len(input)); however, if we call
DecodeString or Decode directly, we get CorruptInputError.
This arises from the fact that we're using ReadAtLeast in Read. I'm not sure
this is correct.

dsymonds · 2012-10-14T10:47:12Z

Comment 6:

Well, that's what I was pondering. Some of our APIs are liberal in
what they accept (e.g. net/http), but some are not (flag, most of
encoding/*, etc.).
Even with the padding, isn't the output safe to use? It should be.
We shouldn't be throwing away errors, for sure. That sounds like a bug to me.

minux · 2012-10-14T11:06:12Z

Comment 7:

out of curiosity, i grepped the std library for ReadAtLeast, and except some tests,
only encoding/base32 and encoding/base64 use it.
Update: dec.Read() does return an error (i was wrong about this), it's
io.ErrUnexpectedEOF,
should it be a base64.CorruptInputError to match that of dec.DecodeString?

minux · 2012-10-14T13:19:55Z

Comment 8:

scratch the update in #7, I was confused by myself.
please See the behavior for yourself:
http://play.golang.org/p/BAnLexAKwr

dsymonds · 2012-11-06T02:56:58Z

Comment 9:

Labels changed: added go1.1maybe, removed go1.1.

robpike · 2013-03-07T22:24:10Z

Comment 10:

Labels changed: removed go1.1maybe.

rsc · 2013-07-30T15:12:14Z

Comment 11:

Labels changed: added go1.2maybe.

rsc · 2013-07-30T22:40:14Z

Comment 12:

Labels changed: added feature.

robpike · 2013-08-30T05:51:05Z

Comment 13:

Not for 1.2.

Labels changed: removed go1.2maybe.

dsymonds · 2013-08-30T05:55:39Z

Comment 14:

It's not even clear we want to do this at all.

Labels changed: added priority-someday, packagechange, removed priority-later.

rsc · 2013-12-04T01:50:32Z

Comment 15:

Labels changed: added repo-main.

gopherbot · 2013-12-19T22:29:44Z

Comment 16 by oleku.konko:

Is this change likely for 1.3 ?

dsymonds · 2013-12-19T23:56:20Z

Comment 17:

It seems unlikely.

Labels changed: added release-none.

gopherbot · 2014-06-03T13:48:25Z

Comment 18 by tais.hansen:

Decoding the Google OAuth2 JWT fails with the current go encoding/base64 implementation.
I worked around this by adding the following before decoding:
    if l := len(s) % 4; l > 0 {
        s += string([]byte{'=', '=', '='}[3-l:]) // or strings.Repeat("=", 4-l)
    }
Where "s" is the encoded string.
Info:
Google OAuth2 (OpenID) returns JWT (JSON Web Tokens) as the authentication result which
is "base64url" encoded with padding removed.
https://developers.google.com/accounts/docs/OAuth2Login#exchangecode
From the JWT specification:
Base64url Encoding
      Base64 encoding using the URL- and filename-safe character set
      defined in Section 5 of RFC 4648 [RFC4648], with all trailing '='
      characters omitted (as permitted by Section 3.2) and without the
      inclusion of any line breaks, white space, or other additional
      characters.
http://tools.ietf.org/html/draft-ietf-oauth-json-web-token-20

kennygrant · 2016-12-17T22:58:02Z

Perhaps this issue can be closed now?

The base64 pkg has changed a bit since it was filed. Specifically with this change in 2014:

2e0a1a7

RawURLEncoding can be used if you wish to decode data which uses no padding, so for the example given above, to avoid errors you can just use the appropriate raw encoding:

https://play.golang.org/p/skWYiMHk3j

bradfitz · 2016-12-17T23:07:26Z

@kennygrant, thanks! Closing.

gjemiller added Thinking priority-someday labels Jun 3, 2014

rsc added this to the Unplanned milestone Apr 10, 2015

rsc removed priority-someday labels Apr 10, 2015

bradfitz closed this as completed Dec 17, 2016

EmpireJones mentioned this issue Jul 26, 2017

Fix "illegal base64 data at input byte X" error SherClockHolmes/webpush-go#3

Closed

golang locked and limited conversation to collaborators Dec 17, 2017

gopherbot added the FrozenDueToAge label Dec 17, 2017

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

encoding/base64: URLEncoding padding is optional #4237

encoding/base64: URLEncoding padding is optional #4237

gjemiller commented Oct 11, 2012

minux commented Oct 12, 2012

dsymonds commented Oct 14, 2012

minux commented Oct 14, 2012

dsymonds commented Oct 14, 2012

minux commented Oct 14, 2012

dsymonds commented Oct 14, 2012

minux commented Oct 14, 2012

minux commented Oct 14, 2012

dsymonds commented Nov 6, 2012

robpike commented Mar 7, 2013

rsc commented Jul 30, 2013

rsc commented Jul 30, 2013

robpike commented Aug 30, 2013

dsymonds commented Aug 30, 2013

rsc commented Dec 4, 2013

gopherbot commented Dec 19, 2013

dsymonds commented Dec 19, 2013

gopherbot commented Jun 3, 2014

kennygrant commented Dec 17, 2016

bradfitz commented Dec 17, 2016

encoding/base64: URLEncoding padding is optional #4237

encoding/base64: URLEncoding padding is optional #4237

Comments

gjemiller commented Oct 11, 2012

minux commented Oct 12, 2012

dsymonds commented Oct 14, 2012

minux commented Oct 14, 2012

dsymonds commented Oct 14, 2012

minux commented Oct 14, 2012

dsymonds commented Oct 14, 2012

minux commented Oct 14, 2012

minux commented Oct 14, 2012

dsymonds commented Nov 6, 2012

robpike commented Mar 7, 2013

rsc commented Jul 30, 2013

rsc commented Jul 30, 2013

robpike commented Aug 30, 2013

dsymonds commented Aug 30, 2013

rsc commented Dec 4, 2013

gopherbot commented Dec 19, 2013

dsymonds commented Dec 19, 2013

gopherbot commented Jun 3, 2014

kennygrant commented Dec 17, 2016

bradfitz commented Dec 17, 2016