const A_LCASE: u8 = 97;
const A_UCASE: u8 = 65;
const ALL_26_BITS_SET: u32 = 67108863;
pub fn is_pangram(sentence: &str) -> bool {
let mut letter_flags = 0;
for letter in sentence.chars() {
if letter >= 'a' && letter <= 'z' {
letter_flags |= 1 << (letter as u8 - A_LCASE);
} else if letter >= 'A' && letter <= 'Z' {
letter_flags |= 1 << (letter as u8 - A_UCASE);
}
}
letter_flags == ALL_26_BITS_SET
}
This solution uses the ASCII value of the letter to set the corresponding bit position.
First, some const
values are set.
These values will be used for readability in the body of the is_pangram
function.
The ASCII value for a
is 97
.
The ASCII value for A
is 65
.
The value for all of the rightmost 26
bits being set in a u32
is 67108863
.
- The
for
loop loops through the chars ofsentence
. We don't iterate by bytes because, as of this writing, some tests may include multi-byte characters insentence
. - Each letter is tested for being
a
throughz
orA
throughZ
. - If the lower-cased letter is subtracted by
a
, thena
will result in0
, because97
minus97
equals0
.z
would result in25
, because122
minus97
equals25
. Soa
would have1
shifted left 0 places (so not shifted at all) andz
would have1
shifted left 25 places. - If the upper-cased letter is subtracted by
A
, thenA
will result in0
, because65
minus65
equals0
.Z
would result in25
, because90
minus65
equals25
. SoA
would have1
shifted left 0 places (so not shifted at all) andZ
would have1
shifted left 25 places.
In that way, both a lower-cased z
and an upper-cased Z
can share the same position in the bit field.
So, for an unsigned thirty-two bit integer, if the values for a
and Z
were both set, the bits would look like
zyxwvutsrqponmlkjihgfedcba
00000010000000000000000000000001
We can use the bitwise OR operator to set the bit.
After the loop completes, the function returns true
if the letter_flags
value is the same value as when all of the rightmost 26
bits are set,
which is 67108863
.
This approach is relatively very fast compared with other approaches.