技术控

    今日:94| 主题:58057
收藏本版 (1)
最新软件应用技术尽在掌握

[其他] Inspired by a HN comment, four half-star characters accepted by Unicode

[复制链接]
我没死掉就很好 投递于 2016-10-4 13:10:53
379 7
Star characters (☆★) have long been part of the Unicode standard, which means they can appear as characters in web pages, text, and email. But half-stars were missing, so they required special images or custom fonts. I recently co-wrote a proposal to add half-star characters to Unicode, and it was just  accepted  . In the next Unicode release, half-stars will be usable like any other text character. In this article, I discuss how I got these characters added to Unicode, and how you can add characters too.
   

Inspired by a HN comment, four half-star characters accepted by Unicode

Inspired by a HN comment, four half-star characters accepted by Unicode-1-技术控-everything,character,different,computers,recently

  Usage of the four different half-stars to express 3.5 of 5.
   Unicode is the computer standard that defines the characters that are used by almost every computer—this standard allows different computers to easily display text in almost every language, and with almost every symbol you might need. (Before Unicode, dealing with non-English text on computers was a mess.) But Unicode doesn't include everything. Last June, a comment on Hacker News complained that Unicode lacked the half-star character used in ratings and movie reviews:
  Until Unicode has a half-star character, it won't even be able to encode the average newspaper.
  I suggested that someone should propose the half-star to Unicode, but quickly realized that "someone" would be me. Since I had successfully proposed two symbols to Unicode earlier, I knew the process necessary to get the half-star added.
   A few years ago, a detailed article described how a couple people got power symbols added to Unicode . Adding a new character to Unicode is easier than most people think. You don't need to pay money, be part of a major company or join a committee. All you need to do is write a proposal explaining why the character is needed. If the Unicode Committee agrees, they'll approve your character for addition to Unicode.
   In 2015, I started programming the 1960s-eraIBM 1401 mainframe at the Computer History Museum . But when I wrote about the IBM 1401 system, I ran into a problem. This computer uses a 6-bit character set (the precursor to EBCDIC) with some strange characters . All these characters appeared in Unicode, with the exception of one: the Group Mark. I was a bit shocked that Unicode, with its 128,172 characters , lacked a character I needed. Having read about the power symbol team's success in adding characters, I figured it would be interesting to see if I could get the group mark character added to Unicode. I wrote a proposal , submitted it to Unicode, and at the next meeting it was approved .
   

Inspired by a HN comment, four half-star characters accepted by Unicode

Inspired by a HN comment, four half-star characters accepted by Unicode-2-技术控-everything,character,different,computers,recently

  The group mark character, from an IBM 705 computer manual (1959). Since Unicode lacked this character, you couldn't write this text on a modern computer.
   A few months later, I learned that the Bitcoin symbol was missing from Unicode. This was a surprising omission, since the Bitcoin symbol is widely used in the real world. The symbol had been rejected before , so I made a more thorough proposal in October 2015 with the enthusiastic support of /r/bitcoin and other Bitcoin groups. The Bitcoin symbol proposal was accepted by the Unicode Committee in November 2015.
   

Inspired by a HN comment, four half-star characters accepted by Unicode

Inspired by a HN comment, four half-star characters accepted by Unicode-3-技术控-everything,character,different,computers,recently

   The Bitcoin symbol on an IBM punched card. Mining Bitcoins on a punched card mainframe isn't practical, but was an interesting experiment.
   So when I saw the comment about half-stars on Hacker News, I figured it would be straightforward to get it accepted to Unicode. I wrote a proposal after discussion on HN and on the Unicode mailing list . The Unicode committee considered the proposal in August 2016, but to my surprise they had also received another half star proposal , so they decided to wait on a single proposal. It turned out that Andrew West had also written a proposal for half-stars, and we had both submitted proposals, unaware of the other. So Adam and I joined forces and made a combined proposal , which was accepted by the committee Sept 30, 2016.
  Why did we propose four different half-stars? We included both the outline half-star and solid half-star because both forms are commonly used. (I wasn't sure if the committee would consider these characters distinct enough to include both, but they did.) Right-to-left languages such as Hebrew do their star ratings right-to-left too (which was a bit of a surprise to me), so we included mirrored versions for RTL languages. Thus, the four different half-stars cover the range of uses.
   

Inspired by a HN comment, four half-star characters accepted by Unicode

Inspired by a HN comment, four half-star characters accepted by Unicode-4-技术控-everything,character,different,computers,recently

  Half-stars in Hebrew are written right-to-left. From Haaretz 2 November 2012, provided by Simon Montagu.
   If there's a character that you want to add to Unicode and it meets the requirements, you should write a proposal. It's a very interesting process. Make sure your character meets the criteria . In particular, you'll need to find a bunch of examples of the character used in text. The Unicode Committee isn't going to add a character just because you think it's cool. The power symbol team has lots of helpful advice on making a successful proposal. I'm also happy to offer advice if you're writing a proposal.
   I should mention that emojis have a totally different process , so don't argue that "since the poop emoji exists, my character should too". (The poop emoji :hankey: was added for backwards compatibility with Japanese mobile phones.) For emoji, expected popularity of the symbol is a major factor in acceptance. Regular Unicode, on the other hand, isn't concerned with popularity—historical scripts such as Tangut won't get a millionth the usage of a new emoji—but with existing usage in text. (Reading between the lines, I think a lot of the Unicode committee wishes they weren't in the emoji business at all.)
  Once a character is accepted, there's still a long road for it to appear in fonts and be usable. A new version of Unicode is released typically every June, so these characters will hopefully appear in Unicode 10.0 mid-2017. The Bitcoin community in particular has had to wait patiently since the Bitcoin symbol just missed the cutoff for Unicode 9.0, adding a year to the wait. But with any luck, next June you'll be able to use the group mark, Bitcoin symbol and half stars in web pages and text just like any other symbol.



上一篇:The Third Bricklayer
下一篇:What’s new in PostgreSQL 9.5
崔甜天 投递于 2016-10-5 09:36:49
楼主,约么?
回复 支持 反对

使用道具 举报

驱动人生 投递于 2016-10-5 09:57:10
别和我谈理想,戒了.
回复 支持 反对

使用道具 举报

亿帮 投递于 2016-10-7 02:26:35
白富美?高富帅?
回复 支持 反对

使用道具 举报

sjsacjfgtic 投递于 2016-10-11 22:31:19
sjsacjfgtic就喜欢这种人少的贴子,沉了感觉是我弄沉的,火了就占了前排,说不准还遇到好心人把我给粉了 稳赚不赔啊!
回复 支持 反对

使用道具 举报

天猫小虫虫 投递于 2016-10-14 02:21:31
想污染一个地方有两种方法:垃圾,或是钞票.
回复 支持 反对

使用道具 举报

小城城 投递于 2016-11-12 19:48:41
今天不想骂人!
回复 支持 反对

使用道具 举报

易访 投递于 2016-11-16 15:53:21
我用手机上酷辣虫,好好用
回复 支持 反对

使用道具 举报

我要投稿

推荐阅读


回页顶回复上一篇下一篇回列表
手机版/CoLaBug.com ( 粤ICP备05003221号 | 文网文[2010]257号 | 粤公网安备 44010402000842号 )

© 2001-2017 Comsenz Inc.

返回顶部 返回列表