My favorites | Sign in
Project Logo
                
Code license: New BSD License
Labels: php, text, statistics, webdesign
People details
Project owners:
  addedbytes
Project committers:
joel...@cyberone.com.au

Introduction

The PHP Text Statistics class will help you to indentify issues with your website content, especially with readability.

It allows you to measure the readability of text using common scoring systems, including:

This code was originally released on AddedBytes.com (formerly ILoveJackDaniels.com) in July 2004.

What's New?

In July 2008, I rewrote the code into a more up-to-date format (it was a mess before ... no structure, no validation ... awful). There are versions for both PHP5 and PHP4. The code was turned into this open source project and released under a New BSD License in August 2008.

The code that generates the statistics is in a simple class structure. There are also several unit test classes to ensure that changes made don't break existing functionality. There is also a live version of this tool.

One of the biggest challenges with measuring text readability is the counting of syllables, which can be tricky to work out. There are rules in the statistics file for working out the syllable count of words, and a large list of words to test these rules against.

Please feel free to add to the test word list, especially if you can find words whose syllable count is not correctly calculated (even more especially if you can also add code to the class so your word is correctly handled!).

Homographs are going to be impossible to calculate as they depend on context (i.e., "he moped around the house", "she rode her moped to school), but there are few enough of these not to be a concern.

Join!

The main reason this code has been released under an open source license is to allow others to improve it. Best places to start are probably the Google Group for discussions and the commit list to be notified of changes to the code and issues.









Hosted by Google Code