uiwebview - How to extract text contents from html like Read it later or InstaPaper Iphone app? -


i want extract main article content html on iphone app , show on textview or coretext.

read later , instapaper iphone apps have feature, after researching on web, still can't tell how this.

at moment, take text content html code, takes lots of no need contents too.

textarticle = [webview stringbyevaluatingjavascriptfromstring:@"document.body.innertext"]; 

this question wanted, sadly not iphone app.
instapaper-like algorithm

this open source kind of feature, not sure if can use iphone app. https://github.com/jiminoc/goose/wiki

it seems smartr provided api before, not available now. http://smartrmobi.blogspot.com/2011/02/smartr-api-withdrawn-until-further.html

maybe, easiest way article content xml element, guess.

i know start i'd appreciate suggestions.

thanks

after researching, seems can use api extract text contents web. means need access webpage after got url , render result again.

it slower using js script showed above because needs access web api read later , instapaper both using approach guess.

the followings web api found far.

http://viewtext.org/

this api has nice feature combines multi-page articles one. using api because of feature other api not have.

http://fivefilters.org/content-only/

great thing can buy script , set on own server.

*update*

it seems apps use "readability" or "instapaper" or "google" mobilizer parse text contents web.

among them, favorite "readability" parser @ moment, since doesn't come advertisement instapaper parser. (nothing wrong putting ads cover server cost though)

pocket provides article parser developers creating pocket integrated apps.


Comments

Popular posts from this blog

c# - how to write client side events functions for the combobox items -

exception - Python, pyPdf OCR error: pyPdf.utils.PdfReadError: EOF marker not found -