huangsam · 2018-05-26T18:46:14Z

The previous implementation used Linux regular expressions. This was sufficient as an MVP, but it was not accurate enough for corner cases. After some more research, it seemed that an HTML parsing library would be more effective for this purpose. As such, the shell command has been scrapped in favor of a more elaborate approach for detecting URLs.

- Implement skeleton for extract_urls
- Detect html and markdown files
- Use bs4 for parsing html
- Convert markdown for bs4 parsing
- Remove use of urlin.txt and urlout.txt
- Remove unnecessary global vars
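
For readers who have not opened the diff, the approach described above can be sketched roughly as follows. This is only an illustration under stated assumptions, not the code merged in this PR: it swaps in the standard library's `html.parser` for bs4 so it runs without extra dependencies, and `LinkCollector`, `detect_kind`, the `md_to_html` parameter, and the signature given to `extract_urls` are all hypothetical. The Markdown step is left as a caller-supplied converter (for example, a function from a Markdown library), since the PR only says that Markdown is converted before parsing.

```python
import os
from html.parser import HTMLParser


class LinkCollector(HTMLParser):
    """Collect href values from <a> tags (hypothetical helper, not from the PR)."""

    def __init__(self):
        super().__init__()
        self.urls = []

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag and attribute names before calling this.
        if tag == "a":
            for name, value in attrs:
                if name == "href" and value:
                    self.urls.append(value)


def detect_kind(filename):
    """Classify a file as 'html', 'markdown', or None based on its extension."""
    ext = os.path.splitext(filename)[1].lower()
    if ext in (".html", ".htm"):
        return "html"
    if ext in (".md", ".markdown"):
        return "markdown"
    return None


def extract_urls(filename, text, md_to_html=None):
    """Return link targets found in text, parsing it as HTML.

    md_to_html is an assumed callable that turns Markdown into HTML;
    it is only needed when the file is detected as Markdown.
    """
    kind = detect_kind(filename)
    if kind is None:
        return []  # neither HTML nor Markdown: nothing to scan
    if kind == "markdown":
        if md_to_html is None:
            raise ValueError("a Markdown-to-HTML converter is required")
        text = md_to_html(text)
    collector = LinkCollector()
    collector.feed(text)
    collector.close()
    return collector.urls
```

Parsing the markup instead of pattern-matching raw text means only real link attributes are collected, which is the kind of corner case the regular-expression version was missing.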

huangsam · 2018-05-26T18:46:46Z

mattmakai · 2018-05-26T21:50:29Z

Wow, this looks like a huge improvement. Thanks again for doing all this work @huangsam.

huangsam · 2018-05-26T22:32:50Z

My pleasure @mattmakai. It was great meeting you in person at PyCon 2018. I recall that you sent me an invitation to share my creation via http://twiliovoices.com - is it still possible?

mattmakai · 2018-05-27T13:38:13Z

huangsam mentioned this pull request May 26, 2018

Fix bad urls #167

Merged

mattmakai merged commit a5274df into mattmakai:master May 26, 2018

huangsam deleted the bugfix/url-discovery branch May 26, 2018 22:19

Improve url detection #166

mattmakai merged 1 commit into mattmakai:master from huangsam:bugfix/url-discovery
