Extract just URL using Regular Expressions

20 views (last 30 days)
Marko Rajkovic
Marko Rajkovic on 15 Oct 2018
Answered: Arjun on 3 Feb 2025
Hello everyone, does anyone of you know how to extract just the urls? x should provide a 1X3 cell array containing: {'www.lichess.org'} {'www.bing.com'} {'www.google.com'}
regularexpression = '';
script = ['I <I>used</I> <B>to</B> visit <THE> website <A HREF=www.lichess.org>twice a day</A>. ' ...
'Visit <I>the website</I> <A HREF=www.bing.com>or alternatively</A> just <A HREF=www.google.com> for more information</A>.']
x = regexp(script,regularexpression,'match')

Answers (1)

Arjun
Arjun on 3 Feb 2025
In order to extract only URL's from a given text or scripts, you can construct regular expressions and then match them using "regexp" function in MATLAB.
You can refer to the following related resources for constructing your own regular expression based on your requirements:
Additionally, refer to this documentation link to have better understanding about the process of building a regular expression: https://www.mathworks.com/help/matlab/matlab_prog/regular-expressions.html
I hope this will help!

Categories

Find more on Characters and Strings in Help Center and File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!